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Chapter 0 
Introduction and Review 


0.1 Introduction and Review 


If you drop your shoe and a coin side by side, they hit the ground at 
the same time. Why doesn’t the shoe get there first, since gravity is 
pulling harder on it? How does the lens of your eye work, and why 
do your eye’s muscles need to squash its lens into different shapes in 
order to focus on objects nearby or far away? These are the kinds 
of questions that physics tries to answer about the behavior of light 
and matter, the two things that the universe is made of. 


0.1.1 The scientific method 


Until very recently in history, no progress was made in answering 
questions like these. Worse than that, the wrong answers written 
by thinkers like the ancient Greek physicist Aristotle were accepted 
without question for thousands of years. Why is it that scientific 
knowledge has progressed more since the Renaissance than it had 
in all the preceding millennia since the beginning of recorded his- 
tory? Undoubtedly the industrial revolution is part of the answer. 


The Mars Climate Orbiter is pre- 
pared for its mission. The laws 
of physics are the same every- 
where, even on Mars, so the 
probe could be designed based 
on the laws of physics as discov- 
ered on earth. There is unfor- 
tunately another reason why this 
spacecraft is relevant to the top- 
ics of this chapter: it was de- 
stroyed attempting to enter Mars’ 
atmosphere because engineers 
at Lockheed Martin forgot to con- 
vert data on engine thrusts from 
pounds into the metric unit of 
force (newtons) before giving the 
information to NASA. Conver- 
sions are important! 
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theory 


experiment 


a/Science is a cycle of the- 
ory and experiment. 


b/A_ satirical 
alchemist’s laboratory. H. Cock, 
after a drawing by Peter Brueghel 
the Elder (16th century). 
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drawing of an 


Building its centerpiece, the steam engine, required improved tech- 
niques for precise construction and measurement. (Early on, it was 
considered a major advance when English machine shops learned to 
build pistons and cylinders that fit together with a gap narrower 
than the thickness of a penny.) But even before the industrial rev- 
olution, the pace of discovery had picked up, mainly because of the 
introduction of the modern scientific method. Although it evolved 
over time, most scientists today would agree on something like the 
following list of the basic principles of the scientific method: 


(1) Science is a cycle of theory and experiment. Scientific the- 
ories are created to explain the results of experiments that were 
created under certain conditions. A successful theory will also make 
new predictions about new experiments under new conditions. Even- 
tually, though, it always seems to happen that a new experiment 
comes along, showing that under certain conditions the theory is 
not a good approximation or is not valid at all. The ball is then 
back in the theorists’ court. If an experiment disagrees with the 
current theory, the theory has to be changed, not the experiment. 


(2) Theories should both predict and explain. The requirement of 
predictive power means that a theory is only meaningful if it predicts 
something that can be checked against experimental measurements 
that the theorist did not already have at hand. That is, a theory 
should be testable. Explanatory value means that many phenomena 
should be accounted for with few basic principles. If you answer 
every “why” question with “because that’s the way it is,” then your 
theory has no explanatory value. Collecting lots of data without 
being able to find any basic underlying principles is not science. 


(3) Experiments should be reproducible. An experiment should 
be treated with suspicion if it only works for one person, or only 
in one part of the world. Anyone with the necessary skills and 
equipment should be able to get the same results from the same 
experiment. This implies that science transcends national and eth- 
nic boundaries; you can be sure that nobody is doing actual science 
who claims that their work is “Aryan, not Jewish,” “Marxist, not 
bourgeois,” or “Christian, not atheistic.” An experiment cannot be 
reproduced if it is secret, so science is necessarily a public enterprise. 


As an example of the cycle of theory and experiment, a vital step 
toward modern chemistry was the experimental observation that the 
chemical elements could not be transformed into each other, e.g., 
lead could not be turned into gold. This led to the theory that 
chemical reactions consisted of rearrangements of the elements in 
different combinations, without any change in the identities of the 
elements themselves. The theory worked for hundreds of years, and 
was confirmed experimentally over a wide range of pressures and 
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temperatures and with many combinations of elements. Only in 
the twentieth century did we learn that one element could be trans- 
formed into one another under the conditions of extremely high 
pressure and temperature existing in a nuclear bomb or inside a star. 
That observation didn’t completely invalidate the original theory of 
the immutability of the elements, but it showed that it was only an 
approximation, valid at ordinary temperatures and pressures. 


self-check A 

A psychic conducts seances in which the spirits of the dead speak to 
the participants. He says he has special psychic powers not possessed 
by other people, which allow him to “channel” the communications with 
the spirits. What part of the scientific method is being violated here? 
> Answer, p. 866 


The scientific method as described here is an idealization, and 
should not be understood as a set procedure for doing science. Sci- 
entists have as many weaknesses and character flaws as any other 
group, and it is very common for scientists to try to discredit other 
people’s experiments when the results run contrary to their own fa- 
vored point of view. Successful science also has more to do with 
luck, intuition, and creativity than most people realize, and the 
restrictions of the scientific method do not stifle individuality and 
self-expression any more than the fugue and sonata forms stifled 
Bach and Haydn. There is a recent tendency among social scien- 
tists to go even further and to deny that the scientific method even 
exists, claiming that science is no more than an arbitrary social sys- 
tem that determines what ideas to accept based on an in-group’s 
criteria. I think that’s going too far. If science is an arbitrary social 
ritual, it would seem difficult to explain its effectiveness in building 
such useful items as airplanes, CD players, and sewers. If alchemy 
and astrology were no less scientific in their methods than chem- 
istry and astronomy, what was it that kept them from producing 
anything useful? 


Discussion Questions 

Consider whether or not the scientific method is being applied in the fol- 
lowing examples. If the scientific method is not being applied, are the 
people whose actions are being described performing a useful human 
activity, albeit an unscientific one? 


A Acupuncture is a traditional medical technique of Asian origin in 
which small needles are inserted in the patient’s body to relieve pain. 
Many doctors trained in the west consider acupuncture unworthy of ex- 
perimental study because if it had therapeutic effects, such effects could 
not be explained by their theories of the nervous system. Who is being 
more scientific, the western or eastern practitioners? 
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B Goethe, a German poet, is less well known for his theory of color. 
He published a book on the subject, in which he argued that scientific 
apparatus for measuring and quantifying color, such as prisms, lenses 
and colored filters, could not give us full insight into the ultimate meaning 
of color, for instance the cold feeling evoked by blue and green or the 
heroic sentiments inspired by red. Was his work scientific? 


C_ Achild asks why things fall down, and an adult answers “because of 
gravity.” The ancient Greek philosopher Aristotle explained that rocks fell 
because it was their nature to seek out their natural place, in contact with 
the earth. Are these explanations scientific? 


D Buddhism is partly a psychological explanation of human suffering, 
and psychology is of course a science. The Buddha could be said to 
have engaged in a cycle of theory and experiment, since he worked by 
trial and error, and even late in his life he asked his followers to challenge 
his ideas. Buddhism could also be considered reproducible, since the 
Buddha told his followers they could find enlightenment for themselves 
if they followed a certain course of study and discipline. ls Buddhism a 
scientific pursuit? 


0.1.2 What is physics? 


Given for one instant an intelligence which could comprehend all 
the forces by which nature is animated and the respective positions 
of the things which compose it...nothing would be uncertain, and 
the future as the past would be laid out before its eyes. 


Pierre Simon de Laplace 


Physics is the use of the scientific method to find out the basic 
principles governing light and matter, and to discover the implica- 
tions of those laws. Part of what distinguishes the modern outlook 
from the ancient mind-set is the assumption that there are rules by 
which the universe functions, and that those laws can be at least par- 
tially understood by humans. From the Age of Reason through the 
nineteenth century, many scientists began to be convinced that the 
laws of nature not only could be known but, as claimed by Laplace, 
those laws could in principle be used to predict everything about 
the universe’s future if complete information was available about 
the present state of all light and matter. In subsequent sections, 
I'll describe two general types of limitations on prediction using the 
laws of physics, which were only recognized in the twentieth century. 


Matter can be defined as anything that is affected by gravity, 
i.e., that has weight or would have weight if it was near the Earth 
or another star or planet massive enough to produce measurable 
gravity. Light can be defined as anything that can travel from one 
place to another through empty space and can influence matter, but 
has no weight. For example, sunlight can influence your body by 
heating it or by damaging your DNA and giving you skin cancer. 
The physicist’s definition of light includes a variety of phenomena 
that are not visible to the eye, including radio waves, microwaves, 
x-rays, and gamma rays. These are the “colors” of light that do not 
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happen to fall within the narrow violet-to-red range of the rainbow 
that we can see. 


self-check B 

At the turn of the 20th century, a strange new phenomenon was discov- 
ered in vacuum tubes: mysterious rays of unknown origin and nature. 
These rays are the same as the ones that shoot from the back of your 
TV’s picture tube and hit the front to make the picture. Physicists in 
1895 didn’t have the faintest idea what the rays were, so they simply 
named them “cathode rays,” after the name for the electrical contact 
from which they sprang. A fierce debate raged, complete with national- 
istic overtones, over whether the rays were a form of light or of matter. 
What would they have had to do in order to settle the issue? > 
Answer, p. 866 


Many physical phenomena are not themselves light or matter, 
but are properties of light or matter or interactions between light 
and matter. For instance, motion is a property of all light and some 
matter, but it is not itself light or matter. The pressure that keeps 
a bicycle tire blown up is an interaction between the air and the 
tire. Pressure is not a form of matter in and of itself. It is as 
much a property of the tire as of the air. Analogously, sisterhood 
and employment are relationships among people but are not people 
themselves. 


Some things that appear weightless actually do have weight, and 
so qualify as matter. Air has weight, and is thus a form of matter 
even though a cubic inch of air weighs less than a grain of sand. A 
helium balloon has weight, but is kept from falling by the force of the 
surrounding more dense air, which pushes up on it. Astronauts in 
orbit around the Earth have weight, and are falling along a curved 
arc, but they are moving so fast that the curved arc of their fall 
is broad enough to carry them all the way around the Earth in a 
circle. They perceive themselves as being weightless because their 
space capsule is falling along with them, and the floor therefore does 
not push up on their feet. 


Optional Topic: Modern Changes in the Definition of Light and 
Matter 

Einstein predicted as a consequence of his theory of relativity that light 
would after all be affected by gravity, although the effect would be ex- 
tremely weak under normal conditions. His prediction was borne out 
by observations of the bending of light rays from stars as they passed 
close to the sun on their way to the Earth. Einstein’s theory also implied 
the existence of black holes, stars so massive and compact that their 
intense gravity would not even allow light to escape. (These days there 
is strong evidence that black holes exist.) 


Einstein’s interpretation was that light doesn’t really have mass, but 
that energy is affected by gravity just like mass is. The energy in a light 
beam is equivalent to a certain amount of mass, given by the famous 
equation E = mc?, where c is the speed of light. Because the speed 
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c/ This telescope picture shows 
two images of the same distant 
object, an exotic, very luminous 
object called a quasar. This is 
interpreted as evidence that a 
massive, dark object, possibly 
a black hole, happens to be 
between us and it. Light rays that 
would otherwise have missed the 
earth on either side have been 
bent by the dark object’s gravity 
so that they reach us. The actual 
direction to the quasar is presum- 
ably in the center of the image, 
but the light along that central line 
doesn’t get to us because it is 
absorbed by the dark object. The 
quasar is known by its catalog 
number, MG1131+0456, or more 
informally as Einstein’s Ring. 
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of light is such a big number, a large amount of energy is equivalent to 
only a very small amount of mass, so the gravitational force on a light 
ray can be ignored for most practical purposes. 


There is however a more satisfactory and fundamental distinction 
between light and matter, which should be understandable to you if you 
have had a chemistry course. In chemistry, one learns that electrons 
obey the Pauli exclusion principle, which forbids more than one electron 
from occupying the same orbital if they have the same spin. The Pauli 
exclusion principle is obeyed by the subatomic particles of which matter 
is composed, but disobeyed by the particles, called photons, of which a 
molecule beam of light is made. 


Einstein’s theory of relativity is discussed more fully in book 6 of this 
series. 


The boundary between physics and the other sciences is not 
always clear. For instance, chemists study atoms and molecules, 
which are what matter is built from, and there are some scientists 
who would be equally willing to call themselves physical chemists 
or chemical physicists. It might seem that the distinction between 
physics and biology would be clearer, since physics seems to deal 
with inanimate objects. In fact, almost all physicists would agree 
that the basic laws of physics that apply to molecules in a test tube 
work equally well for the combination of molecules that constitutes 
a bacterium. (Some might believe that something more happens in 
the minds of humans, or even those of cats and dogs.) What differ- 
entiates physics from biology is that many of the scientific theories 
that describe living things, while ultimately resulting from the fun- 
damental laws of physics, cannot be rigorously derived from physical 
principles. 


Isolated systems and reductionism 


neutrons 


and protons To avoid having to study everything at once, scientists isolate the 
* things they are trying to study. For instance, a physicist who wants 
ann to study the motion of a rotating gyroscope would probably prefer 
_ r that it be isolated from vibrations and air currents. Even in biology, 
2 —_ where field work is indispensable for understanding how living things 
> eal relate to their entire environment, it is interesting to note the vital 
historical role played by Darwin’s study of the Galapagos Islands, 
which were conveniently isolated from the rest of the world. Any 
part of the universe that is considered apart from the rest can be 
called a “system.” 


Physics has had some of its greatest successes by carrying this 
process of isolation to extremes, subdividing the universe into smaller 
and smaller parts. Matter can be divided into atoms, and the be- 
havior of individual atoms can be studied. Atoms can be split apart 
into their constituent neutrons, protons and electrons. Protons and 
neutrons appear to be made out of even smaller particles called 
quarks, and there have even been some claims of experimental ev- 
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idence that quarks have smaller parts inside them. This method 
of splitting things into smaller and smaller parts and studying how 
those parts influence each other is called reductionism. The hope is 
that the seemingly complex rules governing the larger units can be 
better understood in terms of simpler rules governing the smaller 
units. To appreciate what reductionism has done for science, it is 
only necessary to examine a 19th-century chemistry textbook. At 
that time, the existence of atoms was still doubted by some, elec- 
trons were not even suspected to exist, and almost nothing was 
understood of what basic rules governed the way atoms interacted 
with each other in chemical reactions. Students had to memorize 
long lists of chemicals and their reactions, and there was no way to 
understand any of it systematically. Today, the student only needs 
to remember a small set of rules about how atoms interact, for in- 
stance that atoms of one element cannot be converted into another 
via chemical reactions, or that atoms from the right side of the pe- 
riodic table tend to form strong bonds with atoms from the left 
side. 


Discussion Questions 


A I’ve suggested replacing the ordinary dictionary definition of light 
with a more technical, more precise one that involves weightlessness. It’s 
still possible, though, that the stuff a lightbulb makes, ordinarily called 
“light,” does have some small amount of weight. Suggest an experiment 
to attempt to measure whether it does. 


B_ Heat is weightless (i.e., an object becomes no heavier when heated), 
and can travel across an empty room from the fireplace to your skin, 
where it influences you by heating you. Should heat therefore be con- 
sidered a form of light by our definition? Why or why not? 


C Similarly, should sound be considered a form of light? 


0.1.3 How to learn physics 


For as knowledges are now delivered, there is a kind of contract of 
error between the deliverer and the receiver; for he that delivereth 
knowledge desireth to deliver it in such a form as may be best be- 
lieved, and not as may be best examined; and he that receiveth 
knowledge desireth rather present satisfaction than expectant in- 


quiry. 
Francis Bacon 


Many students approach a science course with the idea that they 
can succeed by memorizing the formulas, so that when a problem 
is assigned on the homework or an exam, they will be able to plug 
numbers in to the formula and get a numerical result on their cal- 
culator. Wrong! That’s not what learning science is about! There 
is a big difference between memorizing formulas and understanding 
concepts. To start with, different formulas may apply in different 
situations. One equation might represent a definition, which is al- 
ways true. Another might be a very specific equation for the speed 
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of an object sliding down an inclined plane, which would not be true 
if the object was a rock drifting down to the bottom of the ocean. 
If you don’t work to understand physics on a conceptual level, you 
won't know which formulas can be used when. 


Most students taking college science courses for the first time 
also have very little experience with interpreting the meaning of an 
equation. Consider the equation w = A/h relating the width of a 
rectangle to its height and area. A student who has not developed 
skill at interpretation might view this as yet another equation to 
memorize and plug in to when needed. A slightly more savvy stu- 
dent might realize that it is simply the familiar formula A = wh 
in a different form. When asked whether a rectangle would have 
a greater or smaller width than another with the same area but 
a smaller height, the unsophisticated student might be at a loss, 
not having any numbers to plug in on a calculator. The more ex- 
perienced student would know how to reason about an equation 
involving division — if h is smaller, and A stays the same, then w 
must be bigger. Often, students fail to recognize a sequence of equa- 
tions as a derivation leading to a final result, so they think all the 
intermediate steps are equally important formulas that they should 
memorize. 


When learning any subject at all, it is important to become as 
actively involved as possible, rather than trying to read through 
all the information quickly without thinking about it. It is a good 
idea to read and think about the questions posed at the end of each 
section of these notes as you encounter them, so that you know you 
have understood what you were reading. 


Many students’ difficulties in physics boil down mainly to diffi- 
culties with math. Suppose you feel confident that you have enough 
mathematical preparation to succeed in this course, but you are 
having trouble with a few specific things. In some areas, the brief 
review given in this chapter may be sufficient, but in other areas 
it probably will not. Once you identify the areas of math in which 
you are having problems, get help in those areas. Don’t limp along 
through the whole course with a vague feeling of dread about some- 
thing like scientific notation. The problem will not go away if you 
ignore it. The same applies to essential mathematical skills that you 
are learning in this course for the first time, such as vector addition. 


Sometimes students tell me they keep trying to understand a 
certain topic in the book, and it just doesn’t make sense. The worst 
thing you can possibly do in that situation is to keep on staring 
at the same page. Every textbook explains certain things badly — 
even mine! — so the best thing to do in this situation is to look 
at a different book. Instead of college textbooks aimed at the same 
mathematical level as the course you’re taking, you may in some 
cases find that high school books or books at a lower math level 
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give clearer explanations. 


Finally, when reviewing for an exam, don’t simply read back 
over the text and your lecture notes. Instead, try to use an active 
method of reviewing, for instance by discussing some of the discus- 
sion questions with another student, or doing homework problems 
you hadn’t done the first time. 


0.1.4 Velocity and acceleration 


Calculus was invented by a physicist, Isaac Newton, because 
he needed it as a tool for calculating velocity and acceleration; in 
your introductory calculus course, velocity and acceleration were 
probably presented as some of the first applications. 


If an object’s position as a function of time is given by the func- 
tion x(t), then its velocity and acceleration are given by the first 
and second derivatives with respect to time, 


ya 
dt 


and 


dt? 


The notation relates in a logical way to the units of the quantities. 
Velocity has units of m/s, and that makes sense because dz is inter- 
preted as an infinitesimally small distance, with units of meters, and 
dt as an infinitesimally small time, with units of seconds. The seem- 
ingly weird and inconsistent placement of the superscripted twos in 
the notation for the acceleration is likewise meant to suggest the 
units: something on top with units of meters, and something on the 
bottom with units of seconds squared. 


Velocity and acceleration have completely different physical in- 
terpretations. Velocity is a matter of opinion. Right now as you sit 
in a chair and read this book, you could say that your velocity was 
zero, but an observer watching the Earth rotate would say that you 
had a velocity of hundreds of miles an hour. Acceleration represents 
a change in velocity, and it’s not a matter of opinion. Accelerations 
produce physical effects, and don’t occur unless there’s a force to 
cause them. For example, gravitational forces on Earth cause falling 
objects to have an acceleration of 9.8 m/s?. 


Constant acceleration example 1 
> How high does a diving board have to be above the water if the 
diver is to have as much as 1.0 s in the air? 


> The diver starts at rest, and has an acceleration of 9.8 m/s?. 
We need to find a connection between the distance she travels 
and time it takes. In other words, we’re looking for information 


Section 0.1 —_ Introduction and Review 


21 


22 


Chapter 0 


about the function x(t), given information about the acceleration. 
To go from acceleration to position, we need to integrate twice: 


x= [ [ adtat 


= [tats Vo) dt [Vo is a constant of integration.] 

- [at [Vo is zero because she’s dropping from rest.] 
= sat? +X. [Xp is a constant of integration] 

= sat? [X) can be zero if we define it that way. ] 


Note some of the good problem-solving habits demonstrated here. 
We solve the problem symbolically, and only plug in numbers at 
the very end, once all the algebra and calculus are done. One 
should also make a habit, after finding a symbolic result, of check- 
ing whether the dependence on the variables make sense. A 
greater value of fin this expression would lead to a greater value 
for x; that makes sense, because if you want more time in the 
air, you're going to have to jump from higher up. A greater ac- 
celeration also leads to a greater height; this also makes sense, 
because the stronger gravity is, the more height you'll need in or- 
der to stay in the air for a given amount of time. Now we plug in 
numbers. 


x= ; (9.8 m/s?) (1.0 s) 


=4.9m 


Note that when we put in the numbers, we check that the units 
work out correctly, (m/s?) (s)* = m. We should also check that 
the result makes sense: 4.9 meters is pretty high, but not unrea- 
sonable. 


The notation dq in calculus represents an infinitesimally small 
change in the variable gq. The corresponding notation for a finite 
change in a variable is Aq. For example, if q represents the value 
of a certain stock on the stock market, and the value falls from 
qo = 5 dollars initially to qf = 3 dollars finally, then Aq = —2 
dollars. When we study linear functions, whose slopes are constant, 
the derivative is synonymous with the slope of the line, and dy/dx 
is the same thing as Ay/Az, the rise over the run. 


Under conditions of constant acceleration, we can relate velocity 
and time, 


_ Av 
“Tt ; 


or, as in the example 1, position and time, 


1 2 
5 + Uot + Lo 
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It can also be handy to have a relation involving velocity and posi- 
tion, eliminating time. Straightforward algebra gives 


vy = v2 + 2aAr ; 


where vy is the final velocity, vo the initial velocity, and Az the 
distance traveled. 


> Solved problem: Dropping a rock on Mars page 49, problem 17 


> Solved problem: The Dodge Viper page 50, problem 19 


0.1.5 Self-evaluation 


The introductory part of a book like this is hard to write, because 
every student arrives at this starting point with a different prepara- 
tion. One student may have grown up outside the U.S. and so may 
be completely comfortable with the metric system, but may have 
had an algebra course in which the instructor passed too quickly 
over scientific notation. Another student may have already taken 
vector calculus, but may have never learned the metric system. The 
following self-evaluation is a checklist to help you figure out what 
you need to study to be prepared for the rest of the course. 


If you disagree with this state- | you should study this section: 
ment... 


I am familiar with the basic metric | subsection 0.1.6 Basic of the Metric 
units of meters, kilograms, and sec- | System 

onds, and the most common metric 
prefixes: milli- (m), kilo- (k), and 
centi- (c). 

I am familiar with these less com- | subsection 0.1.7 Less Common Met- 
mon metric prefixes: mega- (M), | ric Prefixes 


micro- (4), and nano- (n). 
I am comfortable with scientific no- | subsection 0.1.8 Scientific Notation 
tation. 
I can confidently do metric conver- | subsection 0.1.9 Conversions 


sions. 
I understand the purpose and use of | subsection 0.1.10 Significant Figures 
significant figures. 


It wouldn’t hurt you to skim the sections you think you already 
know about, and to do the self-checks in those sections. 
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0.1.6 Basics of the metric system 
The metric system 


Units were not standardized until fairly recently in history, so 
when the physicist Isaac Newton gave the result of an experiment 
with a pendulum, he had to specify not just that the string was 37 
7/s inches long but that it was “37 ’/s London inches long.” The 
inch as defined in Yorkshire would have been different. Even after 
the British Empire standardized its units, it was still very inconve- 
nient to do calculations involving money, volume, distance, time, or 
weight, because of all the odd conversion factors, like 16 ounces in 
a pound, and 5280 feet in a mile. Through the nineteenth century, 
schoolchildren squandered most of their mathematical education in 
preparing to do calculations such as making change when a customer 
in a shop offered a one-crown note for a book costing two pounds, 
thirteen shillings and tuppence. The dollar has always been decimal, 
and British money went decimal decades ago, but the United States 
is still saddled with the antiquated system of feet, inches, pounds, 
ounces and so on. 


Every country in the world besides the U.S. uses a system of 
units known in English as the “metric system.!” This system is 
entirely decimal, thanks to the same eminently logical people who 
brought about the French Revolution. In deference to France, the 
system’s official name is the Systeme International, or SI, meaning 
International System. (The phrase “SI system” is therefore redun- 
dant.) 


The wonderful thing about the SI is that people who live in 
countries more modern than ours do not need to memorize how 
many ounces there are in a pound, how many cups in a pint, how 
many feet in a mile, etc. The whole system works with a single, 
consistent set of Greek and Latin prefixes that modify the basic 
units. Each prefix stands for a power of ten, and has an abbreviation 
that can be combined with the symbol for the unit. For instance, 
the meter is a unit of distance. The prefix kilo- stands for 10°, so a 
kilometer, 1 km, is a thousand meters. 


The basic units of the metric system are the meter for distance, 
the second for time, and the gram for mass. 


The following are the most common metric prefixes. You should 
memorize them. 


prefix meaning example 
kilo k 103 60 kg =a person’s mass 
centi- c 107? 28cm = height of a piece of paper 
mill- m 1073 lms = time for one vibration of a guitar 


string playing the note D 


‘Liberia and Myanmar have not legally adopted metric units, but use them 
in everyday life. 
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The prefix centi-, meaning 10~?, is only used in the centimeter; 
a hundredth of a gram would not be written as 1 cg but as 10 mg. 
The centi- prefix can be easily remembered because a cent is 10~? 
dollars. The official SI abbreviation for seconds is “s” (not “sec” ) 


OA) 


and grams are “g” (not “gm’). 


The second 


When I stated briefly above that the second was a unit of time, it 
may not have occurred to you that this was not much of a definition. 
We can make a dictionary-style definition of a term like “time,” or 
give a general description like Isaac Newton’s: “Absolute, true, and 
mathematical time, of itself, and from its own nature, flows equably 
without relation to anything external...” Newton’s characterization 
sounds impressive, but physicists today would consider it useless as 
a definition of time. Today, the physical sciences are based on oper- 
ational definitions, which means definitions that spell out the actual 
steps (operations) required to measure something numerically. 


In an era when our toasters, pens, and coffee pots tell us the 
time, it is far from obvious to most people what is the fundamental 
operational definition of time. Until recently, the hour, minute, and 
second were defined operationally in terms of the time required for 
the earth to rotate about its axis. Unfortunately, the Earth’s ro- 
tation is slowing down slightly, and by 1967 this was becoming an 
issue in scientific experiments requiring precise time measurements. 
The second was therefore redefined as the time required for a cer- 
tain number of vibrations of the light waves emitted by a cesium 
atoms in a lamp constructed like a familiar neon sign but with the 
neon replaced by cesium. The new definition not only promises to 
stay constant indefinitely, but for scientists is a more convenient 
way of calibrating a clock than having to carry out astronomical 
measurements. 


self-check C 
What is a possible operational definition of how strong apersonis? p> 
Answer, p. 866 


The meter 


The French originally defined the meter as 10~" times the dis- 
tance from the equator to the north pole, as measured through Paris 
(of course). Even if the definition was operational, the operation of 
traveling to the north pole and laying a surveying chain behind you 
was not one that most working scientists wanted to carry out. Fairly 
soon, a standard was created in the form of a metal bar with two 
scratches on it. This was replaced by an atomic standard in 1960, 
and finally in 1983 by the current definition, which is that the me- 
ter is the distance traveled by light in a vacuum over a period of 
(1/299792458) seconds. 


e/The _ original 
the meter. 
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f/A duplicate of the Paris 
kilogram, maintained at the Dan- 
ish National Metrology Institute. 


The kilogram 


The third base unit of the SI is the kilogram, a unit of mass. 
Mass is intended to be a measure of the amount of a substance, 
but that is not an operational definition. Bathroom scales work by 
measuring our planet’s gravitational attraction for the object being 
weighed, but using that type of scale to define mass operationally 
would be undesirable because gravity varies in strength from place 
to place on the earth. 


There’s a surprising amount of disagreement among physics text- 
books about how mass should be defined, but here’s how it’s actually 
handled by the few working physicists who specialize in ultra-high- 
precision measurements. They maintain a physical object in Paris, 
which is the standard kilogram, a cylinder made of platinum-iridium 
alloy. Duplicates are checked against this mother of all kilograms 
by putting the original and the copy on the two opposite pans of a 
balance. Although this method of comparison depends on gravity, 
the problems associated with differences in gravity in different geo- 
graphical locations are bypassed, because the two objects are being 
compared in the same place. The duplicates can then be removed 
from the Parisian kilogram shrine and transported elsewhere in the 
world. It would be desirable to replace this at some point with a 
universally accessible atomic standard rather than one based on a 
specific artifact, but as of 2010 the technology for automated count- 
ing of large numbers of atoms has not gotten good enough to make 
that work with the desired precision. 


Combinations of metric units 


Just about anything you want to measure can be measured with 
some combination of meters, kilograms, and seconds. Speed can be 
measured in m/s, volume in m?°, and density in kg/m®. Part of what 
makes the SI great is this basic simplicity. No more funny units like 
a cord of wood, a bolt of cloth, or a jigger of whiskey. No more 
liquid and dry measure. Just a simple, consistent set of units. The 
SI measures put together from meters, kilograms, and seconds make 
up the mks system. For example, the mks unit of speed is m/s, not 
km/hr. 


Checking units 


A useful technique for finding mistakes in one’s algebra is to 
analyze the units associated with the variables. 


‘Checking units example 2 
> Jae starts from the formula V = 2Ah for the volume of a cone, 
where A is the area of its base, and h is its height. He wants to 
find an equation that will tell him how tall a conical tent has to be 
in order to have a certain volume, given its radius. His algebra 
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goes like this: 


1 = —Ah 
3 
2 A=nr" 
3 V= anr?h 
2 
Tr 
4 h= — 
3V 


Is his algebra correct? If not, find the mistake. 


> Line 4 is supposed to be an equation for the height, so the units 
of the expression on the right-hand side had better equal meters. 
The pi and the 3 are unitless, so we can ignore them. In terms of 
units, line 4 becomes 


eee. 


m=— =— 
ms om 


This is false, so there must be a mistake in the algebra. The units 
of lines 1, 2, and 3 check out, so the mistake must be in the step 
from line 3 to line 4. In fact the result should have been 


3V 
hee 
mre 


Now the units check: m = m3/m?. 


Discussion Question 


A Isaac Newton wrote, “...the natural days are truly unequal, though 
they are commonly considered as equal, and used for a measure of 
time. . . lt may be that there is no such thing as an equable motion, whereby 
time may be accurately measured. All motions may be accelerated or re- 
tarded...” Newton was right. Even the modern definition of the second 
in terms of light emitted by cesium atoms is subject to variation. For in- 
stance, magnetic fields could cause the cesium atoms to emit light with 
a slightly different rate of vibration. What makes us think, though, that a 
pendulum clock is more accurate than a sundial, or that a cesium atom 
is a more accurate timekeeper than a pendulum clock? That is, how can 
one test experimentally how the accuracies of different time standards 
compare? 


Section 0.1. —_ Introduction and Review 


27 


~ ae little 


10-9 nano <—[ NUNS 


10-6 micro «| mix 
10-3 milli Se 
103 kilo es 


106 mega | MUQS. 


g/This is a mnemonic. to 
help you remember the most im- 
portant metric prefixes. The word 
“little” is to remind you that the 
list starts with the prefixes used 
for small quantities and builds 
upward. The exponent changes 
by 3, except that of course that 
we do not need a special prefix 
for 10°, which equals one. 
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0.1.7 Less common metric prefixes 


The following are three metric prefixes which, while less common 
than the ones discussed previously, are well worth memorizing. 


prefix meaning example 
mega- M_ 10° 6.4Mm  ~= radius of the earth 
micro- ys 10-6 10 wm = size of a white blood cell 
nano- n_ 10-9 0.154 nm = distance between carbon 


nuclei in an ethane molecule 


Note that the abbreviation for micro is the Greek letter mu, yu 
— a common mistake is to confuse it with m (milli) or M (mega). 


There are other prefixes even less common, used for extremely 
large and small quantities. For instance, 1 femtometer = 107! m is 
a convenient unit of distance in nuclear physics, and 1 gigabyte = 
10° bytes is used for computers’ hard disks. The international com- 
mittee that makes decisions about the SI has recently even added 
some new prefixes that sound like jokes, e.g., 1 yoctogram = 10774 g¢ 
is about half the mass of a proton. In the immediate future, how- 
ever, you’re unlikely to see prefixes like “yocto-” and “zepto-” used 
except perhaps in trivia contests at science-fiction conventions or 
other geekfests. 


self-check D 

Suppose you could slow down time so that according to your perception, 
a beam of light would move across a room at the speed of a slow walk. 
If you perceived a nanosecond as if it was a second, how would you 
perceive a microsecond? > Answer, p. 866 


0.1.8 Scientific notation 


Most of the interesting phenomena in our universe are not on 
the human scale. It would take about 1,000,000,000,000,000,000,000 
bacteria to equal the mass of a human body. When the physicist 
Thomas Young discovered that light was a wave, it was back in the 
bad old days before scientific notation, and he was obliged to write 
that the time required for one vibration of the wave was 1/500 of 
a millionth of a millionth of a second. Scientific notation is a less 
awkward way to write very large and very small numbers such as 
these. Here’s a quick review. 


Scientific notation means writing a number in terms of a product 
of something from 1 to 10 and something else that is a power of ten. 
For instance, 


s2 S52 410" 
$20 = 3.2 x10" 
3200 = 3.2 x 10° 
Each number is ten times bigger than the previous one. 


Since 10! is ten times smaller than 10? , it makes sense to use 
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the notation 10° to stand for one, the number that is in turn ten 
times smaller than 10! . Continuing on, we can write 10~! to stand 
for 0.1, the number ten times smaller than 10° . Negative exponents 
are used for small numbers: 


3.2 = 3.2 x 10° 
0.32 = 3.2 x 107! 
0.032 = 3.2 x 107? 


A common source of confusion is the notation used on the dis- 
plays of many calculators. Examples: 


3.2 x 10° (written notation) 
3.2E+6 (notation on some calculators) 
3:2? (notation on some other calculators) 


The last example is particularly unfortunate, because 3.2° really 
stands for the number 3.2 x 3.2 « 3.2 x 3.2 x 3.2 x 3.2 = 1074, a 
totally different number from 3.2 x 10° = 3200000. The calculator 
notation should never be used in writing. It’s just a way for the 
manufacturer to save money by making a simpler display. 


self-check E 
A student learns that 10* bacteria, standing in line to register for classes 
at Paramecium Community College, would form a queue of this size: 


——EE SSS 
The student concludes that 10? bacteria would form a line of this length: 


Why is the student incorrect? > Answer, p. 866 


0.1.9 Conversions 


Conversions are one of the three essential mathematical skills, 
summarized on pp.850-852, that you need for success in this course. 


I suggest you avoid memorizing lots of conversion factors be- 
tween SI units and U.S. units, but two that do come in handy are: 


1 inch = 2.54 cm 


An object with a weight on Earth of 2.2 pounds-force has a 
mass of 1 kg. 


The first one is the present definition of the inch, so it’s exact. The 
second one is not exact, but is good enough for most purposes. (U.S. 
units of force and mass are confusing, so it’s a good thing they’re 
not used in science. In U.S. units, the unit of force is the pound- 
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force, and the best unit to use for mass is the slug, which is about 
14.6 kg.) 


More important than memorizing conversion factors is under- 
standing the right method for doing conversions. Even within the 
SI, you may need to convert, say, from grams to kilograms. Differ- 
ent people have different ways of thinking about conversions, but 
the method I’ll describe here is systematic and easy to understand. 
The idea is that if 1 kg and 1000 g represent the same mass, then 
we can consider a fraction like 


10° g 
1 kg 


to be a way of expressing the number one. This may bother you. For 
instance, if you type 1000/1 into your calculator, you will get 1000, 
not one. Again, different people have different ways of thinking 
about it, but the justification is that it helps us to do conversions, 
and it works! Now if we want to convert 0.7 kg to units of grams, 
we can multiply kg by the number one: 


0.7 ke x ——2 
1 kg 


If yow’re willing to treat symbols such as “kg” as if they were vari- 
ables as used in algebra (which they’re really not), you can then 
cancel the kg on top with the kg on the bottom, resulting in 

10° ¢ 


0.7 ke x a? aga 


To convert grams to kilograms, you would simply flip the fraction 
upside down. 


One advantage of this method is that it can easily be applied to 
a series of conversions. For instance, to convert one year to units of 
seconds, 


,, 365 days | 24 hours | 60 min 60s _ 
1 year 1 day lheur limit — 
= 3,15 <10" 


Should that exponent be positive, or negative? 


A common mistake is to write the conversion fraction incorrectly. 
For instance the fraction 


10° kg 
Le 


(incorrect) 


does not equal one, because 10? kg is the mass of a car, and 1 g is 
the mass of a raisin. One correct way of setting up the conversion 
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factor would be 
1073 kg 
lg 
You can usually detect such a mistake if you take the time to check 
your answer and see if it is reasonable. 


(correct) 


If common sense doesn’t rule out either a positive or a negative 
exponent, here’s another way to make sure you get it right. There 
are big prefixes and small prefixes: 


big prefixes: k M 
small prefixes: m wp n 


(It’s not hard to keep straight which are which, since “mega” and 
“micro” are evocative, and it’s easy to remember that a kilometer 
is bigger than a meter and a millimeter is smaller.) In the example 
above, we want the top of the fraction to be the same as the bottom. 
Since & is a big prefix, we need to compensate by putting a small 
number like 107? in front of it, not a big number like 10°. 


> Solved problem: a simple conversion page 47, problem 6 


> Solved problem: the geometric mean page 47, problem 8 
Discussion Question 


A Each of the following conversions contains an error. In each case, 
explain what the error is. 


(a) 1000 kg x ggg = 19 


(b) 50m x 2" = 0.5m 
(c) “Nano” is 10-9, so there are 10-9 nm in a meter. 
(d) “Micro” is 10~®, so 1 kg is 10° yg. 


0.1.10 Significant figures 


An engineer is designing a car engine, and has been told that the 
diameter of the pistons (which are being designed by someone else) 
is 5 cm. He knows that 0.02 cm of clearance is required for a piston 
of this size, so he designs the cylinder to have an inside diameter of 
5.04 cm. Luckily, his supervisor catches his mistake before the car 
goes into production. She explains his error to him, and mentally 
puts him in the “do not promote” category. 


What was his mistake? The person who told him the pistons 
were 5 cm in diameter was wise to the ways of significant figures, 
as was his boss, who explained to him that he needed to go back 
and get a more accurate number for the diameter of the pistons. 
That person said “5 cm” rather than “5.00 cm” specifically to avoid 
creating the impression that the number was extremely accurate. In 
reality, the pistons’ diameter was 5.13 cm. They would never have 
fit in the 5.04-cm cylinders. 


The number of digits of accuracy in a number is referred to as 
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the number of significant figures, or “sig figs” for short. As in the 
example above, sig figs provide a way of showing the accuracy of a 
number. In most cases, the result of a calculation involving several 
pieces of data can be no more accurate than the least accurate piece 
of data. In other words, “garbage in, garbage out.” Since the 5 
cm diameter of the pistons was not very accurate, the result of the 
engineer’s calculation, 5.04 cm, was really not as accurate as he 
thought. In general, your result should not have more than the 
number of sig figs in the least accurate piece of data you started 
with. The calculation above should have been done as follows: 


5 cm (1 sig fig) 
+0.04 cm (1 sig fig) 
=. crt (rounded off to 1 sig fig) 


The fact that the final result only has one significant figure then 
alerts you to the fact that the result is not very accurate, and would 
not be appropriate for use in designing the engine. 


Note that the leading zeroes in the number 0.04 do not count 
as significant figures, because they are only placeholders. On the 
other hand, a number such as 50 cm is ambiguous — the zero could 
be intended as a significant figure, or it might just be there as a 
placeholder. The ambiguity involving trailing zeroes can be avoided 
by using scientific notation, in which 5 x 10! cm would imply one 
sig fig of accuracy, while 5.0 x 10! cm would imply two sig figs. 


self-check F 
The following quote is taken from an editorial by Norimitsu Onishi in the 
New York Times, August 18, 2002. 


Consider Nigeria. Everyone agrees it is Africa’s most populous 
nation. But what is its population? The United Nations says 
114 million; the State Department, 120 million. The World Bank 
says 126.9 million, while the Central Intelligence Agency puts it 
at 126,635,626. 


What should bother you about this? > Answer, p. 866 


Dealing correctly with significant figures can save you time! Of- 
ten, students copy down numbers from their calculators with eight 
significant figures of precision, then type them back in for a later 
calculation. That’s a waste of time, unless your original data had 
that kind of incredible precision. 


The rules about significant figures are only rules of thumb, and 
are not a substitute for careful thinking. For instance, $20.00 + 
$0.05 is $20.05. It need not and should not be rounded off to $20. 
In general, the sig fig rules work best for multiplication and division, 
and we also apply them when doing a complicated calculation that 
involves many types of operations. For simple addition and subtrac- 
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tion, it makes more sense to maintain a fixed number of digits after 
the decimal point. 


When in doubt, don’t use the sig fig rules at all. Instead, in- 
tentionally change one piece of your initial data by the maximum 
amount by which you think it could have been off, and recalculate 
the final result. The digits on the end that are completely reshuffled 
are the ones that are meaningless, and should be omitted. 

self-check G 

How many significant figures are there in each of the following mea- 
surements? 

(1) 9.937 m 

(2) 4.0s 


(3) 0.0000000000000037 kg > Answer, p. 866 
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a/ Amoebas 


this size 


seldom encountered. 
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are 


0.2 Scaling and Order-of-Magnitude Estimates 
0.2.1 Introduction 


Why can’t an insect be the size of a dog? Some skinny stretched- 
out cells in your spinal cord are a meter tall — why does nature 
display no single cells that are not just a meter tall, but a meter 
wide, and a meter thick as well? Believe it or not, these are questions 
that can be answered fairly easily without knowing much more about 
physics than you already do. The only mathematical technique you 
really need is the humble conversion, applied to area and volume. 


Area and volume 


Area can be defined by saying that we can copy the shape of 
interest onto graph paper with 1 cm x 1 cm squares and count the 
number of squares inside. Fractions of squares can be estimated by 
eye. We then say the area equals the number of squares, in units of 
square cm. Although this might seem less “pure” than computing 
areas using formulae like A = mr? for a circle or A = wh/2 for a 
triangle, those formulae are not useful as definitions of area because 
they cannot be applied to irregularly shaped areas. 


Units of square cm are more commonly written as cm? in science. 


Of course, the unit of measurement symbolized by “cm” is not an 
algebra symbol standing for a number that can be literally multiplied 
by itself. But it is advantageous to write the units of area that way 
and treat the units as if they were algebra symbols. For instance, 
if you have a rectangle with an area of 6m? and a width of 2 m, 
then calculating its length as (6 m?)/(2 m) = 3 m gives a result 
that makes sense both numerically and in terms of units. This 
algebra-style treatment of the units also ensures that our methods 
of converting units work out correctly. For instance, if we accept 


the fraction 
100 cm 


1m 
as a valid way of writing the number one, then one times one equals 
one, so we should also say that one can be represented by 
100 cm =: 100 cm 
1m lm 


which is the same as 
10000 cm? 


1 m? 
That means the conversion factor from square meters to square cen- 
timeters is a factor of 10+, i.e., a square meter has 10* square cen- 
timeters in it. 


All of the above can be easily applied to volume as well, using 
one-cubic-centimeter blocks instead of squares on graph paper. 


To many people, it seems hard to believe that a square meter 
equals 10000 square centimeters, or that a cubic meter equals a 
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million cubic centimeters — they think it would make more sense if 
there were 100 cm? in 1 m?, and 100 cm? in 1 m®, but that would be 
incorrect. The examples shown in figure b aim to make the correct 
answer more believable, using the traditional U.S. units of feet and 
yards. (One foot is 12 inches, and one yard is three feet.) 


1ft lyd=3ft Hg 
L | 


1 ft2 1 yd2 = 9 fi2 
1 yd3 = 27 f8 


self-check H 
Based on figure b, convince yourself that there are 9 ft? ina square yard, 
and 27 ft? in a cubic yard, then demonstrate the same thing symbolically 


(i.e., with the method using fractions that equal one). > Answer, p. 
866 

> Solved problem: converting mm? to cm? page 51, problem 31 
> Solved problem: scaling a liter page 52, problem 40 


Discussion Question 


A How many square centimeters are there in a square inch? (1 inch = 
2.54 cm) First find an approximate answer by making a drawing, then de- 
rive the conversion factor more accurately using the symbolic method. 


2 Scaling of area and volume 


Great fleas have lesser fleas 
Upon their backs to bite ’em. 
And lesser fleas have lesser still, 
And so ad infinitum. 


Jonathan Swift 


Now how do these conversions of area and volume relate to the 
questions I posed about sizes of living things? Well, imagine that 
you are shrunk like Alice in Wonderland to the size of an insect. 
One way of thinking about the change of scale is that what used 
to look like a centimeter now looks like perhaps a meter to you, 
because you’re so much smaller. If area and volume scaled according 
to most people’s intuitive, incorrect expectations, with 1 m? being 
the same as 100 cm?, then there would be no particular reason 
why nature should behave any differently on your new, reduced 
scale. But nature does behave differently now that you’re small. 
For instance, you will find that you can walk on water, and jump 
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b / Visualizing 


conversions 


of 


area and volume using traditional 


U.S. units. 
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c / Galileo Galilei (1564-1642). 


et 


d/The small boat holds up 
just fine. 


e/A larger boat built with 
the same proportions as the 
small one will collapse under its 
own weight. 


f/A boat this large needs to 
have timbers that are thicker 
compared to its size. 


36 Chapter 0 


to many times your own height. The physicist Galileo Galilei had 
the basic insight that the scaling of area and volume determines 
how natural phenomena behave differently on different scales. He 
first reasoned about mechanical structures, but later extended his 
insights to living things, taking the then-radical point of view that at 
the fundamental level, a living organism should follow the same laws 
of nature as a machine. We will follow his lead by first discussing 
machines and then living things. 


Galileo on the behavior of nature on large and small scales 


One of the world’s most famous pieces of scientific writing is 
Galileo’s Dialogues Concerning the Two New Sciences. Galileo was 
an entertaining writer who wanted to explain things clearly to laypeo- 
ple, and he livened up his work by casting it in the form of a dialogue 
among three people. Salviati is really Galileo’s alter ego. Simplicio 
is the stupid character, and one of the reasons Galileo got in trouble 
with the Church was that there were rumors that Simplicio repre- 
sented the Pope. Sagredo is the earnest and intelligent student, with 
whom the reader is supposed to identify. (The following excerpts 
are from the 1914 translation by Crew and de Salvio.) 


SAGREDO: Yes, that is what | mean; and | refer especially to 
his last assertion which | have always regarded as false. .. ; 
namely, that in speaking of these and other similar machines 
one cannot argue from the small to the large, because many 
devices which succeed on a small scale do not work ona 
large scale. Now, since mechanics has its foundations in ge- 
ometry, where mere size [ is unimportant], | do not see that 
the properties of circles, triangles, cylinders, cones and other 
solid figures will change with their size. If, therefore, a large 
machine be constructed in such a way that its parts bear to 
one another the same ratio as in a smaller one, and if the 
smaller is sufficiently strong for the purpose for which it is 
designed, | do not see why the larger should not be able to 
withstand any severe and destructive tests to which it may be 
subjected. 


Salviati contradicts Sagredo: 


SALVIATI: ... Please observe, gentlemen, how facts which 
at first seem improbable will, even on scant explanation, drop 
the cloak which has hidden them and stand forth in naked and 
simple beauty. Who does not know that a horse falling from a 
height of three or four cubits will break his bones, while a dog 
falling from the same height or a cat from a height of eight 
or ten cubits will suffer no injury? Equally harmless would be 
the fall of a grasshopper from a tower or the fall of an ant from 
the distance of the moon. 


The point Galileo is making here is that small things are sturdier 
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in proportion to their size. There are a lot of objections that could be 
raised, however. After all, what does it really mean for something to 
be “strong”, to be “strong in proportion to its size,” or to be strong 
“out of proportion to its size?” Galileo hasn’t given operational 
definitions of things like “strength,” i.e., definitions that spell out 
how to measure them numerically. 


Also, a cat is shaped differently from a horse — an enlarged 
photograph of a cat would not be mistaken for a horse, even if the 
photo-doctoring experts at the National Inquirer made it look like a 
person was riding on its back. A grasshopper is not even a mammal, 
and it has an exoskeleton instead of an internal skeleton. The whole 
argument would be a lot more convincing if we could do some iso- 
lation of variables, a scientific term that means to change only one 
thing at a time, isolating it from the other variables that might have 
an effect. If size is the variable whose effect we’re interested in see- 
ing, then we don’t really want to compare things that are different 
in size but also different in other ways. 


SALVIATI: —...we asked the reason why [shipbuilders] em- 
ployed stocks, scaffolding, and bracing of larger dimensions 
for launching a big vessel than they do for a small one; and 
[an old man] answered that they did this in order to avoid the 
danger of the ship parting under its own heavy weight, a dan- 
ger to which small boats are not subject? 


After this entertaining but not scientifically rigorous beginning, 
Galileo starts to do something worthwhile by modern standards. 
He simplifies everything by considering the strength of a wooden 
plank. The variables involved can then be narrowed down to the 
type of wood, the width, the thickness, and the length. He also 
gives an operational definition of what it means for the plank to 
have a certain strength “in proportion to its size,” by introducing 
the concept of a plank that is the longest one that would not snap 
under its own weight if supported at one end. If you increased 
its length by the slightest amount, without increasing its width or 
thickness, it would break. He says that if one plank is the same 
shape as another but a different size, appearing like a reduced or 
enlarged photograph of the other, then the planks would be strong 
“in proportion to their sizes” if both were just barely able to support 
their own weight. 
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h/ Galileo discusses planks 
made of wood, but the concept 
may be easier to imagine with 
clay. All three clay rods in the 
figure were originally the same 
shape. The medium-sized one 
was twice the height, twice the 
length, and twice the width of 
the small one, and similarly the 
large one was twice as big as 
the medium one in all its linear 
dimensions. The big one has 
four times the linear dimensions 
of the small one, 16 times the 
cross-sectional area when cut 
perpendicular to the page, and 
64 times the volume. That means 
that the big one has 64 times the 
weight to support, but only 16 
times the strength compared to 
the smallest one. 
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g/1. This plank is as long as it can be without collapsing under 
its own weight. If it was a hundredth of an inch longer, it would collapse. 
2. This plank is made out of the same kind of wood. It is twice as thick, 
twice as long, and twice as wide. It will collapse under its own weight. 


Also, Galileo is doing something that would be frowned on in 
modern science: he is mixing experiments whose results he has ac- 
tually observed (building boats of different sizes), with experiments 
that he could not possibly have done (dropping an ant from the 
height of the moon). He now relates how he has done actual ex- 
periments with such planks, and found that, according to this op- 
erational definition, they are not strong in proportion to their sizes. 
The larger one breaks. He makes sure to tell the reader how impor- 
tant the result is, via Sagredo’s astonished response: 


SAGREDO: My brain already reels. My mind, like a cloud 
momentarily illuminated by a lightning flash, is for an instant 
filled with an unusual light, which now beckons to me and 
which now suddenly mingles and obscures strange, crude 
ideas. From what you have said it appears to me impossible 
to build two similar structures of the same material, but of 
different sizes and have them proportionately strong. 


In other words, this specific experiment, using things like wooden 
planks that have no intrinsic scientific interest, has very wide impli- 
cations because it points out a general principle, that nature acts 
differently on different scales. 


To finish the discussion, Galileo gives an explanation. He says 
that the strength of a plank (defined as, say, the weight of the heav- 
iest boulder you could put on the end without breaking it) is pro- 
portional to its cross-sectional area, that is, the surface area of the 
fresh wood that would be exposed if you sawed through it in the 
middle. Its weight, however, is proportional to its volume.” 


How do the volume and cross-sectional area of the longer plank 
compare with those of the shorter plank? We have already seen, 


?Galileo makes a slightly more complicated argument, taking into account 
the effect of leverage (torque). The result I’m referring to comes out the same 
regardless of this effect. 
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while discussing conversions of the units of area and volume, that 
these quantities don’t act the way most people naively expect. You 
might think that the volume and area of the longer plank would both 
be doubled compared to the shorter plank, so they would increase 
in proportion to each other, and the longer plank would be equally 
able to support its weight. You would be wrong, but Galileo knows 
that this is a common misconception, so he has Salviati address the 
point specifically: 


SALVIATI: — ... Take, for example, a cube two inches on a 
side so that each face has an area of four square inches 
and the total area, i.e., the sum of the six faces, amounts 
to twenty-four square inches; now imagine this cube to be 
sawed through three times [with cuts in three perpendicular 
planes] so as to divide it into eight smaller cubes, each one 
inch on the side, each face one inch square, and the total 
surface of each cube six square inches instead of twenty- 
four in the case of the larger cube. It is evident therefore, 
that the surface of the little cube is only one-fourth that of 
the larger, namely, the ratio of six to twenty-four; but the vol- 
ume of the solid cube itself is only one-eighth; the volume, 
and hence also the weight, diminishes therefore much more 
rapidly than the surface... You see, therefore, Simplicio, that 
| was not mistaken when ...1 said that the surface of a small 
solid is comparatively greater than that of a large one. 


The same reasoning applies to the planks. Even though they 
are not cubes, the large one could be sawed into eight small ones, 
each with half the length, half the thickness, and half the width. 
The small plank, therefore, has more surface area in proportion to 
its weight, and is therefore able to support its own weight while the 
large one breaks. 


Scaling of area and volume for irregularly shaped objects 


You probably are not going to believe Galileo’s claim that this 
has deep implications for all of nature unless you can be convinced 
that the same is true for any shape. Every drawing you’ve seen so 
far has been of squares, rectangles, and rectangular solids. Clearly 
the reasoning about sawing things up into smaller pieces would not 
prove anything about, say, an egg, which cannot be cut up into eight 
smaller egg-shaped objects with half the length. 


Is it always true that something half the size has one quarter 
the surface area and one eighth the volume, even if it has an irreg- 
ular shape? Take the example of a child’s violin. Violins are made 
for small children in smaller size to accomodate their small bodies. 
Figure i shows a full-size violin, along with two violins made with 
half and 3/4 of the normal length.? Let’s study the surface area of 


’The customary terms “half-size” and “3/4-size” actually don’t describe the 


i/The area of a shape is 
proportional to the square of its 
linear dimensions, even if the 
shape is irregular. 
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j/The muffin comes out of 
the oven too hot to eat. Breaking 
it up into four pieces increases 
its surface area while keeping 
the total volume the same. It 
cools faster because of the 
greater surface-to-volume ratio. 
In general, smaller things have 
greater surface-to-volume ratios, 
but in this example there is no 
easy way to compute the effect 
exactly, because the small pieces 
aren't the same shape as the 
original muffin. 
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the front panels of the three violins. 


Consider the square in the interior of the panel of the full-size 
violin. In the 3/4-size violin, its height and width are both smaller 
by a factor of 3/4, so the area of the corresponding, smaller square 
becomes 3/4 3/4 = 9/16 of the original area, not 3/4 of the original 
area. Similarly, the corresponding square on the smallest violin has 
half the height and half the width of the original one, so its area is 
1/4 the original area, not half. 


The same reasoning works for parts of the panel near the edge, 
such as the part that only partially fills in the other square. The 
entire square scales down the same as a square in the interior, and 
in each violin the same fraction (about 70%) of the square is full, so 
the contribution of this part to the total area scales down just the 
same. 


Since any small square region or any small region covering part 
of a square scales down like a square object, the entire surface area 
of an irregularly shaped object changes in the same manner as the 
surface area of a square: scaling it down by 3/4 reduces the area by 
a factor of 9/16, and so on. 


In general, we can see that any time there are two objects with 
the same shape, but different linear dimensions (i.e., one looks like a 
reduced photo of the other), the ratio of their areas equals the ratio 
of the squares of their linear dimensions: 


Note that it doesn’t matter where we choose to measure the linear 
size, L, of an object. In the case of the violins, for instance, it could 
have been measured vertically, horizontally, diagonally, or even from 
the bottom of the left f-hole to the middle of the right f-hole. We 
just have to measure it in a consistent way on each violin. Since all 
the parts are assumed to shrink or expand in the same manner, the 
ratio L,/L» is independent of the choice of measurement. 


It is also important to realize that it is completely unnecessary 
to have a formula for the area of a violin. It is only possible to 
derive simple formulas for the areas of certain shapes like circles, 
rectangles, triangles and so on, but that is no impediment to the 
type of reasoning we are using. 


Sometimes it is inconvenient to write all the equations in terms 
of ratios, especially when more than two objects are being compared. 
A more compact way of rewriting the previous equation is 


Ax L? 


sizes in any accurate way. They’re really just standard, arbitrary marketing 
labels. 
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The symbol “x” means “is proportional to.” Scientists and engi- 
neers often speak about such relationships verbally using the phrases 


“scales like” or “goes like,” for instance “area goes like length squared. 


All of the above reasoning works just as well in the case of vol- 
ume. Volume goes like length cubed: 


V« LD 


self-check | 

When a car or truck travels over a road, there is wear and tear on the 
road surface, which incurs a cost. Studies show that the cost C per kilo- 
meter of travel is related to the weight per axle w by C « w’. Translate 
this into a statement about ratios. > Answer, p. ?? 


If different objects are made of the same material with the same 
density, p = m/V, then their masses, m = pV, are proportional to 
L°. (The symbol for density is p, the lower-case Greek letter “rho.” ) 


An important point is that all of the above reasoning about 
scaling only applies to objects that are the same shape. For instance, 
a piece of paper is larger than a pencil, but has a much greater 
surface-to-volume ratio. 


Scaling of the area of a triangle example 3 
> In figure k, the larger triangle has sides twice as long. How 
many times greater is its area? 


Correct solution #1: Area scales in proportion to the square of the 
linear dimensions, so the larger triangle has four times more area 
(22 = 4). 


Correct solution #2: You could cut the larger triangle into four of 
the smaller size, as shown in fig. (b), so its area is four times 
greater. (This solution is correct, but it would not work for a shape 
like a circle, which can’t be cut up into smaller circles.) 


Correct solution #3: The area of a triangle is given by 


A = bh/2, where b is the base and h is the height. The areas of 
the triangles are 


A; = bh; /2 
Ao = boho/2 
= (2b1)(2h1)/2 
= 2b,hy 
A2/A; = (2b; h;)/ (by hy /2) 
24 


(Although this solution is correct, it is a lot more work than solution 
#1, and it can only be used in this case because a triangle is a 
simple geometric shape, and we happen to know a formula for its 
area.) 


k/Example 3. The big trian- 
gle has four times more area than 
the little one. 


1/A tricky way of solving ex- 
ample 3, explained in solution #2. 
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Correct solution #4: The area of a triangle is A = bh/2. The 
comparison of the areas will come out the same as long as the 
ratios of the linear sizes of the triangles is as specified, so let’s 
just say 6; = 1.00 m and bo = 2.00 m. The heights are then also 
hy = 1.00 m and he = 2.00 m, giving areas A; = 0.50 m? and 
Ao = 2.00 m2, so A2/A; = 4.00. 


(The solution is correct, but it wouldn’t work with a shape for 
whose area we don’t have a formula. Also, the numerical cal- 
culation might make the answer of 4.00 appear inexact, whereas 
solution #1 makes it clear that it is exactly 4.) 


Incorrect solution: The area of a triangle is A = bh/2, and if you 
plug in b = 2.00 m and h = 2.00 m, you get A = 2.00 m2, so 
the bigger triangle has 2.00 times more area. (This solution is 
incorrect because no comparison has been made with the smaller 
triangle.) 


Scaling of the volume of a sphere example 4 
> In figure m, the larger sphere has a radius that is five times 
greater. How many times greater is its volume? 


Correct solution #1: Volume scales like the third power of the 
linear size, so the larger sphere has a volume that is 125 times 
greater (5° = 125). 


Correct solution #2: The volume of a sphere is V = (4/3)zr%, so 


© 


m/Example 4. The big sphere Vo 
has 125 times more volume than 
the little one. 


Vo/V4 = (AP) / (3x?) = 125 


Incorrect solution: The volume of a sphere is V = (4/3)zr3, so 


n/Example 5. The 48-point V2/V4 = (Fn) / (3-4) =5 


“S” has 1.78 times more area 3 


than the 36-point “S.” eae es . 
(The solution is incorrect because (5/)” is not the same as 5r7.) 
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Scaling of a more complex shape example 5 
> The first letter “S” in figure n is in a 36-point font, the second in 
48-point. How many times more ink is required to make the larger 
“S’? (Points are a unit of length used in typography.) 


Correct solution: The amount of ink depends on the area to be 
covered with ink, and area is proportional to the square of the 
linear dimensions, so the amount of ink required for the second 
“S” is greater by a factor of (48/36)? = 1.78. 


Incorrect solution: The length of the curve of the second “S” is 
longer by a factor of 48/36 = 1.33, so 1.33 times more ink is 
required. 


(The solution is wrong because it assumes incorrectly that the 
width of the curve is the same in both cases. Actually both the 
width and the length of the curve are greater by a factor of 48/36, 
so the area is greater by a factor of (48/36)* = 1.78.) 


Reasoning about ratios and proportionalities is one of the three 
essential mathematical skills, summarized on pp.850-852, that you 
need for success in this course. 


> Solved problem: a telescope gathers light page 51, problem 32 


> Solved problem: distance from an earthquake page 51, problem 33 


Discussion Questions 


A __ A toy fire engine is 1/30 the size of the real one, but is constructed 
from the same metal with the same proportions. How many times smaller 
is its weight? How many times less red paint would be needed to paint 
it? 

B Galileo spends a lot of time in his dialog discussing what really 
happens when things break. He discusses everything in terms of Aristo- 
tle’s now-discredited explanation that things are hard to break, because 
if something breaks, there has to be a gap between the two halves with 
nothing in between, at least initially. Nature, according to Aristotle, “ab- 
hors a vacuum,” i.e., nature doesn’t “like” empty space to exist. Of course, 
air will rush into the gap immediately, but at the very moment of breaking, 
Aristotle imagined a vacuum in the gap. Is Aristotle’s explanation of why 
it is hard to break things an experimentally testable statement? If so, how 
could it be tested experimentally? 


0.2.3 Order-of-magnitude estimates 


It is the mark of an instructed mind to rest satisfied with the degree 
of precision that the nature of the subject permits and not to seek 
an exactness where only an approximation of the truth is possible. 


Aristotle 


It is a common misconception that science must be exact. For 
instance, in the Star Trek TV series, it would often happen that 
Captain Kirk would ask Mr. Spock, “Spock, we’re in a pretty bad 
situation. What do you think are our chances of getting out of 
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o/Can you guess how many 
jelly beans are in the jar? If you 
try to guess directly, you will 
almost certainly underestimate. 
The right way to do it is to esti- 
mate the linear dimensions, then 
get the volume indirectly. See 
problem 44, p. 53. 


1m 


k—————> 


p/Consider a_ spherical cow. 
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here?” The scientific Mr. Spock would answer with something like, 
“Captain, I estimate the odds as 237.345 to one.” In reality, he 
could not have estimated the odds with six significant figures of 
accuracy, but nevertheless one of the hallmarks of a person with a 
good education in science is the ability to make estimates that are 
likely to be at least somewhere in the right ballpark. In many such 
situations, it is often only necessary to get an answer that is off by no 
more than a factor of ten in either direction. Since things that differ 
by a factor of ten are said to differ by one order of magnitude, such 
an estimate is called an order-of-magnitude estimate. The tilde, 
~, is used to indicate that things are only of the same order of 
magnitude, but not exactly equal, as in 


odds of survival ~ 100 to one 


The tilde can also be used in front of an individual number to em- 
phasize that the number is only of the right order of magnitude. 


Although making order-of-magnitude estimates seems simple and 
natural to experienced scientists, it’s a mode of reasoning that is 
completely unfamiliar to most college students. Some of the typical 
mental steps can be illustrated in the following example. 


Cost of transporting tomatoes (incorrect solution) example 6 
> Roughly what percentage of the price of a tomato comes from 
the cost of transporting it in a truck? 


> The following incorrect solution illustrates one of the main ways 
you can go wrong in order-of-magnitude estimates. 


Incorrect solution: Let’s say the trucker needs to make a $400 
profit on the trip. Taking into account her benefits, the cost of gas, 
and maintenance and payments on the truck, let’s say the total 
cost is more like $2000. I’d guess about 5000 tomatoes would fit 
in the back of the truck, so the extra cost per tomato is 40 cents. 
That means the cost of transporting one tomato is comparable to 
the cost of the tomato itself. Transportation really adds a lot to the 
cost of produce, | guess. 


The problem is that the human brain is not very good at esti- 
mating area or volume, so it turns out the estimate of 5000 tomatoes 
fitting in the truck is way off. That’s why people have a hard time 
at those contests where you are supposed to estimate the number of 
jellybeans in a big jar. Another example is that most people think 
their families use about 10 gallons of water per day, but in reality 
the average is about 300 gallons per day. When estimating area 
or volume, you are much better off estimating linear dimensions, 
and computing volume from the linear dimensions. Here’s a better 
solution to the problem about the tomato truck: 


‘Cost of transporting tomatoes (correct solution) example 7 
As in the previous solution, say the cost of the trip is $2000. The 


Introduction and Review 


dimensions of the bin are probably 4m x 2m x 1 m, for a vol- 
ume of 8 m?. Since the whole thing is just an order-of-magnitude 
estimate, let's round that off to the nearest power of ten, 10 m°. 
The shape of a tomato is complicated, and | don’t know any for- 
mula for the volume of a tomato shape, but since this is just an 
estimate, let’s pretend that a tomato is a cube, 0.05 m x 0.05m x 
0.05 m, for a volume of 1.25 x 10-4 m®. Since this is just a rough 
estimate, let’s round that to 10~-4m?. We can find the total num- 
ber of tomatoes by dividing the volume of the bin by the volume 
of one tomato: 10 m?/10-4 m® = 10° tomatoes. The transporta- 
tion cost per tomato is $2000/10° tomatoes=$0.02/tomato. That 
means that transportation really doesn’t contribute very much to 
the cost of a tomato. 


Approximating the shape of a tomato as a cube is an example of 
another general strategy for making order-of-magnitude estimates. 
A similar situation would occur if you were trying to estimate how 
many m? of leather could be produced from a herd of ten thousand 
cattle. There is no point in trying to take into account the shape of 
the cows’ bodies. A reasonable plan of attack might be to consider 
a spherical cow. Probably a cow has roughly the same surface area 
as a sphere with a radius of about 1 m, which would be 47(1 m)?. 
Using the well-known facts that pi equals three, and four times three 
equals about ten, we can guess that a cow has a surface area of about 
10 m?, so the herd as a whole might yield 10° m? of leather. 


‘Estimating mass indirectly example 8 
Usually the best way to estimate mass is to estimate linear di- 
mensions, then use those to infer volume, and then get the mass 
based on the volume. For example, Amphicoelias, shown in the 
figure, may have been the largest land animal ever to live. Fossils 
tell us the linear dimensions of an animal, but we can only indi- 
rectly guess its mass. Given the length scale in the figure, let’s 
estimate the mass of an Amphicoelias. 


Its torso looks like it can be approximated by a rectangular box 
with dimensions 10 mx 5 mx 3 m, giving about 2 x 10% m°. Living 
things are mostly made of water, so we assume the animal to 
have the density of water, 1 g/cm, which converts to 10° kg/m?. 
This gives a mass of about 2 x 10° kg, or 200 metric tons. 


The following list summarizes the strategies for getting a good 
order-of-magnitude estimate. 


Section 0.2 Scaling and Order-of-Magnitude Estimates 
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1. Don’t even attempt more than one significant figure of preci- 
sion. 


2. Don’t guess area, volume, or mass directly. Guess linear di- 
mensions and get area, volume, or mass from them. 


3. When dealing with areas or volumes of objects with complex 
shapes, idealize them as if they were some simpler shape, a 
cube or a sphere, for example. 


4. Check your final answer to see if it is reasonable. If you esti- 
mate that a herd of ten thousand cattle would yield 0.01 m? 
of leather, then you have probably made a mistake with con- 
version factors somewhere. 
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Problems 


The symbols Vv, “, etc. are explained on page 53. 


: — 74658 __ 
1 Correct use of a calculator: (a) Calculate s3995 797557 On a cal- 


culator. [Self-check: The most common mistake results in 97555.40.] 
4 

(b) Which would be more like the price of a TV, and which would 

be more like the price of a house, $3.5 x 10° or $3.5°? a 


2 Compute the following things. If they don’t make sense be- 
cause of units, say so. 

(a)3cm +5 cm 

(b) 1.11 m + 22 cm 

(c) 120 miles + 2.0 hours 

(d) 120 miles / 2.0 hours 


~ 


3 Your backyard has brick walls on both ends. You measure a 
distance of 23.4 m from the inside of one wall to the inside of the 
other. Each wall is 29.4 cm thick. How far is it from the outside 
of one wall to the outside of the other? Pay attention to significant 
figures. & 


4 The speed of light is 3.0 x 10° m/s. Convert this to furlongs 
per fortnight. A furlong is 220 yards, and a fortnight is 14 days. An 
inch is 2.54 cm. Vv 


~ 


5 Express each of the following quantities in micrograms: 
(a) 10 mg, (b) 104 g, (c) 10 kg, (d) 100 x 10° g, (e) 1000 ng. Vv 


~ 


6 Convert 134 mg to units of kg, writing your answer in scientific 
notation. > Solution, p. 875 & 


7 In the last century, the average age of the onset of puberty for 
girls has decreased by several years. Urban folklore has it that this 
is because of hormones fed to beef cattle, but it is more likely to be 
because modern girls have more body fat on the average and pos- 
sibly because of estrogen-mimicking chemicals in the environment 
from the breakdown of pesticides. A hamburger from a hormone- 
implanted steer has about 0.2 ng of estrogen (about double the 
amount of natural beef). A serving of peas contains about 300 
ng of estrogen. An adult woman produces about 0.5 mg of estrogen 
per day (note the different unit!). (a) How many hamburgers would 
a girl have to eat in one day to consume as much estrogen as an 
adult woman’s daily production? (b) How many servings of peas? 


v 


— 


8 The usual definition of the mean (average) of two numbers a 
and b is (a+b)/2. This is called the arithmetic mean. The geometric 
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mean, however, is defined as (ab)!/? (i.e., the square root of ab). For 
the sake of definiteness, let’s say both numbers have units of mass. 
(a) Compute the arithmetic mean of two numbers that have units 
of grams. Then convert the numbers to units of kilograms and 
recompute their mean. Is the answer consistent? (b) Do the same 
for the geometric mean. (c) If a and 6b both have units of grams, 
what should we call the units of ab? Does your answer make sense 
when you take the square root? (d) Suppose someone proposes to 
you a third kind of mean, called the superduper mean, defined as 
(ab)'/3, Is this reasonable? > Solution, p. 875 & 


9 In an article on the SARS epidemic, the May 7, 2003 New 
York Times discusses conflicting estimates of the disease’s incuba- 
tion period (the average time that elapses from infection to the first 
symptoms). “The study estimated it to be 6.4 days. But other sta- 
tistical calculations ... showed that the incubation period could be 
1 ; 
NET WT. 3 1/2 = (99 2 g) as long as 14.22 days.” What’s wrong here? z 


10 The photo shows the corner of a bag of pretzels. What’s 
Problem 10. wrong here? - 


11 The distance to the horizon is given by the expression V2rh, 

where r is the radius of the Earth, and h is the observer’s height 

above the Earth’s surface. (This can be proved using the Pythagorean 
theorem.) Show that the units of this expression make sense. (See 

example 2 on p. 26 for an example of how to do this.) Don’t try to 

prove the result, just check its units. & 


12 (a) Based on the definitions of the sine, cosine, and tangent, 
what units must they have? (b) A cute formula from trigonometry 
lets you find any angle of a triangle if you know the lengths of 
its sides. Using the notation shown in the figure, and letting s = 
(a+b+c)/2 be half the perimeter, we have 


ee soe 


Show that the units of this equation make sense. In other words, 


a check that the units of the right-hand side are the same as your 
answer to part a of the question. > Solution, p. 875 & 
13 A physics homework question asks, “If you start from rest 


and accelerate at 1.54 m/s? for 3.29 s, how far do you travel by the 
end of that time?” A student answers as follows: 


1.54 x 3.29 = 5.07 m 


Problem 12. His Aunt Wanda is good with numbers, but has never taken physics. 
She doesn’t know the formula for the distance traveled under con- 
stant acceleration over a given amount of time, but she tells her 
nephew his answer cannot be right. How does she know? & 
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14 You are looking into a deep well. It is dark, and you cannot 
see the bottom. You want to find out how deep it is, so you drop 
a rock in, and you hear a splash 3.0 seconds later. How deep is the 
well? Vv Oo 


15 You take a trip in your spaceship to another star. Setting off, 
you increase your speed at a constant acceleration. Once you get 
half-way there, you start decelerating, at the same rate, so that by 
the time you get there, you have slowed down to zero speed. You see 
the tourist attractions, and then head home by the same method. 
(a) Find a formula for the time, T, required for the round trip, in 
terms of d, the distance from our sun to the star, and a, the magni- 
tude of the acceleration. Note that the acceleration is not constant 
over the whole trip, but the trip can be broken up into constant- 
acceleration parts. 

(b) The nearest star to the Earth (other than our own sun) is Prox- 
ima Centauri, at a distance of d = 4 x 10'° m. Suppose you use an 
acceleration of a = 10 m/s”, just enough to compensate for the lack 
of true gravity and make you feel comfortable. How long does the 
round trip take, in years? 

(c) Using the same numbers for d and a, find your maximum speed. 
Compare this to the speed of light, which is 3.0 x 10° m/s. (Later 
in this course, you will learn that there are some new things going 
on in physics when one gets close to the speed of light, and that it 
is impossible to exceed the speed of light. For now, though, just use 
the simpler ideas you’ve learned so far.) V 


16 You climb half-way up a tree, and drop a rock. Then you 
climb to the top, and drop another rock. How many times greater 
is the velocity of the second rock on impact? Explain. (The answer 
is not two times greater.) x 


17 If the acceleration of gravity on Mars is 1/3 that on Earth, 
how many times longer does it take for a rock to drop the same 
distance on Mars? Ignore air resistance. > Solution, p. 875 © 


18 A person is parachute jumping. During the time between 
when she leaps out of the plane and when she opens her chute, her 
altitude is given by an equation of the form 


y=b—c(t+ke/*) 


where e is the base of natural logarithms, and 6, c, and k are con- 
stants. Because of air resistance, her velocity does not increase at a 
steady rate as it would for an object falling in vacuum. 

(a) What units would b, c, and k have to have for the equation to 
make sense? 

(b) Find the person’s velocity, v, as a function of time. [You will 
need to use the chain rule, and the fact that d(e”)/dzx = e”.] v 
(c) Use your answer from part (b) to get an interpretation of the 
constant c. [Hint: e~* approaches zero for large values of x.] 
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(d) Find the person’s acceleration, a, as a function of time. Vv 
(ce) Use your answer from part (b) to show that if she waits long 
enough to open her chute, her acceleration will become very small. 


~ 


19 In July 1999, Popular Mechanics carried out tests to find 
which car sold by a major auto maker could cover a quarter mile 
(402 meters) in the shortest time, starting from rest. Because the 
distance is so short, this type of test is designed mainly to favor the 
car with the greatest acceleration, not the greatest maximum speed 
(which is irrelevant to the average person). The winner was the 
Dodge Viper, with a time of 12.08 s. The car’s top (and presumably 
final) speed was 118.51 miles per hour (52.98 m/s). (a) If a car, 
starting from rest and moving with constant acceleration, covers 
a quarter mile in this time interval, what is its acceleration? (b) 
What would be the final speed of a car that covered a quarter mile 
with the constant acceleration you found in part a? (c) Based on 
the discrepancy between your answer in part } and the actual final 
speed of the Viper, what do you conclude about how its acceleration 
changed over time? > Solution, p. 876 & 


20 The speed required for a low-earth orbit is 7.9 x 10° m/s (see 
ch. 10). When a rocket is launched into orbit, it goes up a little at 
first to get above almost all of the atmosphere, but then tips over 
horizontally to build up to orbital speed. Suppose the horizontal 
acceleration is limited to 3g to keep from damaging the cargo (or 
hurting the crew, for a crewed flight). (a) What is the minimum 
distance the rocket must travel downrange before it reaches orbital 
speed? How much does it matter whether you take into account the 
initial eastward velocity due to the rotation of the earth? (b) Rather 
than a rocket ship, it might be advantageous to use a railgun design, 
in which the craft would be accelerated to orbital speeds along a 
railroad track. This has the advantage that it isn’t necessary to lift 
a large mass of fuel, since the energy source is external. Based on 
your answer to part a, comment on the feasibility of this design for 
crewed launches from the earth’s surface. a 


21 Consider the following passage from Alice in Wonderland, in 
which Alice has been falling for a long time down a rabbit hole: 


Down, down, down. Would the fall never come to an end? “I 
wonder how many miles I’ve fallen by this time?” she said aloud. 
“T must be getting somewhere near the center of the earth. Let me 
see: that would be four thousand miles down, I think” (for, you see, 
Alice had learned several things of this sort in her lessons in the 
schoolroom, and though this was not a very good opportunity for 
showing off her knowledge, as there was no one to listen to her, still 
it was good practice to say it over)... 


Alice doesn’t know much physics, but let’s try to calculate the 
amount of time it would take to fall four thousand miles, starting 
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from rest with an acceleration of 10 m/s”. This is really only a lower 
limit; if there really was a hole that deep, the fall would actually 
take a longer time than the one you calculate, both because there 
is air friction and because gravity gets weaker as you get deeper (at 
the center of the earth, g is zero, because the earth is pulling you 
equally in every direction at once). vo 


22 How many cubic inches are there in a cubic foot? The answer 
is not 12. Vv 


23 Assume a dog’s brain is twice as great in diameter as a cat’s, 
but each animal’s brain cells are the same size and their brains are 
the same shape. In addition to being a far better companion and 
much nicer to come home to, how many times more brain cells does 
a dog have than a cat? The answer is not 2. g 


24 ‘The population density of Los Angeles is about 4000 people/ km?. 


That of San Francisco is about 6000 people/km?. How many times 
farther away is the average person’s nearest neighbor in LA than in 
San Francisco? The answer is not 1.5. vo 


25 A hunting dog’s nose has about 10 square inches of active 
surface. How is this possible, since the dog’s nose is only about 1 in 
x lin x 1 in = 1 in?? After all, 10 is greater than 1, so how can it 
fit’? g 


26 Estimate the number of blades of grass on a football field. 


_ 


27 In a computer memory chip, each bit of information (a 0 or 
a 1) is stored in a single tiny circuit etched onto the surface of a 
silicon chip. The circuits cover the surface of the chip like lots in a 
housing development. A typical chip stores 64 Mb (megabytes) of 
data, where a byte is 8 bits. Estimate (a) the area of each circuit, 
and (b) its linear size. a 


28 Suppose someone built a gigantic apartment building, mea- 
suring 10 km x 10 km at the base. Estimate how tall the building 
would have to be to have space in it for the entire world’s population 
to live. g 


29 A hamburger chain advertises that it has sold 10 billion 
Bongo Burgers. Estimate the total mass of feed required to raise 


the cows used to make the burgers. B 
30 Estimate the volume of a human body, in cm?. a 
31 How many cm? is 1 mm?? > Solution, p. 876 © 


32 Compare the light-gathering powers of a 3-cm-diameter tele- 
scope and a 30-cm telescope. > Solution, p. 876 © 


33 One step on the Richter scale corresponds to a factor of 100 
in terms of the energy absorbed by something on the surface of the 
Earth, e.g., a house. For instance, a 9.3-magnitude quake would 
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release 100 times more energy than an 8.3. The energy spreads out 
from the epicenter as a wave, and for the sake of this problem we’ll 
assume we’re dealing with seismic waves that spread out in three 
dimensions, so that we can visualize them as hemispheres spreading 
out under the surface of the earth. If a certain 7.6-magnitude earth- 
quake and a certain 5.6-magnitude earthquake produce the same 
amount of vibration where I live, compare the distances from my 
house to the two epicenters. > Solution, p. 876 L 


34 In Europe, a piece of paper of the standard size, called A4, 
is a little narrower and taller than its American counterpart. The 
ratio of the height to the width is the square root of 2, and this has 
some useful properties. For instance, if you cut an A4 sheet from left 
to right, you get two smaller sheets that have the same proportions. 
You can even buy sheets of this smaller size, and they’re called A5. 
There is a whole series of sizes related in this way, all with the same 
proportions. (a) Compare an A5 sheet to an A4 in terms of area and 
linear size. (b) The series of paper sizes starts from an AO sheet, 
which has an area of one square meter. Suppose we had a series 
of boxes defined in a similar way: the BO box has a volume of one 
cubic meter, two B1 boxes fit exactly inside an BO box, and so on. 


What would be the dimensions of a BO box? vo 
35 Estimate the mass of one of the hairs in Albert Einstein’s 
moustache, in units of kg. & 
36 According to folklore, every time you take a breath, you are 
inhaling some of the atoms exhaled in Caesar’s last words. Is this 
true? If so, how many? x 


37 The Earth’s surface is about 70% water. Mars’s diameter is 
about half the Earth’s, but it has no surface water. Compare the 
land areas of the two planets. Vv oE 


Albert Einstein, and his mous- 38 The traditional Martini glass is shaped like a cone with 
tache, problem 35. the point at the bottom. Suppose you make a Martini by pouring 
vermouth into the glass to a depth of 3 cm, and then adding gin 
to bring the depth to 6 cm. What are the proportions of gin and 
vermouth? > Solution, p. 876 L 


39 The central portion of a CD is taken up by the hole and some 
surrounding clear plastic, and this area is unavailable for storing 
data. The radius of the central circle is about 35% of the outer 
radius of the data-storing area. What percentage of the CD’s area 
is therefore lost? vo 


40 The one-liter cube in the photo has been marked off into 
smaller cubes, with linear dimensions one tenth those of the big 
one. What is the volume of each of the small cubes? 

> Solution, p. 876 L 


Problem 40. 
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41 Estimate the number of man-hours required for building the 
Great Wall of China. > Solution, p. 876 


42 (a) Using the microscope photo in the figure, estimate the 
mass of a one cell of the E. coli bacterium, which is one of the 
most common ones in the human intestine. Note the scale at the 
lower right corner, which is 1 wm. Each of the tubular objects in 
the column is one cell. (b) The feces in the human intestine are 
mostly bacteria (some dead, some alive), of which E. coli is a large 
and typical component. Estimate the number of bacteria in your 
intestines, and compare with the number of human cells in your 
body, which is believed to be roughly on the order of 101°. (c) 
Interpreting your result from part b, what does this tell you about 
the size of a typical human cell compared to the size of a typical 
bacterial cell? g 


43 The figure shows a practical, simple experiment for determin- 
ing g to high precision. Two steel balls are suspended from electro- 
magnets, and are released simultaneously when the electric current 
is shut off. They fall through unequal heights Av, and Axo. A 
computer records the sounds through a microphone as first one ball 
and then the other strikes the floor. From this recording, we can 
accurately determine the quantity T’ defined as T = Atg — Aty, ie., 
the time lag between the first and second impacts. Note that since 
the balls do not make any sound when they are released, we have 
no way of measuring the individual times Atg and Atj. 

(a) Find an equation for g in terms of the measured quantities T,, 
Ax; and Ago. JV 
(b) Check the units of your equation. 

(c) Check that your equation gives the correct result in the case 
where Az, is very close to zero. However, is this case realistic? 

(d) What happens when Ax; = Az? Discuss this both mathemat- 
ically and physically. B 


44 Estimate the number of jellybeans in figure o on p. 44. 
> Solution, p. 876 L 


Key to symbols: 
easy . typical challenging Mdifficult © very difficult 
Vv An answer check is available at www.lightandmatter.com. 


Problem 42. 


oO 


Problem 43. 
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The universe has been recycling 
its contents ever since the Big 
Bang, 13.7 billion years ago. 


Chapter 1 
Conservation of Mass 


It took just a moment for that head to fall, but a hundred years 
might not produce another like it. 


Joseph-Louis Lagrange, referring to the execution of Lavoisier 
on May 8, 1794 


Mass 


Change is impossible, claimed the ancient Greek philosopher Par- 
menides. His work was nonscientific, since he didn’t state his ideas 
in a form that would allow them to be tested experimentally, but 
modern science nevertheless has a strong Parmenidean flavor. His 
main argument that change is an illusion was that something can’t 
be turned into nothing, and likewise if you have nothing, you can’t 
turn it into something. To make this into a scientific theory, we have 
to decide on a way to measure what “something” is, and we can then 
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a/ Portrait of Monsieur Lavoisier 
and His Wife, by Jacques-Louis 
David, 1788. Lavoisier invented 
the concept of conservation of 
mass. The husband is depicted 
with his scientific apparatus, 
while in the background on the 
left is the portfolio belonging 
to Madame Lavoisier, who is 
thought to have been a student of 
David's. 


b/A measurement of gravi- 
tational mass: the sphere has 
a gravitational mass of five 
kilograms. 
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check by measurements whether the total amount of “something” in 
the universe really stays constant. How much “something” is there 
in arock? Does a sunbeam count as “something?” Does heat count? 
Motion? Thoughts and feelings? 


If you look at the table of contents of this book, you’ll see that 
the first four chapters have the word “conservation” in them. In 
physics, a conservation law is a statement that the total amount of 
a certain physical quantity always stays the same. This chapter is 
about conservation of mass. The metric system is designed around 
a unit of distance, the meter, a unit of mass, the kilogram, and a 
time unit, the second. Numerical measurement of distance and time 
probably date back almost as far into prehistory as counting money, 
but mass is a more modern concept. Until scientists figured out that 
mass was conserved, it wasn’t obvious that there could be a single, 
consistent way of measuring an amount of matter, hence jiggers of 
whiskey and cords of wood. You may wonder why conservation of 
mass wasn’t discovered until relatively modern times, but it wasn’t 
obvious, for example, that gases had mass, and that the apparent 
loss of mass when wood was burned was exactly matched by the 
mass of the escaping gases. 


Once scientists were on the track of the conservation of mass 
concept, they began looking for a way to define mass in terms of a 
definite measuring procedure. If they tried such a procedure, and the 
result was that it led to nonconservation of mass, then they would 
throw it out and try a different procedure. For instance, we might 
be tempted to define mass using kitchen measuring cups, i.e., as a 
measure of volume. Mass would then be perfectly conserved for a 
process like mixing marbles with peanut butter, but there would be 
processes like freezing water that led to a net increase in mass, and 
others like soaking up water with a sponge that caused a decrease. 
If, with the benefit of hindsight, it seems like the measuring cup 
definition was just plain silly, then here’s a more subtle example of 
a wrong definition of mass. Suppose we define it using a bathroom 
scale, or a more precise device such as a postal scale that works on 
the same principle of using gravity to compress or twist a spring. 
The trouble is that gravity is not equally strong all over the surface 
of the earth, so for instance there would be nonconservation of mass 
when you brought an object up to the top of a mountain, where 
gravity is a little weaker. 


Although some of the obvious possibilities have problems, there 
do turn out to be at least two approaches to defining mass that lead 
to its being a conserved quantity, so we consider these definitions 
to be “right” in the pragmatic sense that what’s correct is what’s 
useful. 


One definition that works is to use balances, but compensate 
for the local strength of gravity. This is the method that is used 


Conservation of Mass 


by scientists who actually specialize in ultraprecise measurements. 
A standard kilogram, in the form of a platinum-iridium cylinder, 
is kept in a special shrine in Paris. Copies are made that balance 
against the standard kilogram in Parisian gravity, and they are then 
transported to laboratories in other parts of the world, where they 
are compared with other masses in the local gravity. The quantity 
defined in this way is called gravitational mass. 


A second and completely different approach is to measure how 
hard it is to change an object’s state of motion. This tells us its in- 
ertial mass. For example, I’d be more willing to stand in the way of 
an oncoming poodle than in the path of a freight train, because my 
body will have a harder time convincing the freight train to stop. 
This is a dictionary-style conceptual definition, but in physics we 
need to back up a conceptual definition with an operational defini- 
tion, which is one that spells out the operations required in order 
to measure the quantity being defined. We can operationalize our 
definition of inertial mass by throwing a standard kilogram at an ob- 
ject at a speed of 1 m/s (one meter per second) and measuring the 
recoiling object’s velocity. Suppose we want to measure the mass 
of a particular block of cement. We put the block in a toy wagon 
on the sidewalk, and throw a standard kilogram at it. Suppose the 
standard kilogram hits the wagon, and then drops straight down 
to the sidewalk, having lost all its velocity, and the wagon and the 
block inside recoil at a velocity of 0.23 m/s. We then repeat the 
experiment with the block replaced by various numbers of standard 
kilograms, and find that we can reproduce the recoil velocity of 0.23 
m/s with four standard kilograms in the wagon. We have deter- 
mined the mass of the block to be four kilograms.! Although this 
definition of inertial mass has an appealing conceptual simplicity, it 
is obviously not very practical, at least in this crude form. Never- 
theless, this method of collision is very much like the methods used 
for measuring the masses of subatomic particles, which, after all, 
can’t be put on little postal scales! 


Astronauts spending long periods of time in space need to mon- 
itor their loss of bone and muscle mass, and here as well, it’s im- 
possible to measure gravitational mass. Since they don’t want to 
have standard kilograms thrown at them, they use a slightly differ- 
ent technique (figures d and e). They strap themselves to a chair 
which is attached to a large spring, and measure the time it takes 
for one cycle of vibration. 


‘You might think intuitively that the recoil velocity should be exactly one 
fourth of a meter per second, and you’d be right except that the wagon has some 
mass as well. Our present approach, however, only requires that we give a way 
to test for equality of masses. To predict the recoil velocity from scratch, we’d 
need to use conservation of momentum, which is discussed in a later chapter. 
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c/A measurement of _ inertial 
mass: the wagon recoils with 
the same velocity in experiments 
1 and 2, establishing that the 
inertial mass of the cement block 
is four kilograms. 


d/The time for one cycle of 
vibration is related to the object's 
inertial mass. 


e/Astronaut Tamara Jernigan 


measures her inertial mass 
aboard the Space Shuttle. 
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1.1.1 Problem-solving techniques 


How do we use a conservation law, such as conservation of mass, 
to solve problems? There are two basic techniques. 


As an analogy, consider conservation of money, which makes it 
illegal for you to create dollar bills using your own inkjet printer. 
(Most people don’t intentionally destroy their dollar bills, either!) 
Suppose the police notice that a particular store doesn’t seem to 
have any customers, but the owner wears lots of gold jewelry and 
drives a BMW. They suspect that the store is a front for some kind 
of crime, perhaps counterfeiting. With intensive surveillance, there 
are two basic approaches they could use in their investigation. One 
method would be to have undercover agents try to find out how 
much money goes in the door, and how much money comes back 
out at the end of the day, perhaps by arranging through some trick 
to get access to the owner’s briefcase in the morning and evening. If 
the amount of money that comes out every day is greater than the 
amount that went in, and if they’re convinced there is no safe on the 
premises holding a large reservoir of money, then the owner must 
be counterfeiting. This inflow-equals-outflow technique is useful if 
we are sure that there is a region of space within which there is no 
supply of mass that is being built up or depleted. 


A stream of water example 1 
If you watch water flowing out of the end of a hose, you'll see 
that the stream of water is fatter near the mouth of the hose, and 
skinnier lower down. This is because the water speeds up as it 
falls. If the cross-sectional area of the stream was equal all along 
its length, then the rate of flow (kilograms per second) through a 
lower cross-section would be greater than the rate of flow through 
a cross-section higher up. Since the flow is steady, the amount 
of water between the two cross-sections stays constant. Conser- 
vation of mass therefore requires that the cross-sectional area of 
the stream shrink in inverse proportion to the increasing speed of 
the falling water. 


self-check A 

Suppose the you point the hose straight up, so that the water is rising 
rather than falling. What happens as the velocity gets smaller? What 
happens when the velocity becomes zero? > Answer, p. 866 


How can we apply a conservation law, such as conservation of 
mass, in a situation where mass might be stored up somewhere? To 
use a crime analogy again, a prison could contain a certain number 
of prisoners, who are not allowed to flow in or out at will. In physics, 
this is known as a closed system. A guard might notice that a certain 
prisoner’s cell is empty, but that doesn’t mean he’s escaped. He 
could be sick in the infirmary, or hard at work in the shop earning 
cigarette money. What prisons actually do is to count all their 
prisoners every day, and make sure today’s total is the same as 


f/ Example 1. 
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yesterday’s. One way of stating a conservation law is that for a 
closed system, the total amount of stuff (mass, in this chapter) stays 
constant. 


Lavoisier and chemical reactions in a closed system example 2 
The French chemist Antoine-Laurent Lavoisier is considered the 
inventor of the concept of conservation of mass. Before Lavoisier, 
chemists had never systematically weighed their chemicals to 
quantify the amount of each substance that was undergoing re- 
actions. They also didn’t completely understand that gases were 
just another state of matter, and hadn’t tried performing reactions 
in sealed chambers to determine whether gases were being con- 
sumed from or released into the air. For this they had at least 
one practical excuse, which is that if you perform a gas-releasing 
reaction in a sealed chamber with no room for expansion, you get 
an explosion! Lavoisier invented a balance that was capable of 
measuring milligram masses, and figured out how to do reactions 
in an upside-down bowl in a basin of water, so that the gases 
could expand by pushing out some of the water. In a crucial ex- 
periment, Lavoisier heated a red mercury compound, which we 
would now describe as mercury oxide (HgO), in such a sealed 
chamber. A gas was produced (Lavoisier later named it “oxy- 
gen”), driving out some of the water, and the red compound was 
transformed into silvery liquid mercury metal. The crucial point 
was that the total mass of the entire apparatus was exactly the 
same before and after the reaction. Based on many observations 
of this type, Lavoisier proposed a general law of nature, that mass 
is always conserved. (In earlier experiments, in which closed sys- 
tems were not used, chemists had become convinced that there 
was a mysterious substance, phlogiston, involved in combustion 
and oxidation reactions, and that phlogiston’s mass could be pos- 
itive, negative, or zero depending on the situation!) 


1.1.2 Delta notation 


A convenient notation used throughout physics is A, the up- 
percase Greek letter delta, which indicates “change in” or “after 
minus before.” For example, if b represents how much money you 
have in the bank, then a deposit. of $100 could be represented as 
Ab = $100. That is, the change in your balance was $100, or the 
balance after the transaction minus the balance before the transac- 
tion equals $100. A withdrawal would be indicated by Ab < 0. We 
represent “before” and “after” using the subscripts 7 (initial) and 
f (final), e.g., Ab = br — b;. Often the delta notation allows more 
precision than English words. For instance, “time” can be used to 
mean a point in time (“now’s the time”), t, or it could mean a period 
of time (“the whole time, he had spit on his chin”), At. 


This notation is particularly convenient for discussing conserved 
quantities. The law of conservation of mass can be stated simply as 
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a/The two pendulum bobs 
are constructed with equal grav- 
itational masses. If their inertial 
masses are also equal, then each 
pendulum should take exactly the 
same amount of time per swing. 


“1\ 


b/lf the cylinders have slightly 
unequal ratios of inertial to grav- 
itational mass, their trajectories 
will be a little different. 


Ss Y 


c/A_ simplified drawing of an 
Edtv6s-style experiment. If 
the two masses, made out of 
two different substances, have 
slightly different ratios of inertial 
to gravitational mass, then the 
apparatus will twist slightly as the 
earth spins. 
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Am = 0, where m is the total mass of any closed system. 


self-check B 
If x represents the location of an object moving in one dimension, then 
how would positive and negative signs of Ax be interpreted? > 


Answer, p. 866 
Discussion Questions 


A If an object had a straight-line x — tf graph with Ax = 0 and At # 0, 
what would be true about its velocity? What would this look like on a 
graph? What about At = 0 and Ax 40? 


1.2 Equivalence of Gravitational and Inertial Mass 


We find experimentally that both gravitational and inertial mass 
are conserved to a high degree of precision for a great number of 
processes, including chemical reactions, melting, boiling, soaking up 
water with a sponge, and rotting of meat and vegetables. Now it’s 
logically possible that both gravitational and inertial mass are con- 
served, but that there is no particular relationship between them, in 
which case we would say that they are separately conserved. On the 
other hand, the two conservation laws may be redundant, like having 
one law against murder and another law against killing people! 


Here’s an experiment that gets at the issue: stand up now and 
drop a coin and one of your shoes side by side. I used a 400-gram 
shoe and a 2-gram penny, and they hit the floor at the same time 
as far as I could tell by eye. This is an interesting result, but a 
physicist and an ordinary person will find it interesting for different 
reasons. 


The layperson is surprised, since it would seem logical that 
heaver objects would always fall faster than light ones. However, 
it’s fairly easy to prove that if air friction is negligible, any two ob- 
jects made of the same substance must have identical motion when 
they fall. For instance, a 2-kg copper mass must exhibit the same 
falling motion as a 1-kg copper mass, because nothing would be 
changed by physically joining together two 1-kg copper masses to 
make a single 2-kg copper mass. Suppose, for example, that they 
are joined with a dab of glue; the glue isn’t under any strain, be- 
cause the two masses are doing the same thing side by side. Since 
the glue isn’t really doing anything, it makes no difference whether 
the masses fall separately or side by side.? 


What a physicist finds remarkable about the shoe-and-penny ex- 
periment is that it came out the way it did even though the shoe 
and the penny are made of different substances. There is absolutely 
no theoretical reason why this should be true. We could say that it 


?The argument only fails for objects light enough to be affected appreciably 
by air friction: a bunch of feathers falls differently if you wad them up because 
the pattern of air flow is altered by putting them together. 
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happens because the greater gravitational mass of the shoe is exactly 
counteracted by its greater inertial mass, which makes it harder for 
gravity to get it moving, but that just leaves us wondering why in- 
ertial mass and gravitational mass are always in proportion to each 
other. It’s possible that they are only approximately equivalent. 
Most of the mass of ordinary matter comes from neutrons and pro- 
tons, and we could imagine, for instance, that neutrons and protons 
do not have exactly the same ratio of gravitational to inertial mass. 
This would show up as a different ratio of gravitational to inertial 
mass for substances containing different proportions of neutrons and 
protons. 


Galileo did the first numerical experiments on this issue in the 
seventeenth century by rolling balls down inclined planes, although 
he didn’t think about his results in these terms. A fairly easy way to 
improve on Galileo’s accuracy is to use pendulums with bobs made 
of different materials. Suppose, for example, that we construct an 
aluminum bob and a brass bob, and use a double-pan balance to 
verify to good precision that their gravitational masses are equal. If 
we then measure the time required for each pendulum to perform 
a hundred cycles, we can check whether the results are the same. 
If their inertial masses are unequal, then the one with a smaller 
inertial mass will go through each cycle faster, since gravity has 
an easier time accelerating and decelerating it. With this type of 
experiment, one can easily verify that gravitational and inertial mass 
are proportional to each other to an accuracy of 107° or 107‘. 


In 1889, the Hungarian physicist Roland Edtv6s used a slightly 
different approach to verify the equivalence of gravitational and in- 
ertial mass for various substances to an accuracy of about 107°, and 
the best such experiment, figure d, improved on even this phenome- 
nal accuracy, bringing it to the 10~" level.? In all the experiments 
described so far, the two objects move along similar trajectories: 
straight lines in the penny-and-shoe and inclined plane experiments, 
and circular arcs in the pendulum version. The Edtvés-style exper- 
iment looks for differences in the objects’ trajectories. The concept 
can be understood by imagining the following simplified version. 
Suppose, as in figure b, we roll a brass cylinder off of a tabletop 
and measure where it hits the floor, and then do the same with an 
aluminum cylinder, making sure that both of them go over the edge 
with precisely the same velocity. An object with zero gravitational 
mass would fly off straight and hit the wall, while an object with 
zero inertial mass would make a sudden 90-degree turn and drop 
straight to the floor. If the aluminum and brass cylinders have or- 
dinary, but slightly unequal, ratios of gravitational to inertial mass, 
then they will follow trajectories that are just slightly different. In 
other words, if inertial and gravitational mass are not exactly pro- 
portional to each other for all substances, then objects made of 


3V.B. Braginskii and V.I. Panov, Soviet Physics JETP 34, 463 (1972). 


d/A more realistic drawing 
of Braginskii and Panov’s ex- 
periment. The whole thing was 
encased in a tall vacuum tube, 
which was placed in a sealed 
basement whose _ temperature 
was controlled to within 0.02°C. 
The total mass of the platinum 
and aluminum test masses, 
plus the tungsten wire and the 
balance arms, was only 4.4 g. 
To detect tiny motions, a laser 
beam was bounced off of a mirror 
attached to the wire. There was 
so little friction that the balance 
would have taken on the order of 
several years to calm down com- 
pletely after being put in place; 
to stop these vibrations, static 
electrical forces were applied 
through the two circular plates to 
provide very gentle twists on the 
ellipsoidal mass between them. 
After Braginskii and Panov. 
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different substances will have different trajectories in the presence 
of gravity. 


A simplified drawing of a practical, high-precision experiment 
is shown in figure c. Two objects made of different substances are 
balanced on the ends of a bar, which is suspended at the center from 
a thin fiber. The whole apparatus moves through space on a com- 
plicated, looping trajectory arising from the rotation of the earth 
superimposed on the earth’s orbital motion around the sun. Both 
the earth’s gravity and the sun’s gravity act on the two objects. If 
their inertial masses are not exactly in proportion to their gravi- 
tational masses, then they will follow slightly different trajectories 
through space, which will result in a very slight twisting of the fiber 
between the daytime, when the sun’s gravity is pulling upward, and 
the night, when the sun’s gravity is downward. Figure d shows a 
more realistic picture of the apparatus. 


This type of experiment, in which one expects a null result, is 
a tough way to make a career as a scientist. If your measurement 
comes out as expected, but with better accuracy than other people 
had previously achieved, your result is publishable, but won’t be 
considered earthshattering. On the other hand, if you build the 
most sensitive experiment ever, and the result comes out contrary 
to expectations, you’re in a scary situation. You could be right, and 
earn a place in history, but if the result turns out to be due to a 
defect in your experiment, then you’ve made a fool of yourself. 


1.3 Galilean Relativity 


I defined inertial mass conceptually as a measure of how hard it 
is to change an object’s state of motion, the implication being that if 
you don’t interfere, the object’s motion won’t change. Most people, 
however, believe that objects in motion have a natural tendency to 
slow down. Suppose I push my refrigerator to the west for a while at 
0.1 m/s, and then stop pushing. The average person would say fridge 
just naturally stopped moving, but let’s imagine how someone in 
China would describe the fridge experiment carried out in my house 
here in California. Due to the rotation of the earth, California is 
moving to the east at about 400 m/s. A point in China at the same 
latitude has the same speed, but since China is on the other side 
of the planet, China’s east is my west. (If you’re finding the three- 
dimensional visualization difficult, just think of China and California 
as two freight trains that go past each other, each traveling at 400 
m/s.) If I insist on thinking of my dirt as being stationary, then 
China and its dirt are moving at 800 m/s to my west. From China’s 
point of view, however, it’s California that is moving 800 m/s in 
the opposite direction (my east). When I’m pushing the fridge to 
the west at 0.1 m/s, the observer in China describes its speed as 
799.9 m/s. Once I stop pushing, the fridge speeds back up to 800 


a/ Galileo Galilei (1564-1642). 
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m/s. From my point of view, the fridge “naturally” slowed down 
when I stopped pushing, but according to the observer in China, it 
“naturally” sped up! 


What’s really happening here is that there’s a tendency, due 
to friction, for the fridge to stop moving relative to the floor. In 
general, only relative motion has physical significance in physics, not 
absolute motion. It’s not even possible to define absolute motion, 
since there is no special reference point in the universe that everyone 
can agree is at rest. Of course if we want to measure motion, we 
do have to pick some arbitrary reference point which we will say 
is standing still, and we can then define x, y, and z coordinates 
extending out from that point, which we can define as having x = 0, 
y = 0, z = 0. Setting up such a system is known as choosing a 
frame of reference. The local dirt is a natural frame of reference for 
describing a game of basketball, but if the game was taking place on 
the deck of a moving ocean liner, we would probably pick a frame of 
reference in which the deck was at rest, and the land was moving. 


Galileo was the first scientist to reason along these lines, and 
we now use the term Galilean relativity to refer to a somewhat 
modernized version of his principle. Roughly speaking, the principle 
of Galilean relativity states that the same laws of physics apply in 
any frame of reference that is moving in a straight line at constant 
speed. We need to refine this statement, however, since it is not 
necessarily obvious which frames of reference are going in a straight 
line at constant speed. A person in a pickup truck pulling away 
from a stoplight could admit that the car’s velocity is changing, 
or she could insist that the truck is at rest, and the meter on the 
dashboard is going up because the asphalt picked that moment to 
start moving faster and faster backward! Frames of reference are 
not all created equal, however, and the accelerating truck’s frame 
of reference is not as good as the asphalt’s. We can tell, because 
a bowling ball in the back of the truck, as in figure c, appears to 
behave strangely in the driver’s frame of reference: in her rear- 
view mirror, she sees the ball, initially at rest, start moving faster 
and faster toward the back of the truck. This goofy behavior is 
evidence that there is something wrong with her frame of reference. 
A person on the sidewalk, however, sees the ball as standing still. In 
the sidewalk’s frame of reference, the truck pulls away from the ball, 
and this makes sense, because the truck is burning gas and using up 
energy to change its state of motion. 


We therefore define an inertial frame of reference as one in which 
we never see objects change their state of motion without any appar- 
ent reason. The sidewalk is a pretty good inertial frame, and a car 
moving relative to the sidewalk at constant speed in a straight line 
defines a pretty good inertial frame, but a car that is accelerating 
or turning is not a inertial frame. 


b/The earth spins. People 
in Shanghai say they’re at rest 
and people in Los Angeles are 
moving. Angelenos say the same 
about the Shanghainese. 
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c/Left: In a frame of reference 
that speeds up with the truck, the 
bowling ball appears to change 
its state of motion for no reason. 
Right: In an inertial frame of ref- 
erence, which the surface of the 
earth approximately is, the bowl- 
ing ball stands still, which makes 
sense because there is nothing 
that would cause it to change its 
state of motion. 


The principle of Galilean relativity states that inertial frames 
exist, and that the same laws of physics apply in all inertial frames 
of reference, regardless of one frame’s straight-line, constant-speed 
motion relative to another.* 


Another way of putting it is that all inertial frames are created 
equal. We can say whether one inertial frame is in motion or at rest 
relative to another, but there is no privileged “rest frame.” There 
is no experiment that comes out any different in laboratories in 
different inertial frames, so there is no experiment that could tell us 
which inertial frame is really, truly at rest. 


‘The speed of sound example 3 
> The speed of sound in air is only 340 m/s, so unless you live 
at a near-polar latitude, you’re moving at greater than the speed 
of sound right now due to the Earth’s rotation. In that case, why 
don’t we experience exciting phenomena like sonic booms all the 
time? > It might seem as though you’re unprepared to deal with 
this question right now, since the only law of physics you know 
is conservation of mass, and conservation of mass doesn’t tell 
you anything obviously useful about the speed of sound or sonic 
booms. Galilean relativity, however, is a blanket statement about 
all the laws of physics, so in a situation like this, it may let you 
predict the results of the laws of physics without actually knowing 
what all the laws are! If the laws of physics predict a certain value 
for the speed of sound, then they had better predict the speed 
of the sound relative to the air, not their speed relative to some 


“The principle of Galilean relativity is extended on page 189. 
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special “rest frame.” Since the air is moving along with the rotation 
of the earth, we don’t detect any special phenomena. To get a 
sonic boom, the source of the sound would have to be moving 
relative to the air. 


‘The Foucault pendulum example 4 
Note that in the example of the bowling ball in the truck, | didn’t 
claim the sidewalk was exactly a Galilean frame of reference. 
This is because the sidewalk is moving in a circle due to the rota- 
tion of the Earth, and is therefore changing the direction of its mo- 
tion continuously on a 24-hour cycle. However, the curve of the 


motion is so gentle that under ordinary conditions we don't notice d/ Foucault demonstrates 
that the local dirt’s frame of reference isn’t quite inertial. The first his pendulum to an audience at a 
demonstration of the noninertial nature of the earth-fixed frame of lecture in 1851. 


reference was by Foucault using a very massive pendulum (fig- 
ure d) whose oscillations would persist for many hours without be- 
coming imperceptible. Although Foucault did his demonstration in 
Paris, it’s easier to imagine what would happen at the north pole: 
the pendulum would keep swinging in the same plane, but the 
earth would spin underneath it once every 24 hours. To someone 
standing in the snow, it would appear that the pendulum’s plane 
of motion was twisting. The effect at latitudes less than 90 de- 
grees turns out to be slower, but otherwise similar. The Foucault 
pendulum was the first definitive experimental proof that the earth 
really did spin on its axis, although scientists had been convinced 
of its rotation for a century based on more indirect evidence about 
the structure of the solar system. 


Although popular belief has Galileo being prosecuted by the 
Catholic Church for saying the earth rotated on its axis and also or- 
bited the sun, Foucault’s pendulum was still centuries in the future, 
so Galileo had no hard proof; Galileo’s insights into relative versus 
absolute motion simply made it more plausible that the world could 
be spinning without producing dramatic effects, but didn’t disprove 
the contrary hypothesis that the sun, moon, and stars went around 
the earth every 24 hours. Furthermore, the Church was much more 
liberal and enlightened than most people believe. It didn’t (and still e / Galileo's trial. 
doesn’t) require a literal interpretation of the Bible, and one of the 
Church officials involved in the Galileo affair wrote that “the Bible 
tells us how to go to heaven, not how the heavens go.” In other 
words, religion and science should be separate. The actual reason 
Galileo got in trouble is shrouded in mystery, since Italy in the age of 
the Medicis was a secretive place where unscrupulous people might 
settle a score with poison or a false accusation of heresy. What is cer- 
tain is that Galileo’s satirical style of scientific writing made many 
enemies among the powerful Jesuit scholars who were his intellec- 
tual opponents — he compared one to a snake that doesn’t know 
its own back is broken. It’s also possible that the Church was far 
less upset by his astronomical work than by his support for atomism 
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(discussed further in the next section). Some theologians perceived 
atomism as contradicting transubstantiation, the Church’s doctrine 
that the holy bread and wine were literally transformed into the 
flesh and blood of Christ by the priest’s blessing. 


self-check C 
What is incorrect about the following supposed counterexamples to the 
principle of inertia? 


(1) When astronauts blast off in a rocket, their huge velocity does cause 
a physical effect on their bodies — they get pressed back into their 
seats, the flesh on their faces gets distorted, and they have a hard time 
lifting their arms. 


(2) When you're driving in a convertible with the top down, the wind in 
your face is an observable physical effect of your absolute motion. p> 
Answer, p. 866 


> Solved problem: a bug on a wheel page 71, problem 12 


Discussion Questions 


BA passenger on a cruise ship finds, while the ship is docked, that 
he can leap off of the upper deck and just barely make it into the pool 
on the lower deck. If the ship leaves dock and is cruising rapidly, will this 
adrenaline junkie still be able to make it? 


Cc You are a passenger in the open basket hanging under a helium 
balloon. The balloon is being carried along by the wind at a constant 
velocity. If you are holding a flag in your hand, will the flag wave? If so, 
which way? [Based on a question from PSSC Physics.] 


D_ Aristotle stated that all objects naturally wanted to come to rest, with 
the unspoken implication that “rest” would be interpreted relative to the 
surface of the earth. Suppose we could transport Aristotle to the moon, 
put him in a space suit, and kick him out the door of the spaceship and into 
the lunar landscape. What would he expect his fate to be in this situation? 
If intelligent creatures inhabited the moon, and one of them independently 
came up with the equivalent of Aristotelian physics, what would they think 
about objects coming to rest? 


E Sally is on an amusement park ride which begins with her chair being 
hoisted straight up a tower at a constant speed of 60 miles/hour. Despite 
stern warnings from her father that he’ll take her home the next time she 
misbehaves, she decides that as a scientific experiment she really needs 
to release her corndog over the side as she’s on the way up. She does 
not throw it. She simply sticks it out of the car, lets it go, and watches it 
against the background of the sky, with no trees or buildings as reference 
points. What does the corndog’s motion look like as observed by Sally? 
Does its speed ever appear to her to be zero? What acceleration does 
she observe it to have: is it ever positive? negative? zero? What would 
her enraged father answer if asked for a similar description of its motion 
as it appears to him, standing on the ground? 


1.3.1 Applications of calculus 


Let’s see how this relates to calculus. If an object is moving in 
one dimension, we can describe its position with a function x(t). The 
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derivative v = da/dt is called the velocity, and the second derivative 
a = dvu/dt = d?x/dt? is the acceleration. Galilean relativity tells us 
that there is no detectable effect due to an object’s absolute velocity, 
since in some other frame of reference, the object’s velocity might 
be zero. However, an acceleration does have physical consequences. 


(vi) 


Observers in different inertial frames of reference will disagree 
on velocities, but agree on accelerations. Let’s keep it simple by 
continuing to work in one dimension. One frame of reference uses a 
coordinate system x1, and the other we label x2. If the positive x1 
and x2 axes point in the same direction, then in general two inertial 
frames could be related by an equation of the form ry = 7, +b+ ut, 
where wu is the constant velocity of one frame relative to the other, 
and the constant 6 tells us how far apart the origins of the two 
coordinate systems were at t = 0. The velocities are different in the 
two frames of reference: 

dx2 dx 

rs = ae +uU 5 
Suppose, for example, frame 1 is defined from the sidewalk, and 
frame 2 is fixed to a float in a parade that is moving to our left at 
a velocity u = 1 m/s. A dog that is moving to the right with a 
velocity v; = dx,/dt = 3 m/s in the sidewalk’s frame will appear to 
be moving at a velocity of vo = dr2/dt = dx, /dt+u = 4 m/s in the 
float’s frame. 

For acceleration, however, we have 

dx = dx, 


dt? —s dt? ; 


i/This Air Force doctor volun- 
teered to ride a rocket sled as a 
medical experiment. The obvious 
effects on his head and face are 
not because of the sled’s speed 
but because of its rapid changes 
in speed: increasing in (ii) and 
(iii), and decreasing in (v) and (vi). 
In (iv) his speed is greatest, but 
because his speed is not increas- 
ing or decreasing very much at 
this moment, there is little effect 
on him. (U.S. Air Force) 
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since the derivative of the constant u is zero. Thus an accelera- 
tion, unlike a velocity, can have a definite physical significance to all 
observers in all frames of reference. If this wasn’t true, then there 
would be no particular reason to define a quantity called acceleration 
in the first place. 


self-check D 

The figure shows a bottle of beer sitting on a table in the dining car of a 
train. Does the tilting of the surface tell us about the train’s velocity, or 
its acceleration? What would a person in the train say about the bottle’s 
velocity? What about a person standing in a field outside and looking in 
through the window? What about the acceleration? > Answer, p. 866 


1.4 A Preview of Some Modern Physics 


“Mommy, why do you and Daddy have to go to work?” “To make 
money, sweetie-pie.” “Why do we need money?” “To buy food.” 
“Why does food cost money?” When small children ask a chain 
of “why” questions like this, it usually isn’t too long before their 
parents end up saying something like, “Because that’s just the way 
it is,” or, more honestly, “I don’t know the answer.” 


The same happens in physics. We may gradually learn to ex- 
plain things more and more deeply, but there’s always the possibility 
that a certain observed fact, such as conservation of mass, will never 
be understood on any deeper level. Science, after all, uses limited 
methods to achieve limited goals, so the ultimate reason for all exis- 
tence will always be the province of religion. There is, however, an 
appealing explanation for conservation of mass, which is atomism, 
the theory that matter is made of tiny, unchanging particles. The 
atomic hypothesis dates back to ancient Greece, but the first solid 
evidence to support it didn’t come until around the eighteenth cen- 
tury, and individual atoms were never detected until about 1900. 
The atomic theory implies not only conservation of mass, but a 
couple of other things as well. 


First, it implies that the total mass of one particular element 
is conserved. For instance, lead and gold are both elements, and if 
we assume that lead atoms can’t be turned into gold atoms, then 
the total mass of lead and the total mass of gold are separately 
conserved. It’s as though there was not just a law against pickpock- 
eting, but also a law against surreptitiously moving money from 
one of the victim’s pockets to the other. It turns out, however, that 
although chemical reactions never change one type of atom into an- 
other, transmutation can happen in nuclear reactions, such as the 
ones that created most of the elements in your body out of the pri- 
mordial hydrogen and helium that condensed out of the aftermath 
of the Big Bang. 


Second, atomism implies that mass is quantized, meaning that 
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only certain values of mass are possible and the ones in between 
can’t exist. We can have three atoms of gold or four atoms of gold, 
but not three an a half. Although quantization of mass is a natural 
consequence of any theory in which matter is made up of tiny parti- 
cles, it was discovered in the twentieth century that other quantities, 
such as energy, are quantized as well, which had previously not been 
suspected. 


self-check E 
Is money quantized? > Answer, p. 866 


If atomism is starting to make conservation of mass seem in- 
evitable to you, then it may disturb you to know that Einstein 
discovered it isn’t really conserved. If you put a 50-gram iron nail 
in some water, seal the whole thing up, and let it sit on a fantasti- 
cally precise balance while the nail rusts, you'll find that the system 
loses about 6 x 107!* ke of mass by the time the nail has turned 
completely to rust. This has to do with Einstein’s famous equa- 
tion E = mc?. Rusting releases heat energy, which then escapes 
out into the room. Einstein’s equation states that this amount of 
heat, E, is equivalent to a certain amount of mass, m. The c in 
the c? is the speed of light, which is a large number, and a large 
amount of energy is therefore equivalent to a very small amount of 
mass, so you don’t notice nonconservation of mass under ordinary 
conditions. What is really conserved is not the mass, m, but the 
mass-plus-energy, E + mc*. The point of this discussion is not to 
get you to do numerical exercises with E = mc? (at this point you 
don’t even know what units are used to measure energy), but simply 
to point out to you the empirical nature of the laws of physics. If a 
previously accepted theory is contradicted by an experiment, then 
the theory needs to be changed. This is also a good example of 
something called the correspondence principle, which is a historical 
observation about how scientific theories change: when a new scien- 
tific theory replaces an old one, the old theory is always contained 
within the new one as an approximation that works within a certain 
restricted range of situations. Conservation of mass is an extremely 
good approximation for all chemical reactions, since chemical reac- 
tions never release or consume enough energy to change the total 
mass by a large percentage. Conservation of mass would not have 
been accepted for 110 years as a fundamental principle of physics 
if it hadn’t been verified over and over again by a huge number of 
accurate experiments. 


This chapter is summarized on page 890. Notation and terminology 
are tabulated on pages 885-886. 
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Problem 6. 
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Problem 8. 


Problems 
The symbols V, “, etc. are explained on page 72. 


1 Thermometers normally use either mercury or alcohol as their 
working fluid. If the level of the fluid rises or falls, does this violate 
conservation of mass? 


2 The ratios of the masses of different types of atoms were 
determined a century before anyone knew any actual atomic masses 
in units of kg. One finds, for example, that when ordinary table 
salt, NaCl, is melted, the chlorine atoms bubble off as a gas, leaving 
liquid sodium metal. Suppose the chlorine escapes, so that its mass 
cannot be directly determined by weighing. Experiments show that 
when 1.00000 kg of NaCl is treated in this way, the mass of the 
remaining sodium metal is 0.39337 kg. Based on this information, 
determine the ratio of the mass of a chlorine atom to that of a 
sodium atom. vo 


3 An atom of the most common naturally occurring uranium 
isotope breaks up spontaneously into a thorium atom plus a helium 
atom. The masses are as follows: 

uranium  3.95292849 x 107° kg 

thorium 3.88638748 x 107° ke 

helium 6.646481 x 107?” kg 


Each of these experimentally determined masses is uncertain in its 
last decimal place. Is mass conserved in this process to within the 
accuracy of the experimental data? How would you interpret this? 


~ 


4 If two spherical water droplets of radius b combine to make a 
single droplet, what is its radius? (Assume that water has constant 
density. ) a 


5 Make up an experiment that would test whether mass is con- 
served in an animal’s metabolic processes. L 


6 The figure shows a hydraulic jack. What is the relationship 
between the distance traveled by the plunger and the distance trav- 
eled by the object being lifted, in terms of the cross-sectional areas? 


~ 


7 In an example in this chapter, I argued that a stream of water 
must change its cross-sectional area as it rises or falls. Suppose that 
the stream of water is confined to a constant-diameter pipe. Which 
assumption breaks down in this situation? z 


8 A river with a certain width and depth splits into two parts, 
each of which has the same width and depth as the original river. 
What can you say about the speed of the current after the split? 


~ 


9 The diagram shows a cross-section of a wind tunnel of the 
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kind used, for example, to test designs of airplanes. Under normal 
conditions of use, the density of the air remains nearly constant 
throughout the whole wind tunnel. How can the speed of the air 
be controlled and calculated? (Diagram by NASA, Glenn Research 
Center.) 


Drive Motor 


View from above: 


Test Section Model 


10 A water wave is in a tank that extends horizontally from 
x =0to x =a, and from z = 0 to z = b. We assume for simplicity 
that at a certain moment in time the height y of the water’s surface 
only depends on xz, not z, so that we can effectively ignore the z 
coordinate. Under these assumptions, the total volume of the water 
in the tank is 


v= b | w(vae 


Since the density of the water is essentially constant, conservation 
of mass requires that V is always the same. When the water is calm, 
we have y = h, where h = V/ab. If two different wave patterns move 
into each other, we might imagine that they would add in the sense 
that Ytotai —h = (yr —h) + (yo —h). Show that this type of addition 
is consistent with conservation of mass. B 


11 The figure shows the position of a falling ball at equal time 
intervals, depicted in a certain frame of reference. On a similar grid, 
show how the ball’s motion would appear in a frame of reference that 
was moving horizontally at a speed of one box per unit time relative 
to the first frame. gz 


12 The figure shows the motion of a point on the rim of a rolling 
wheel. (The shape is called a cycloid.) Suppose bug A is riding on 
the rim of the wheel on a bicycle that is rolling, while bug B is on 
the spinning wheel of a bike that is sitting upside down on the floor. 
Bug A is moving along a cycloid, while bug B is moving in a circle. 
Both wheels are doing the same number of revolutions per minute. 
Which bug has a harder time holding on, or do they find it equally 
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difficult? > Solution, p. 877 © 


Key to symbols: 
easy typical challenging difficult M® very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Chapter 2 
Conservation of Energy 


Do you pronounce it Joule’s to rhyme with schools, 
Joule’s to rhyme with Bowls, 

or Joule’s to rhyme with Scowls? 

Whatever you call it, by Joule’s, 

or Joule’s, 

or Joule’s, it’s good! 


Advertising slogan of the Joule brewery. The name, and the 
corresponding unit of energy, are now usually pronounced so 
as to rhyme with “school.” 


2.1 Energy 


2.1.1 The energy concept 


You'd probably like to be able to drive your car and light your 
apartment without having to pay money for gas and electricity, and 
if you do a little websurfing, you can easily find people who say 
they have the solution to your problem. This kind of scam has been 
around for centuries. It used to be known as a perpetual motion 
machine, but nowadays the con artists’ preferred phrase is “free 
energy.”! A typical “free-energy” machine would be a sealed box 
that heats your house without needing to be plugged into a wall 
socket or a gas pipe. Heat comes out, but nothing goes in, and this 
can go on indefinitely. But an interesting thing happens if you try 
to check on the advertised performance of the machine. Typically, 
you'll find out that either the device is still in development, or it’s 
back-ordered because so many people have already taken advantage 
of this Fantastic Opportunity! In a few cases, the magic box exists, 
but the inventor is only willing to demonstrate very small levels of 
heat output for short periods of time, in which case there’s probably 
a tiny hearing-aid battery hidden in there somewhere, or some other 
trick. 


Since nobody has ever succeeded in building a device that creates 
heat out of nothing, we might also wonder whether any device exists 


'An entertaining account of this form of quackery is given in Voodoo Sci- 
ence: The Road from Foolishness to Fraud, Robert Park, Oxford Univer- 
sity Press, 2000. Until reading this book, I hadn’t realized the degree to which 
pseudoscience had penetrated otherwise respectable scientific organizations like 
NASA. 


a/James Joule, 1818-1889. 
The son of a wealthy brewer, 
Joule was tutored as a young 
man by the famous scientist John 
Dalton. Fascinated by electricity, 
he and his brother experimented 
by giving electric shocks to each 
other and to the family’s servants. 
Joule ran the brewery as an adult, 
and science was merely a serious 
hobby. His work on energy can 
be traced to his attempt to build 
an electric motor that would 
replace steam engines. His ideas 
were not accepted at first, partly 
because they contradicted the 
widespread belief that heat was 
a fluid, and partly because they 
depended on extremely precise 
measurements, which had not 
previously been common _ in 
physics. 
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b/Heat energy can be con- 
verted to light energy. Very hot 
objects glow visibly, and even 
objects that aren’t so hot give 
off infrared light, a color of light 
that lies beyond the red end of 
the visible rainbow. This photo 
was made with a special camera 
that records infrared light. The 
man’s warm skin emits quite a 
bit of infrared light energy, while 
his hair, at a lower temperature, 
emits less. 


that can do the opposite, turning heat into nothing. You might think 
that a refrigerator was such a device, but actually your refrigerator 
doesn’t destroy the heat in the food. What it really does is to extract 
some of the heat and bring it out into the room. That’s why it has 
big radiator coils on the back, which get hot when it’s in operation. 


If it’s not possible to destroy or create heat outright, then you 
might start to suspect that heat was a conserved quantity. This 
would be a successful rule for explaining certain processes, such as 
the transfer of heat between a cold Martini and a room-temperature 
olive: if the olive loses a little heat, then the drink must gain the 
same amount. It would fail in general, however. Sunlight can heat 
your skin, for example, and a hot lightbulb filament can cool off by 
emitting light. Based on these observations, we could revise our pro- 
posed conservation law, and say that there is something called heat- 
pluslight, which is conserved. Even this, however, needs to be gen- 
eralized in order to explain why you can get a painful burn playing 
baseball when you slide into a base. Now we could call it heatplus- 
lightplusmotion. The word is getting pretty long, and we haven’t 
even finished the list. 


Rather than making the word longer and longer, physicists have 
hijacked the word “energy” from ordinary usage, and give it a new, 
specific technical meaning. Just as the Parisian platinum-iridium 
kilogram defines a specific unit of mass, we need to pick something 
that defines a definite unit of energy. The metric unit of energy is 
the joule (J), and we’ll define it as the amount of energy required 
to heat 0.24 grams of water from 20 to 21 degrees Celsius. (Don’t 
memorize the numbers.)? 


‘Temperature of a mixture example 1 
> If 1.0 kg of water at 20°C is mixed with 4.0 kg of water at 30°C, 
what is the temperature of the mixture? 


> Let’s assume as an approximation that each degree of tem- 
perature change corresponds to the same amount of energy. In 
other words, we assume AE=mcAT, regardless of whether, as 
in the definition of the joule, we have AT = 21°C-20°C or, as in 
the present example, some other combination of initial and final 
temperatures. To be consistent with the definition of the joule, we 
must have c = (1 J)/(0.24 g)/(1°C) = 4.2x 10% J/kg-°C, which is 
referred to as the specific heat of water. 


? Although the definition refers to the Celsius scale of temperature, it’s not 
necessary to give an operational definition of the temperature concept in general 
(which turns out to be quite a tricky thing to do completely rigorously); we 
only need to establish two specific temperatures that can be reproduced on ther- 
mometers that have been calibrated in a standard way. Heat and temperature 
are discussed in more detail in section 2.4, and in chapter 5. Conceptually, heat 
is a measure of energy, whereas temperature relates to how concentrated that 
energy is. 
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Conservation of energy tells us AE = 0, so 


mcAT; + m2cATs = 0 


or 
AT; = Mo 
ATo 7 mM 
= —4.0 


If 7; has to change four times as much as 7o, and the two fi- 
nal temperatures are equal, then the final temperature must be 
28°C. 


Note how only differences in temperature and energy appeared 
in the preceding example. In other words, we don’t have to make 
any assumptions about whether there is a temperature at which all 
an object’s heat energy is removed. Historically, the energy and 
temperature units were invented before it was shown that there is 
such a temperature, called absolute zero. There is a scale of tem- 
perature, the Kelvin scale, in which the unit of temperature is the 
same as the Celsius degree, but the zero point is defined as absolute 
zero. But as long as we only deal with temperature differences, it 
doesn’t matter whether we use Kelvin or Celsius. Likewise, as long 
as we deal with differences in heat energy, we don’t normally have 
to worry about the total amount of heat energy the object has. In 
standard physics terminology, “heat” is used only to refer to differ- 
ences, while the total amount is called the object’s “thermal energy.” 
This distinction is often ignored by scientists in casual speech, and 
in this book ll usually use “heat” for either quantity. 


We’re defining energy by adding up things from a list, which we 
lengthen as needed: heat, light, motion, etc. One objection to this 
approach is aesthetic: physicists tend to regard complication as a 
synonym for ugliness. If we have to keep on adding more and more 
forms of energy to our laundry list, then it’s starting to sound like 
energy is distressingly complicated. Luckily it turns out that energy 
is simpler than it seems. Many forms of energy that are apparently 
unrelated turn out to be manifestations of a small number of forms 
at the atomic level, and this is the topic of section 2.4. 


Discussion Questions 


A __ The ancient Greek philosopher Aristotle said that objects “naturally” 
tended to slow down, unless there was something pushing on them to 
keep them moving. What important insight was he missing? 


2.1.2 Logical issues 


Another possible objection is that the open-ended approach to 
defining energy might seem like a kind of cheat, since we keep on 
inventing new forms whenever we need them. If a certain experi- 
ment seems to violate conservation of energy, can’t we just invent 
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c/As in figure b, an infrared 
camera distinguishes hot and 
cold areas. As the bike skids to 
a stop with its brakes locked, the 
kinetic energy of the bike and 
rider is converted into heat in 
both the floor (top) and the tire 
(bottom). 


a new form of invisible “mystery energy” that patches things up? 
This would be like balancing your checkbook by putting in a fake 
transaction that makes your calculation of the balance agree with 
your bank’s. If we could fudge this way, then conservation of energy 
would be untestable — impossible to prove or disprove. 


Actually all scientific theories are unprovable. A theory can 
never be proved, because the experiments can only cover a finite 
number out of the infinitely many situations in which the theory is 
supposed to apply. Even a million experiments won’t suffice to prove 
it in the same sense of the word “proof” that is used in mathematics. 
However, even one experiment that contradicts a theory is sufficient 
to show that the theory is wrong. A theory that is immune to 
disproof is a bad theory, because there is no way to test it. For 
instance, if I say that 23 is the maximum number of angels that 
can dance on the head of a pin, I haven’t made a properly falsifiable 
scientific theory, since there’s no method by which anyone could even 
attempt to prove me wrong based on observations or experiments. 


Conservation of energy is testable because new forms of energy 
are expected to show regular mathematical behavior, and are sup- 
posed to be related in a measurable way to observable phenomena. 
As an example, let’s see how to extend the energy concept to include 
motion. 


2.1.3 Kinetic energy 


Energy of motion is called kinetic energy. (The root of the word 
is the same as the word “cinema” — in French, kinetic energy is 
“énergie cinétique.”) How does an object’s kinetic energy depend 
on its mass and velocity? Joule attempted a conceptually simple 
experiment on his honeymoon in the French-Swiss Alps near Mt. 
Chamonix, in which he measured the difference in temperature be- 
tween the top and bottom of a waterfall. The water at the top of 
the falls has some gravitational energy, which isn’t our subject right 
now, but as it drops, that gravitational energy is converted into ki- 
netic energy, and then into heat energy due to internal friction in 
the churning pool at the bottom: 


gravitational energy + kinetic energy > heat energy 


In the logical framework of this book’s presentation of energy, the 
significance of the experiment is that it provides a way to find out 
how an object’s kinetic energy depends on its mass and velocity. The 
increase in heat energy should equal the kinetic energy of the water 
just before impact, so in principle we could measure the water’s 
mass, velocity, and kinetic energy, and see how they relate to one 
another. 


°From Joule’s point of view, the point of the experiment was different. At 
that time, most physicists believed that heat was a quantity that was conserved 
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Although the story is picturesque and memorable, most books 
that mention the experiment fail to note that it was a failure! The 
problem was that heat wasn’t the only form of energy being released. 
In reality, the situation was more like this: 


gravitational energy kinetic energy 
—heat energy 
+ sound energy 


+ energy of partial evaporation 


The successful version of the experiment, shown in figures d and 
f, used a paddlewheel spun by a dropping weight. As with the 
waterfall experiment, this one involves several types of energy, but 
the difference is that in this case, they can all be determined and 
taken into account. (Joule even took the precaution of putting a 
screen between himself and the can of water, so that the infrared 
light emitted by his warm body wouldn’t warm it up at all!) The 


result* is 


2 


1 
K= nw {kinetic energy] 


Whenever you encounter an equation like this for the first time, 
you should get in the habit of interpreting it. First off, we can tell 
that by making the mass or velocity greater, we’d get more kinetic 
energy. That makes sense. Notice, however, that we have mass to 
the first power, but velocity to the second. Having the whole thing 
proportional to mass to the first power is necessary on theoretical 
grounds, since energy is supposed to be additive. The dependence 
on v? couldn’t have been predicted, but it is sensible. For instance, 
suppose we reverse the direction of motion. This would reverse the 
sign of v, because in one dimension we use positive and negative signs 
to indicate the direction of motion. But since v? is what appears in 
the equation, the resulting kinetic energy is unchanged. 


separately from the rest of the things to which we now refer as energy, i.e., 
mechanical energy. Separate units of measurement had been constructed for 
heat and mechanical of energy, but Joule was trying to show that one could 
convert back and forth between them, and that it was actually their sum that 
was conserved, if they were both expressed in consistent units. His main result 
was the conversion factor that would allow the two sets of units to be reconciled. 
By showing that the conversion factor came out the same in different types 
of experiments, he was supporting his assertion that heat was not separately 
conserved. From Joule’s perspective or from ours, the result is to connect the 
mysterious, invisible phenomenon of heat with forms of energy that are visible 
properties of objects, i.e., mechanical energy. 

“Tf you’ve had a previous course in physics, you may have seen this presented 
not as an empirical result but as a theoretical one, derived from Newton’s laws, 
and in that case you might feel you’re being cheated here. However, I’m go- 
ing to reverse that reasoning and derive Newton’s laws from the conservation 
laws in chapter 3. From the modern perspective, conservation laws are more 
fundamental, because they apply in cases where Newton’s laws don’t. 
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d/A_ simplified drawing — of 
Joule’s paddlewheel experiment. 


e/ The 
and floor in figure c is something 
that the average person might 
have predicted in advance, but 
there are other situations where 


heating of the tire 


its not so obvious. When a 
ball slams into a wall, it doesn’t 
rebound with the same amount of 
kinetic energy. Was some energy 
destroyed? No. The ball and 
the wall heat up. These infrared 
photos show a squash ball at 
room temperature (top), and after 
it has been played with for several 
minutes (bottom), causing it to 
heat up detectably. 
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peal 
f/A realistic drawing of Joule’s apparatus, based on the illustration 
in his original paper. The paddlewheel is sealed inside the can in the 
middle. Joule wound up the two 13-kg lead weights and dropped them 
1.6 meters, repeating this 20 times to produce a temperature change 
of only about half a degree Fahrenheit in the water inside the sealed 


can. He claimed in his paper to be able to measure temperatures to an 
accuracy of 1/200 of a degree. 


What about the factor of 1/2 in front? It comes out to be 
exactly 1/2 by the design of the metric system. If we’d been using 
the old-fashioned British engineering system of units (which is no 
longer used in the U.K.), the equation would have been K = (7.44 x 
10? Btu-s?/slug- ft?)mv?. The version of the metric system called 
the SI,° in which everything is based on units of kilograms, meters, 
and seconds, not only has the numerical constant equal to 1/2, but 
makes it unitless as well. In other words, we can think of the joule 
as simply an abbreviation, 1 J=1 kg-m?/s?. More familiar examples 
of this type of abbreviation are 1 minute=60 s, and the metric unit 
of land area, 1 hectare=10000 m?. 


Ergs and joules example 2 
> There used to be two commonly used systems of metric units, 
referred to as mks and cgs. The mks system, now called the Sl, 
is based on the meter, the kilogram, and the second. The cgs 
system, which is now obsolete, was based on the centimeter, the 
gram, and the second. In the cgs system, the unit of energy is 
not the joule but the erg, 1 erg=1 g- cm?/s*. How many ergs are 
in one joule? 


> The simplest approach is to treat the units as if they were alge- 


°Systéme International 
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bra symbols. 
kg - m? 
$2 
_ kg - m2 1000 g (100 cm ; 
. $2 1 kg 1m 


g-cm? 
s2 


1J=1 


= 10’ 


= 10’ erg 


Cabin air in a jet airplane example 3 
> A jet airplane typically cruises at a velocity of 270 m/s. Outside 
air is continuously pumped into the cabin, but must be cooled off 
first, both because (1) it heats up due to friction as it enters the en- 
gines, and (2) it is heated as a side-effect of being compressed to 
cabin pressure. Calculate the increase in temperature due to the 
first effect. The specific heat of dry air is about 1.0x 10° J/kg-°C. 


> This is easiest to understand in the frame of reference of the 
plane, in which the air rushing into the engine is stopped, and its 
kinetic energy converted into heat.® Conservation of energy tells 
us 


0O=AE 
In the plane’s frame of reference, the air’s initial velocity is vji=270 
m/s, and its final velocity is zero, so the change in its kinetic en- 
ergy is negative, 
AK = Ky — Kj 
= 0 — (1/2) mv,? 
= —(1/2)mv;/? 
Assuming that the specific heat of air is roughly independent of 


temperature (which is why the number was stated with the word 
“about”), we can substitute into 0 = AK + AEpear, giving 


1 
0 = = ivy +mcAT 


, 
a = cAT 


Note how the mass cancels out. This is a big advantage of solving 
problems algebraically first, and waiting until the end to plug in 


°It’s not at all obvious that the solution would work out in the earth’s frame of 
reference, although Galilean relativity states that it doesn’t matter which frame 
we use. Chapter 3 discusses the relationship between conservation of energy and 
Galilean relativity. 
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numbers. With a purely numerical approach, we wouldn't even 
have known what value of m to pick, or if we’d guessed a value 
like 1 kg, we wouldn’t have known whether our answer depended 
on that guess. 


Solving for AT, and writing v instead of v; for simplicity, we find 


2 
V 
AT = — 
2c 


= 40°C 


The passengers would be boiled alive if not for the refrigeration. 
The first stage of cooling happens via heat exchangers in the 
engine struts, but a second stage, using a refrigerator under the 
floor of the cabin, is also necessary. Running this refrigerator 
uses up energy, cutting into the fuel efficiency of the airplane, 
which is why typically only 50% of the cabin’s air is replaced in 
each pumping cycle of 2-3 minutes. Although the airlines prefer 
to emphasize that this is a much faster recirculation rate than in 
the ventilation systems of most buildings, people are packed more 
tightly in an airplane. 


2.1.4 Power 

Power, P, is defined as the rate of change of energy, dE/dt. 
Power thus has units of joules per second, which are usually abbre- 
viated as watts, 1 W=1 J/s. Since energy is conserved, we would 
have dE’/dt = 0 if E was the total energy of a closed system, and 
that’s not very interesting. What’s usually more interesting to dis- 
cuss is either the power flowing in or out of an open system, or the 
rate at which energy is being transformed from one form into an- 
other. The following is an example of energy flowing into an open 
system. 


Heating by a lightbulb example 4 
> The electric company bills you for energy in units of kilowatt- 
hours (kilowatts multiplied by hours) rather than in SI units of 
joules. How many joules is a kilowatt-hour? 


> 1 kilowatt-hour = (1 kW)(1 hour) = (1000 J/s)(3600 s) = 3.6 MJ. 


Now here’s an example of energy being transformed from one 
form into another. 


Human wattage example 5 
> Food contains chemical energy (discussed in more detail in sec- 
tion 2.4), and for historical reasons, food energy is normally given 
in non-Sl units of Calories. One Calorie with a capital “C” equals 
1000 calories, and 1 calorie is defined as 4.18 J. A typical person 
consumes 2000 Calories of food in a day, and converts nearly all 
of that directly to body heat. Compare the person’s heat produc- 
tion to the rate of energy consumption of a 100-watt lightbulb. 


> Strictly speaking, we can’t really compute the derivative dE /dt, 
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since we don’t know how the person’s metabolism ebbs and flows 
over the course of a day. What we can really compute is AEF /At, 
which is the power averaged over a one-day period. 


Converting to joules, we find AE = 8 x 10° J for the amount of 
energy transformed into heat within our bodies in one day. Con- 
verting the time interval likewise into SI units, At = 9 x 10’ s. 
Dividing, we find that our power is 90 J/s = 90 W, about the same 
as a lightbulb. 


2.1.5 Gravitational energy 


Gravitational energy, to which I’ve already alluded, is different 
from heat and kinetic energy in an important way. Heat and kinetic 
energy are properties of a single object, whereas gravitational energy 
describes an interaction between two objects. When the skater in 
figures g and h is at the top, his distance from the bulk of the planet 
earth is greater. Since we observe his kinetic energy decreasing on 
the way up, there must be some other form of energy that is increas- 
ing. We invent a new form of energy, called gravitational energy, and 
written U or Ug, which depends on the distance between his body 
and the planet. Where is this energy? It’s not in the skater’s body, 
and it’s not inside the earth, either, since it takes two to tango. If 
either object didn’t exist, there wouldn’t be any interaction or any 
way to measure a distance, so it wouldn’t make sense to talk about 
a distance-dependent energy. Just as marriage is a relationship be- 
tween two people, gravitational energy is a relationship between two 
objects. 


There is no precise way to define the distance between the skater 
and the earth, since both are objects that have finite size. As dis- 
cussed in more detail in section 2.3, gravity is one of the fundamen- 
tal forces of nature, a universal attraction between any two particles 
that have mass. Each atom in the skater’s body is at a definite dis- 
tance from each atom in the earth, but each of these distances is 
different. An atom in his foot is only a few centimeters from some 
of the atoms in the plaster side of the pool, but most of the earth’s 
atoms are thousands of kilometers away from him. In theory, we 
might have to add up the contribution to the gravitational energy 
for every interaction between an atom in the skater’s body and an 
atom in the earth. 


For our present purposes, however, there is a far simpler and 
more practical way to solve problems. In any region of the earth’s 
surface, there is a direction called “down,” which we can establish 
by dropping a rock or hanging a plumb bob. In figure h, the skater is 
moving up and down in one dimension, and if we did measurements 
of his kinetic energy, like the made-up data in the figure, we could 
infer his gravitational energy. As long as we stay within a relatively 
small range of heights, we find that an object’s gravitational energy 
increases at a steady rate with height. In other words, the strength 
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of gravity doesn’t change much if you only move up or down a few 
meters. We also find that the gravitational energy is proportional 
to the mass of the object we’re testing. Writing y for the height, 
and g for the overall constant of proportionality, we have 


Ug =mgy . [gravitational energy; y=height; only ac- 
curate within a small range of heights] 


The number g, with units of joules per kilogram per meter, is called 
the gravitational field. It tells us the strength of gravity in a certain 
region of space. Near the surface of our planet, it has a value of 
about 9.8 J/kg-m, which is conveniently close to 10 J/kg-m for 
rough calculations. 


Velocity at the bottom of a drop example 6 
> If the skater in figure g drops 3 meters from rest, what is his 
velocity at the bottom of the pool? 


> Starting from conservation of energy, we have 


O=AE 
=AK+AU 
= Kp — Kj + Us — Uj 
= smu +mgy; — mgy; (because K;=0) 
4 
= mv +mgAy , (Ay <0) 
sO 


Vv =./—2gAy 
= ,/—(2)(10 J/kg-m)( — 3 m) 
=8m/s (rounded to one sig. fig.) 


There are a couple of important things to note about this ex- 
ample. First, we were able to massage the equation so that it only 
involved Ay, rather than y itself. In other words, we don’t need to 
worry about where y = 0 is; any coordinate system will work, as 
long as the positive y axis points up, not down. This is no accident. 
Gravitational energy can always be changed by adding a constant 
onto it, with no effect on the final result, as long as you’re consistent 
within a given problem. 


The other interesting thing is that the mass canceled out: even 
if the skater gained weight or strapped lead weights to himself, his 
velocity at the bottom would still be 8 m/s. This isn’t an accident 
either. This is the same conclusion we reached in section 1.2, based 
on the equivalence of gravitational and inertial mass. The kinetic 
energy depends on the inertial mass, while gravitational energy is 
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related to gravitational mass, but since these two masses are equal, 
we were able to use a single symbol, m, for them, and cancel them 
out. 


We can see from the equation v = ,/—2gAy that a falling object’s 
velocity isn’t constant. It increases as the object drops farther and 
farther. What about its acceleration? If we assume that air friction 
is negligible, the arguments in section 1.2 show that the accelera- 
tion can’t depend on the object’s mass, so there isn’t much else the 
acceleration can depend on besides g. In fact, the acceleration of a 
falling object equals —g (in a coordinate system where the positive 
y axis points up), as we can easily show using the chain rule: 


(sz) = (Ge) (Se) (Ge) ) 


= (=) nenay(e) 


mv 


where I’ve calculated dv/dk as 1/(dkK/dv), and dk /dU = —1 can 
be found by differentiating K+U = (constant) to give dk +dU = 0." 


We can also check that the units of g, J/kg-m, are equivalent to 
the units of acceleration, 


J kg-m?/s?_ m 


kg-m kg-m S 


and therefore the strength of the gravitational field near the earth’s 
surface can just as well be stated as 10 m/s?. 


Speed after a given time example 7 
> An object falls from rest. How fast is it moving after two sec- 
onds? Assume that the amount of energy converted to heat by 
air friction is negligible. 


> Under the stated assumption, we have a = —g, which can be 
integrated to give v = —gt+constant. If we let t = 0 be the 
beginning of the fall, then the constant of integration is zero, so at 
t = 2s we have v = —gt = —(10 m/s?) x (2s) = 20 mis. 


The Vomit Comet example 8 
> The U.S. Air Force has an airplane, affectionately known as the 
Vomit Comet, in which astronaut trainees can experience sim- 
ulated weightlessness. Oversimplifying a little, imagine that the 
plane climbs up high, and then drops straight down like a rock. 
(It actually flies along a parabola.) Since the people are falling 
with the same acceleration as the plane, the sensation is just like 
what you'd experience if you went out of the earth’s gravitational 


“There is a mathematical loophole in this argument that would allow the 
object to hover for a while with zero velocity and zero acceleration. This point 
is discussed on page 855. 
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field. If the plane can start from 10 km up, what is the maximum 
amount of time for which the dive can last? 


> Based on data about acceleration and distance, we want to 
find time. Acceleration is the second derivative of distance, so if 
we integrate the acceleration twice with respect to time, we can 
find how position relates to time. For convenience, let’s pick a 
coordinate system in which the positive y axis is down, so a=g 
instead of —g. 


a=9 
v = gt + constant (integrating) 
= gt (starts from rest) 
1 
y= gt + constant (integrating again) 


Choosing our coordinate system to have y = 0 at t = 0, we can 
make the second constant of integration equal zero as well, so 


ey 
g 


_ [2-10000 m 
10 m/s? 
= V 2000 s* 


=40s (to one sig. fig.) 


t= 


Note that if we hadn’t converted the altitude to units of meters, 
we would have gotten the wrong answer, but we would have been 
alerted to the problem because the units inside the square root 
wouldn’t have come out to be s?. In general, it’s a good idea to 
convert all your data into SI (meter-kilogram-second) units before 
you do anything with them. 


High road, low road example 9 
> In figure i, what can you say based on conservation of energy 
about the speeds of the balls when the reach point B? What does 
conservation of energy tell you about which ball will get there 
first? Assume friction doesn’t convert any mechanical energy to 
heat or sound energy. 


> Since friction is assumed to be negligible, there are only two 
forms of energy involved: kinetic and gravitational. Since both 
balls start from rest, and both lose the same amount of gravi- 
tational energy, they must have the same kinetic energy at the 
end, and therefore they're rolling at the same speed when they 
reach B. (A subtle point is that the balls have kinetic energy both 
because they’re moving through space and because they’re spin- 
ning as they roll. These two types of energy must be in fixed 


84 Chapter 2 Conservation of Energy 


proportion to one another, so this has no effect on the conclu- 
sion.) 


Conservation of energy does not, however, tell us anything obvi- 
ous about which ball gets there first. This is a general problem 
with applying conservation laws: conservation laws don’t refer di- 
rectly to time, since they are statements that something stays the 
same at all moments in time. We expect on intuitive grounds that 
the ball that goes by the lower ramp gets to B first, since it builds 
up speed early on. 


Buoyancy example 10 
> A cubical box with mass m and volume V = b® is submerged 
in a fluid of density p. How much energy is required to raise it 
through a height Ay? 


> As the box moves up, it invades a volume V’ = b*Ay previously 
occupied by some of the fluid, and fluid flows into an equal volume 
that it has vacated on the bottom. Lowering this amount of fluid 
by a height b reduces the fluid’s gravitational energy by pV’gb = 
pgb®Ay, so the net change in energy is 


AE = mgAy — pgb®Ay 
= (m—pV)gAy 


In other words, it’s as if the mass of the box had been reduced by 
an amount equal to the fluid that otherwise would have occupied 
that volume. This is known as Archimedes’ principle, and it is 
true even if the box is not a cube, although we'll defer the more 
general proof until page 202 in chapter 3. If the box is less dense 
than the fluid, then it will float. 


A simple machine example 11 
> If the father and son on the seesaw in figure k start from rest, 
what will happen? 


> Note that although the father is twice as massive, he is at half 
the distance from the fulcrum. If the seesaw was going to start 
rotating, it would have to be losing gravitational energy in order to 
gain some kinetic energy. However, there is no way for it to gain 
or lose gravitational energy by rotating in either direction. The 
change in gravitational energy would be 


AU = AU, + AU> 
= g(mAy; + m2Ayz2) 


but Ay; and Ays have opposite signs and are in the proportion of 
two to one, since the son moves along a circular arc that covers 
the same angle as the father’s but has half the radius. Therefore 
AU = 0, and there is no way for the seesaw to trade gravitational 
energy for kinetic. 
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The seesaw example demonstrates the principle of the lever, 
which is one of the basic mechanical building blocks known as sim- 
ple machines. As discussed in more detail in chapters 3 and 4, the 
principle applies even when the interactions involved aren’t gravita- 
tional. 


Note that although a lever makes it easier to lift a heavy weight, 
it also decreases the distance traveled by the load. By reversing the 
lever, we can make the load travel a greater distance, at the expense 
of increasing the amount of force required. The human muscular- 
skeletal system uses reversed levers of this kind, which allows us to 
move more rapidly, and also makes our bodies more compact, at the 
expense of brute strength. A piano uses reversed levers so that a 
small amount of motion of the key produces a longer swing of the 
hammer. Another interesting example is the hydraulic jack shown 
in figure n. The analysis in terms of gravitational energy is exactly 
the same as for the seesaw, except that the relationship between Ay; 
and Ay2 is now determined not by geometry but by conservation 
of mass: since water is highly incompressible, conservation of mass 
is approximately the same as a requirement of constant volume, 
which can only be satisfied if the distance traveled by each piston is 
in inverse proportion to its cross-sectional area. 


Discussion Questions 


A Hydroelectric power (water flowing over a dam to spin turbines) 
appears to be completely free. Does this violate conservation of energy? 
If not, then what is the ultimate source of the electrical energy produced 
by a hydroelectric plant? 


B You throw a steel ball up in the air. How can you prove based on 
conservation of energy that it has the same speed when it falls back into 
your hand? What if you threw a feather up? Is energy not conserved in 
this case? 


C Figure m shows a pendulum that is released at A and caught by a 
peg as it passes through the vertical, B. To what height will the bob rise 
on the right? 


D ~~ What is wrong with the following definitions of g? 
(a) “g is gravity.” 

(b) “g is the speed of a falling object.” 

(c) “g is how hard gravity pulls on things.” 


2.1.6 Equilibrium and stability 


The seesaw in figure k is in equilibrium, meaning that if it starts 
out being at rest, it will stay put. This is known as a neutral equi- 
librium, since the seesaw has no preferred position to which it will 
return if we disturb it. If we move it to a different position and 
release it, it will stay at rest there as well. If we put it in motion, it 
will simply continue in motion until one person’s feet hit the ground. 


Most objects around you are in stable equilibria, like the black 
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block in figure 0/3. Even if the block is moved or set in motion, it 
will oscillate about the equilibrium position. The pictures are like 
graphs of y versus x, but since the gravitational energy U = mgy 
is proportional to y, we can just as well think of them as graphs 
of U versus x. The block’s stable equilibrium position is where 
the function U(x) has a local minimum. The book yow’re reading 
right now is in equilibrium, but gravitational energy isn’t the only 
form of energy involved. To move it upward, we’d have to supply 
gravitational energy, but downward motion would require a different 
kind of energy, in order to compress the table more. (As we'll see 
in section 2.4, this is electrical energy due to interactions between 
atoms within the table.) 


A differentiable function’s local extrema occur where its deriva- 
tive is zero. A position where dU/dz is zero can be a stable (3), 
neutral (2), or unstable equilibrium, (4). An unstable equilibrium 
is like a pencil balanced on its tip. Although it could theoretically 
remain balanced there forever, in reality it will topple due to any 
tiny perturbation, such as an air current or a vibration from a pass- 
ing truck. This is a technical, mathematical definition of instability, 
which is more restrictive than the way the word is used in ordinary 
speech. Most people would describe a domino standing upright as 
being unstable, but in technical usage it would be considered stable, 
because a certain finite amount of energy is required to tip it over, 
and perturbations smaller than that would only cause it to oscillate 
around its equilibrium position. 


The domino is also an interesting example because it has two 
local minima, one in which it is upright, and another in which it is 
lying flat. A local minimum that is not the global minimum, as in 
figure 0/5, is referred to as a metastable equilibrium. 


‘A neutral equilibrium example 12 
Figure p shows a special-purpose one-block funicular railroad 
near Hill and Fourth Streets in Los Angeles, California, used for 
getting passengers up and down a very steep hill. It has two cars 
attached to a single loop of cable, arranged so that while one car 
goes up, the other comes down. They pass each other in the 
middle. Since one car’s gravitational energy is increasing while 
the other’s is decreasing, the system is in neutral equilibrium. If 


p/ Example 12. 
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there were no frictional heating, exactly zero energy would be re- 
quired in order to operate the system. A similar counterweighting 
principle is used in aerial tramways in mountain resorts, and in 
elevators (with a solid weight, rather than a second car, as coun- 
terweight). 
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Water in a U-shaped tube example 13 
> The U-shaped tube in figure q has cross-sectional area A, and 
the density of the water inside is p. Find the gravitational energy 
as a function of the quantity y shown in the figure, and show that 
there is an equilibrium at y=0. 


> The question is a little ambiguous, since gravitational energy 
is only well defined up to an additive constant. To fix this con- 
stant, let’s define U to be zero when y=0. The difference between 
U(y) and U(0) is the energy that would be required to lift a wa- 
ter column of height y out of the right side, and place it above 
the dashed line, on the left side, raising it through a height y. 
This water column has height y and cross-sectional area A, so 
its volume is Ay, its mass is pAy, and the energy required is 
mgy=(pAy)gy=pgAy*. We then have U(y) = U(0) + pgAy? = 
pgAy?. 


To find equilibria, we look for places where the derivative dU/dy = 
2pgAy equals 0. As we'd expect intuitively, the only equilibrium 
occurs at y=0. The second derivative test shows that this is a 
local minimum (not a maximum or a point of inflection), so this is 
a stable equilibrium. 


2.1.7 Predicting the direction of motion 


Kinetic energy doesn’t depend on the direction of motion. Some- 
times this is helpful, as in the high road-low road example (p. 84, 
example 9), where we were able to predict that the balls would have 
the same final speeds, even though they followed different paths and 
were moving in different directions at the end. In general, however, 
the two conservation laws we’ve encountered so far aren’t enough to 
predict an object’s path through space, for which we need conserva- 
tion of momentum (chapter 3), and the mathematical technique of 
vectors. Before we develop those ideas in their full generality, how- 
ever, it will be helpful to do a couple of simple examples, including 
one that we'll get a lot of mileage out of in section 2.3. 


Suppose we observe an air hockey puck gliding frictionlessly to 
the right at a velocity v, and we want to predict its future motion. 
Since there is no friction, no kinetic energy is converted to heat. 
The only form of energy involved is kinetic energy, so conservation 
of energy, AE = 0, becomes simply Ak = 0. There’s no particular 
reason for the puck to do anything but continue moving to the right 
at constant speed, but it would be equally consistent with conserva- 
tion of energy if it spontaneously decided to reverse its direction of 
motion, changing its velocity to —v. Either way, we’d have AK = 0. 
There is, however, a way to tell which motion is physical and which 
is unphysical. Suppose we consider the whole thing again in the 
frame of reference that is initially moving right along with the puck. 
In this frame, the puck starts out with K = 0. What we origi- 
nally described as a reversal of its velocity from v to —v is, in this 


q / Water in a U-shaped tube. 
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new frame of reference, a change from zero velocity to —2v, which 
would violate conservation of energy. In other words, the physically 
possible motion conserves energy in all frames of reference, but the 
unphysical motion only conserves energy in one special frame of 
reference. 


For our second example, we consider a car driving off the edge 
of a cliff (r). For simplicity, we assume that air friction is negligible, 
so only kinetic and gravitational energy are involved. Does the car 
follow trajectory 1, familiar from Road Runner cartoons, trajectory 
2, a parabola, or 3, a diagonal line? All three could be consistent 
with conservation of energy, in the ground’s frame of reference. For 
instance, the car would have constant gravitational energy along 
the initial horizontal segment of trajectory 1, so during that time 
it would have to maintain constant kinetic energy as well. Only a 
parabola, however, is consistent with conservation of energy com- 
bined with Galilean relativity. Consider the frame of reference that 
is moving horizontally at the same speed as that with which the car 
went over the edge. In this frame of reference, the cliff slides out 
from under the initially motionless car. The car can’t just hover 
for a while, so trajectory 1 is out. Repeating the same math as in 
example 8 on p. 83, we have 


* 1 2 
x =0, y= 59 


in this frame of reference, where the stars indicate coordinates mea- 
sured in the moving frame of reference. These coordinates are re- 
lated to the ground-fixed coordinates (x,y) by the equations 


r=a2*+ut and y=y : 


where v is the velocity of one frame with respect to the other. We 
therefore have 
1 


r=vt, y= sgt ; 


in our original frame of reference. Eliminating t, we can see that 
this has the form of a parabola: 


= 8 
y Qv2 
self-check A 
What would the car’s motion be like in the * frame of reference if it 
followed trajectory 3? > Answer, p. 866 
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2.2 Numerical Techniques 


Engineering majors are a majority of the students in the kind of 
physics course for which this book is designed, so most likely you 
fall into that category. Although you surely recognize that physics 
is an important part of your training, if you’ve had any exposure 
to how engineers really work, you’re probably skeptical about the 
flavor of problem-solving taught in most science courses. You real- 
ize that not very many practical engineering calculations fall into 
the narrow range of problems for which an exact solution can be 
calculated with a piece of paper and a sharp pencil. Real-life prob- 
lems are usually complicated, and typically they need to be solved 
by number-crunching on a computer, although we can often gain 
insight by working simple approximations that have algebraic solu- 
tions. Not only is numerical problem-solving more useful in real life, 
it’s also educational; as a beginning physics student, I really only 
felt like I understood projectile motion after I had worked it both 
ways, using algebra and then a computer program. (This was back 
in the days when 64 kilobytes of memory was considered a lot.) 


In this section, we'll start by seeing how to apply numerical 
techniques to some simple problems for which we know the answer in 
“closed form,” i.e., a single algebraic expression without any calculus 
or infinite sums. After that, we'll solve a problem that would have 
made you world-famous if you could have done it in the seventeenth 
century using paper and a quill pen! Before you continue, you should 
read Appendix 1 on page 853 that introduces you to the Python 
programming language. 


First let’s solve the trivial problem of finding how much time it 
takes an object moving at speed v to travel a straight-line distance 
dist. This closed-form answer is, of course, dist/v, but the point 
is to introduce the techniques we can use to solve other problems of 
this type. The basic idea is to divide the distance up into n equal 
parts, and add up the times required to traverse all the parts. The 
following Python function does the job. Note that you shouldn’t 
type in the line numbers on the left, and you don’t need to type 
in the comments, either. I’ve omitted the prompts >>> and ... in 
order to save space. 


1 import math 

2 def timei(dist,v,n): 

3 x=0 # Initialize the position. 

4 dx = dist/n # Divide dist into n equal parts. 

5 t=0 # Initialize the time. 

6 for i in range(n): 

7 x = xtdx # Change x. 

8 dt=dx/v # time=distance/speed 

9 t=ttdt # Keep track of elapsed time. 
return t 
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How long does it take to move 1 meter at a constant speed of 1 m/s? 
If we do this, 


>>> print (time1(1.0,1.0,10)) # dist, v, n 
0.99999999999999989 


Python produces the expected answer by dividing the distance into 
ten equal 0.1-meter segments, and adding up the ten 0.1-second 
times required to traverse each one. Since the object moves at con- 
stant speed, it doesn’t even matter whether we set n to 10, 1, or a 
million: 


>>> print (time1(1.0,1.0,1)) # dist, v, n 
1.0 


Now let’s do an example where the answer isn’t obvious to people 
who don’t know calculus: how long does it take an object to fall 
through a height h, starting from rest? We know from example 8 
on page 83 that the exact answer, found using calculus, is \/2h/g. 
Let’s see if we can reproduce that answer numerically. The main 
difference between this program and the previous one is that now 
the velocity isn’t constant, so we need to update it as we go along. 
Conservation of energy gives mgh = (1/2)mv?+mgy for the velocity 
v at height y, so v = —,/2g(h — y). (We choose the negative root 
because the object is moving down, and our coordinate system has 
the positive y axis pointing up.) 


1 import math 

2 def time2(h,n): 

3 g=9.8 # gravitational field 

4 y=h # Initialize the height. 

5 v=0 # Initialize the velocity. 

6 dy = -h/n # Divide h into n equal parts. 
7 t=0 # Initialize the time. 

8 for i in range(n): 

9 y = ytdy # Change y. (Note dy<0.) 

10 v = -math.sqrt (2*g* (h-y) ) # from cons. of energy 
11 dt=dy/v # dy and v are <0, so dt is >0 
12 t=tt+dt # Keep track of elapsed time. 
13 return t 


For h=1.0 m, the closed-form result is \/2- 1.0 m/9.8 m/s? = 0.45 s. 
With the drop split up into only 10 equal height intervals, the nu- 
merical technique provides a pretty lousy approximation: 


>>> print (time2(1.0,10)) #h, n 
0.35864270709233342 
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But by increasing n to ten thousand, we get an answer that’s as 
close as we need, given the limited accuracy of the raw data: 


>>> print (time2(1.0,10000) ) #h, n 
0.44846664060793945 


A subtle point is that we changed y in line 9, and then on line 
10 we calculated v, which depends on y. Since y is only changing 
by a ten-thousandth of a meter with each step, you might think 
this wouldn’t make much of a difference, and you’d be almost right, 
except for one small problem: if we swapped lines 9 and 10, then 
the very first time through the loop, we’d have v=0, which would 
produce a division-by-zero error when we calculated dt! Actually 
what would make the most sense would be to calculate the velocity 
at height y and the velocity at height y+dy (recalling that dy is 
negative), average them together, and use that value of y to calculate 
the best estimate of the velocity between those two points. Since 
the acceleration is constant in the present example, this modification 
results in a program that gives an exact result even for n=1: 


1 import math 
2 def time3(h,n): 
3 g=9.8 
4 y=h 
5 v=0 
6 dy = -h/n 
7 t=0 
8 for i in range(n): 
9 y.old =y 
10 yor ry 
11 v_old = math.sqrt (2*g*(h-y_old) ) 
12 v = math.sqrt (2*g*(h-y) ) 
13 v_avg = —(v_oldtv)/2. 
14 dt=dy/v_avg 
15 t=tt+dt 
16 return t 
>>> print (time3(1.0,1)) #h,n 


0.45175395145262565 


Now we’re ready to attack a problem that challenged the best 
minds of Europe back in the days when there were no computers. 
In 1696, the mathematician Johann Bernoulli posed the following 
famous question. Starting from rest, an object slides frictionlessly 
over a curve joining the point (a,b) to the point (0,0). Of all the 
possible shapes that such a curve could have, which one gets the 
object to its destination in the least possible time, and how much 
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0 0.5 1.0 
x (m) 

a / Approximations to the 
brachistochrone curve using 
a third-order polynomial (solid 
line), and a seventh-order poly- 
nomial (dashed). The latter 
only improves the time by four 
milliseconds. 


time does it take? The optimal curve is called the brachistochrone, 
from the Greek “short time.” The solution to the brachistochrone 
problem evaded Bernoulli himself, as well as Leibniz, who had been 
one of the inventors of calculus. The English physicist Isaac Newton, 
however, stayed up late one night after a day’s work running the 
royal mint, and, according to legend, produced an algebraic solution 
at four in the morning. He then published it anonymously, but 
Bernoulli is said to have remarked that when he read it, he knew 
instantly from the style that it was Newton — he could “tell the 
lion from the mark of his claw.” 


Rather than attempting an exact algebraic solution, as Newton 
did, we'll produce a numerical result for the shape of the curve 
and the minimum time, in the special case of a=1.0 m and b=1.0 m. 
Intuitively, we want to start with a fairly steep drop, since any speed 
we can build up at the start will help us throughout the rest of the 
motion. On the other hand, it’s possible to go too far with this idea: 
if we drop straight down for the whole vertical distance, and then 
do a right-angle turn to cover the horizontal distance, the resulting 
time of 0.68 s is quite a bit longer than the optimal result, the reason 
being that the path is unnecessarily long. There are infinitely many 
possible curves for which we could calculate the time, but let’s look 
at third-order polynomials, 


y=cqnt+ con" + 3x? ; 


where we require cz = (b—c,a—c2a”)/a® in order to make the curve 
pass through the point (a,b). The Python program, below, is not 
much different from what we’ve done before. The function only asks 
for cy and cg, and calculates c3 internally at line 4. Since the motion 
is two-dimensional, we have to calculate the distance between one 
point and the next using the Pythagorean theorem, at line 16. 


1 import math 

2 def timeb(a,b,c1,c2,n): 

3 g=9.8 

4 c3 = (b-c1l*a-c2*a**2) / (a**3) 
5 x=a 

6 y=b 

7 dx = -a/n 

8 t=0 

9 for i in range(n): 

10 y.old =y 

11 x = xtdx 

12 y = CL*K+C2*K**2+C3*X**S 
13 dy = y-y_old 

14 v_old = math.sqrt (2*g* (b-y_old) ) 
15 v = math.sqrt (2*g*(b-y) ) 
16 v_avg = (v_old+v)/2. 
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17 ds = math.sqrt (dx**2+dy**2) # Pythagorean thm. 


dt=ds/v_avg 
t=t+dt 
return t 


As a first guess, we could try a straight diagonal line, y = xz, 
which corresponds to setting c; = 1, and all the other coefficients to 
zero. The result is a fairly long time: 


>>> a=l. 

>>> b=1. 

>>> n=10000 

>>> ci=1. 

>>> c2=0. 

>>> print (timeb(a,b,c1,c2,n)) 
0.63887656499994161 


What we really need is a curve that’s very steep on the right, and 


flatter on the left, so it would actually make more sense to try y = x?: 


>>> ci=0. 

>>> c2=0. 

>>> print (timeb(a,b,c1,c2,n)) 
0.59458339947087069 


This is a significant improvement, and turns out to be only a hun- 
dredth of a second off of the shortest possible time! It’s possible, 
although not very educational or entertaining, to find better ap- 
proximations to the brachistochrone curve by fiddling around with 
the coefficients of the polynomial by hand. The real point of this 
discussion was to give an example of a nontrivial problem that can 
be attacked successfully with numerical techniques. I found the first 
approximation shown in figure a, 


y = (0.62)” + (—0.93)a? + (1.31)2 


by using the program listed in appendix 2 on page 856 to search 
automatically for the optimal curve. The seventh-order approxima- 
tion shown in the figure came from a straightforward extension of 
the same program. 
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a/An ellipse is circle that has 
been distorted by shrinking and 
stretching along perpendicular 
axes. 


b/An_ ellipse can be con- 
structed by tying a string to two 
pins and drawing like this with a 
pencil stretching the string taut. 
Each pin constitutes one focus of 
the ellipse. 


D 
a 


S 


c/lf the time interval taken 
by the planet to move from P to Q 
is equal to the time interval from 
R to S, then according to Kepler’s 
equal-area law, the two shaded 
areas are equal. The planet 
is moving faster during time 
interval RS than it was during 
PQ, because gravitational energy 
has been transformed into kinetic 
energy. 


2.3 Gravitational Phenomena 


Cruise your radio dial today and try to find any popular song that 
would have been imaginable without Louis Armstrong. By introduc- 
ing solo improvisation into jazz, Armstrong took apart the jigsaw 
puzzle of popular music and fit the pieces back together in a dif- 
ferent way. In the same way, Newton reassembled our view of the 
universe. Consider the titles of some recent physics books written 
for the general reader: The God Particle, Dreams of a Fi- 
nal Theory. When the subatomic particle called the neutrino was 
recently proven for the first time to have mass, specialists in cos- 
mology began discussing seriously what effect this would have on 
calculations of the evolution of the universe from the Big Bang to 
its present state. Without the English physicist Isaac Newton, such 
attempts at universal understanding would not merely have seemed 
ambitious, they simply would not have occurred to anyone. 


This section is about Newton’s theory of gravity, which he used 
to explain the motion of the planets as they orbited the sun. Newton 
tosses off a general treatment of motion in the first 20 pages of his 
Mathematical Principles of Natural Philosophy, and then 
spends the next 130 discussing the motion of the planets. Clearly 
he saw this as the crucial scientific focus of his work. Why? Because 
in it he showed that the same laws of nature applied to the heavens 
as to the earth, and that the gravitational interaction that made 
an apple fall was the same as the as the one that kept the earth’s 
motion from carrying it away from the sun. 


2.3.1 Kepler’s laws 


Newton wouldn’t have been able to figure out why the planets 
move the way they do if it hadn’t been for the astronomer Tycho 
Brahe (1546-1601) and his protege Johannes Kepler (1571-1630), 
who together came up with the first simple and accurate description 
of how the planets actually do move. The difficulty of their task is 
suggested by the figure below, which shows how the relatively simple 
orbital motions of the earth and Mars combine so that as seen from 
earth Mars appears to be staggering in loops like a drunken sailor. 


Brahe, the last of the great naked-eye astronomers, collected ex- 
tensive data on the motions of the planets over a period of many 
years, taking the giant step from the previous observations’ accuracy 
of about 10 minutes of arc (10/60 of a degree) to an unprecedented 
1 minute. The quality of his work is all the more remarkable consid- 
ering that his observatory consisted of four giant brass protractors 
mounted upright in his castle in Denmark. Four different observers 
would simultaneously measure the position of a planet in order to 
check for mistakes and reduce random errors. 


With Brahe’s death, it fell to his former assistant Kepler to try 
to make some sense out of the volumes of data. After 900 pages of 
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Mars' orbit 


calculations and many false starts and dead-end ideas, Kepler finally 
synthesized the data into the following three laws: 


Kepler’s elliptical orbit law: The planets orbit the sun in 
elliptical orbits with the sun at one focus. 

Kepler’s equal-area law: The line connecting a planet to the 
sun sweeps out equal areas in equal amounts of time. 

Kepler’s law of periods: The time required for a planet to orbit 
the sun, called its period, T’, is proportional to the long axis of the 
ellipse raised to the 3/2 power. The constant of proportionality 


is the same for all the planets. 


Although the planets’ orbits are ellipses rather than circles, most 
are very close to being circular. The earth’s orbit, for instance, is 
only flattened by 1.7% relative to a circle. In the special case of a 
planet in a circular orbit, the two foci (plural of “focus” ) coincide 
at the center of the circle, and Kepler’s elliptical orbit law thus says 
that the circle is centered on the sun. The equal-area law implies 
that a planet in a circular orbit moves around the sun with constant 
speed. For a circular orbit, the law of periods then amounts to a 
statement that the time for one orbit is proportional to r?/?, where 
r is the radius. If all the planets were moving in their orbits at the 
same speed, then the time for one orbit would simply depend on 
the circumference of the circle, so it would only be proportional to 
r to the first power. The more drastic dependence on r°/? means 
that the outer planets must be moving more slowly than the inner 


planets. 
Our main focus in this section will be to use the law of periods 


d/As the earth and Mars revolve 
around the sun at different rates, 
the combined effect of their mo- 
tions makes Mars appear to trace 
a strange, looped path across the 
background of the distant stars. 
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e/A cannon fires cannon- 
balls at different velocities, from 
the top of an imaginary mountain 
that rises above the earth’s atmo- 
sphere. This is almost the same 
as a figure Newton included in his 
Mathematical Principles. 


to deduce the general equation for gravitational energy. The equal- 
area law turns out to be a statement on conservation of angular 
momentum, which is discussed in chapter 4. We'll demonstrate 
the elliptical orbit law numerically in chapter 3, and analytically in 
chapter 4. 


2.3.2 Circular orbits 


Kepler’s laws say that planets move along elliptical paths (with 
circles as a special case), which would seem to contradict the proof 
on page 90 that objects moving under the influence of gravity have 
parabolic trajectories. Kepler was right. The parabolic path was 
really only an approximation, based on the assumption that the 
gravitational field is constant, and that vertical lines are all parallel. 
In figure e, trajectory 1 is an ellipse, but it gets chopped off when 
the cannonball hits the earth, and the small piece of it that is above 
ground is nearly indistinguishable from a parabola. Our goal is 
to connect the previous calculation of parabolic trajectories, y = 
(g/2v7)a?, with Kepler’s data for planets orbiting the sun in nearly 
circular orbits. Let’s start by thinking in terms of an orbit that 
circles the earth, like orbit 2 in figure e. It’s more natural now 
to choose a coordinate system with its origin at the center of the 
earth, so the parabolic approximation becomes y = r — (g/2v?)z?, 
where r is the distance from the center of the earth. For small 
values of x, i.e., when the cannonball hasn’t traveled very far from 
the muzzle of the gun, the parabola is still a good approximation 
to the actual circular orbit, defined by the Pythagorean theorem, 
r2 = 2? +y?, or y=rr/1—22/r2. For small values of x, we can use 
the approximation 1 + € © 1+€/2 to find y » r—(1/2r)x?. Setting 
this equal to the equation of the parabola, we have g/2v” = (1/2r), 
or 


v= /gr [condition for a circular orbit] 
' Low-earth orbit example 14 


To get a feel for what this all means, let’s calculate the velocity 
required for a satellite in a circular low-earth orbit. Real low-earth- 
orbit satellites are only a few hundred km up, so for purposes of 
rough estimation we can take r to be the radius of the earth, and 
g is not much less than its value on the earth’s surface, 10 m/s”. 
Taking numerical data from Appendix 5, we have 


v= Vor 
2 (10 m/s2)(6.4 x 103 km) 


= \/(10 m/s2)(6.4 x 108 m) 


= \/6.4 x 107 m2/s? 
= 8000 m/s 
(about twenty times the speed of sound). 
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In one second, the satellite moves 8000 m horizontally. Dur- 
ing this time, it drops the same distance any other object would: 
about 5 m. But a drop of 5 m over a horizontal distance of 8000 
m is just enough to keep it at the same altitude above the earth’s 
curved surface. 


2.3.3 The sun’s gravitational field 


We can now use the circular orbit condition v = \/gr, combined 
with Kepler’s law of periods, T « r°/? for circular orbits, to deter- 
mine how the sun’s gravitational field falls off with distance.? From 
there, it will be just a hop, skip, and a jump to get to a universal 


description of gravitational interactions. 


The velocity of a planet in a circular orbit is proportional to 
r/T, so 


r/T x /gr 
r [r3/? x /gr 


g x 1/r? 


If gravity behaves systematically, then we can expect the same to 
be true for the gravitational field created by any object, not just the 
sun. 


There is a subtle point here, which is that so far, r has just meant 
the radius of a circular orbit, but what we have come up with smells 
more like an equation that tells us the strength of the gravitational 
field made by some object (the sun) if we know how far we are from 
the object. In other words, we could reinterpret r as the distance 
from the sun. 


2.3.4 Gravitational energy in general 


We now want to find an equation for the gravitational energy of 
any two masses that attract each other from a distance r. We assume 
that r is large enough compared to the distance between the objects 
so that we don’t really have to worry about whether r is measured 
from center to center or in some other way. This would be a good 
approximation for describing the solar system, for example, since 
the sun and planets are small compared to the distances between 
them — that’s why you see Venus (the “evening star”) with your 
bare eyes as a dot, not a disk. 


The equation we seek is going to give the gravitational energy, 
U, as a function of m1, m2, and r. We already know from expe- 
rience with gravity near the earth’s surface that U is proportional 


®There is a hidden assumption here, which is that the sun doesn’t move. 
Actually the sun wobbles a little because of the planets’ gravitational interactions 
with it, but the wobble is small due to the sun’s large mass, so it’s a pretty good 
approximation to assume the sun is stationary. Chapter 3 provides the tools to 
analyze this sort of thing completely correctly — see p. 142. 
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f/The gravitational energy 
U = —Gm,m2/r graphed as a 
function of r. 


to the mass of the object that interacts with the earth gravitation- 
ally, so it makes sense to assume the relationship is symmetric: U is 
presumably proportional to the product m m2. We can no longer 
assume AU « Ar, as in the earth’s-surface equation AU = mgAy, 
since we are trying to construct an equation that would be valid 
for all values of r, and g depends on r. We can, however, con- 
sider an infinitesimally small change in distance dr, for which we’ll 
have dU = moggidr, where g; is the gravitational field created by 
m,. (We could just as well have written this as dU = migedr, since 
we’re not assuming either mass is “special” or “active.”) Integrating 
this equation, we have 


fu- [ meer 


U=ms f nar 


1 
Ux mms f Gar 
r 


myimM2 


Ue 
Tr 


where we’re free to take the constant of integration to be equal to 
zero, since gravitational energy is never a well-defined quantity in 
absolute terms. Writing G for the constant of proportionality, we 
have the following fundamental description of gravitational interac- 
tions: 


Gm m2 


U= [gravitational energy of two masses 


r 
separated by a distance r] 


We'll refer to this as Newton’s law of gravity, although in reality 
he stated it in an entirely different form, which turns out to be 
mathematically equivalent to this one. 


Let’s interpret his result. First, don’t get hung up on the fact 
that it’s negative, since it’s only differences in gravitational energy 
that have physical significance. The graph in figure f could be shifted 
up or down without having any physical effect. The slope of this 
graph relates to the strength of the gravitational field. For instance, 
suppose figure f is a graph of the gravitational energy of an asteroid 
interacting with the sun. If the asteroid drops straight toward the 
sun, from A to B, the decrease in gravitational energy is very small, 
so it won’t speed up very much during that motion. Points C and 
D, however, are in a region where the graph’s slope is much greater. 
As the asteroid moves from C to D, it loses a lot of gravitational 
energy, and therefore speeds up considerably. This is due to the 
stronger gravitational field. 
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Determining G example 15 
The constant G is not easy to determine, and Newton went to his 
grave without knowing an accurate value for it. If we knew the 
mass of the earth, then we could easily determine G from exper- 
iments with terrestrial gravity, but the only way to determine the 
mass of the earth accurately in units of kilograms is by finding G 
and reasoning the other way around! (If you estimate the average 
density of the earth, you can make at least a rough estimate of 
G.) Figures g and h show how G was first measured by Henry 
Cavendish in the nineteenth century.The rotating arm is released 
from rest, and the kinetic energy of the two moving balls is mea- 
sured when they pass position C. Conservation of energy gives 


IBA IBD ICA fers) 


where M is the mass of one of the large balls, m is the mass of 
one of the small ones, and the factors of two, which will cancel, 
occur because every energy is mirrored on the opposite side of 
the apparatus. (As discussed on page 102, it turns out that we get 
the right result by measuring all the distances from the center of 
one sphere to the center of the other.) This can easily be solved 
for G. The best modern value of G, from later versions of the 
same experiment, is 6.67 x 10-1! J - m/kg?. 


Escape velocity example 16 
> The Pioneer 10 space probe was launched in 1972, and contin- 
ued sending back signals for 30 years. In the year 2001, not long 
before contact with the probe was lost, it was about 1.2 x 10'? m 
from the sun, and was moving almost directly away from the sun 
at a velocity of 1.21 104 m. The mass of the sun is 1.99 x 10°° kg. 
Will Pioneer 10 escape permanently, or will it fall back into the so- 
lar system? 


> We want to know whether there will be a point where the probe 
will turn around. If so, then it will have zero kinetic energy at the 
turnaround point: 


Kj + U; = U; 
1. 5 GMm GMm 
mv = 
2 if lf 
1 2 GM_ GM 
2 fof , 


where M is the mass of the sun, ™ is the (irrelevant) mass of 
the probe, and r; is the distance from the sun of the hypothetical 
turnaround point. Plugging in numbers on the left, we get a posi- 
tive result. There can therefore be no solution, since the right side 
is negative. There won't be any turnaround point, and Pioneer 10 
is never coming back. 


g/Cavendish’s original draw- 
ing of the apparatus for his 
experiment, discussed in exam- 
ple 15. The room was sealed 
to exclude air currents, and the 
motion was observed through 
telescopes sticking through holes 
in the walls. 


h/A_ simplified drawing of the 
Cavendish experiment, viewed 
from above. The rod with the two 
small masses on the ends hangs 
from a thin fiber, and is free to 
rotate. 


Pluto 


Saturn 


i/ The Pioneer 10 space probe’s 
trajectory from 1974 to 1992, 
with circles marking its position 


at one-year intervals. After its 
1974 slingshot maneuver around 
Jupiter, the probe’s motion was 
determined almost exclusively by 
the sun’s gravity. 
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The minimum velocity required for this to happen is called es- 
cape velocity. For speeds above escape velocity, the orbits are 
open-ended hyperbolas, rather than repeating elliptical orbits. In 
figure i, Pioneer’s hyperbolic trajectory becomes almost indistin- 
guishable from a line at large distances from the sun. The motion 
slows perceptibly in the first few years after 1974, but later the 
speed becomes nearly constant, as shown by the nearly constant 
spacing of the dots. 


The gravitational field 


We got the energy equation U = —Gm,mg2/r by integrating 
g « 1/r? and then inserting a constant of proportionality to make 
the proportionality into an equation. The opposite of an integral is 
a derivative, so we can now go backwards and insert a constant of 
proportionality in g x 1/r? that will be consistent with the energy 
equation: 


dU = mogidr 
had 

n= as. ae 
_ 1d Gm m2 
mg dr ( r ) 


| 

| 

Q 
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This kind of inverse-square law occurs all the time in nature. For 
instance, if you go twice as far away from a lightbulb, you receive 
1/4 as much light from it, because as the light spreads out, it is 
like an expanding sphere, and a sphere with twice the radius has 
four times the surface area. It’s like spreading the same amount of 
peanut butter on four pieces of bread instead of one — we have to 
spread it thinner. 


Discussion Questions 


A A bowling ball interacts gravitationally with the earth. Would it 
make sense for the gravitational energy to be inversely proportional to the 
distance between their surfaces rather than their centers? 


2.3.5 The shell theorem 


Newton’s great insight was that gravity near the earth’s surface 
was the same kind of interaction as the one that kept the planets 
from flying away from the sun. He told his niece that the idea 
came to him when he saw an apple fall from a tree, which made him 
wonder whether the earth might be affecting the apple and the moon 
in the same way. Up until now, we’ve generally been dealing with 
gravitational interactions between objects that are small compared 
to the distances between them, but that assumption doesn’t apply to 
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the apple. A kilogram of dirt a few feet under his garden in England 
would interact much more strongly with the apple than a kilogram 
of molten rock deep under Australia, thousands of miles away. Also, 
we know that the earth has some parts that are more dense, and 
some parts that are less dense. The solid crust, on which we live, 
is considerably less dense than the molten rock on which it floats. 
By all rights, the computation of the total gravitational energy of 
the apple should be a horrendous mess. Surprisingly, it turns out to 
be fairly simple in the end. First, we note that although the earth 
doesn’t have the same density throughout, it does have spherical 
symmetry: if we imagine dividing it up into thin concentric shells, 
the density of each shell is uniform. 


Second, it turns out that a uniform spherical shell interacts with 
external masses as if all its mass were concentrated at its center. 


The shell theorem: The gravitational energy of a uniform spher- 
ical shell of mass M interacting with a pointlike mass m outside it 
equals —GMm/s, where s is the center-to-center distance. If mass 
m is inside the shell, then the energy is constant, i.e., the shell’s 
interior gravitational field is zero. 


Proof: Let b be the radius of the shell, h its thickness, and p 
its density. Its volume is then V=(area)(thickness)=47b7h, and 
its mass is M = pV = 4rpb*h. The strategy is to divide the 
shell up into rings as shown in figure j, with each ring extending 
from 6 to 0+ dé. Since the ring is infinitesimally skinny, its en- 
tire mass lies at the same distance, r, from mass m. The width of 
such a ring is found by the definition of radian measure to be w = 
bdé, and its mass is dM = (p)(circumference) (thickness) (width)= 


(p)(27bsin 0)(h)(bd0)=2rpb?h sin dd. The gravitational energy of ss 
the ring interacting with mass m is therefore j/A spherical shell of mass 
CHa M interacts with a pointlike mass 
dU = — m. 
= Snap hint 
Fs 


Integrating both sides, we find the total gravitational energy of the 
shell: 


sin 6d6 


r 


U= -2nGpl hm | 
0 
The integral has a mixture of the variables r and 0, which are related 
by the law of cosines, 
r? = b% + 5? — 2bscos6 ; 


and to evaluate the integral, we need to get everything in terms of ei- 
ther r and dr or 6 and d@. The relationship between the differentials 
is found by differentiating the law of cosines, 


2rdr = 2bs sin dd ; 
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k/The gravitational energy 
of amass mat adistance s from 
the center of a hollow spherical 
shell of mass. 


and since sin @d@ occurs in the integral, the easiest path is to sub- 
stitute for it, and get everything in terms of r and dr: 


9) bh s+b 
y = 27 Gebhm / dp 
s s—b 


ArGpb?hm 


8 
GMm 


Ss 


This was all under the assumption that mass m was on the outside 
of the shell. To complete the proof, we consider the case where it’s 
inside. In this case, the only change is that the limits of integration 
are different: 


2nGpbhm ots 
oo dr 
8 b-—s 
—4nGpbhm 
GMm 
b 


The two results are equal at the surface of the sphere, s = b, 
so the constant-energy part joins continuously onto the 1/s part, 
and the effect is to chop off the steepest part of the graph that we 
would have had if the whole mass M had been concentrated at its 
center. Dropping a mass m from A to B in figure k releases the 
same amount of energy as if mass M had been concentrated at its 
center, but there is no release of gravitational energy at all when 
moving between two interior points like C and D. In other words, 
the internal gravitational field is zero. Moving from C to D brings 
mass m farther away from the nearby side of the shell, but closer 
to the far side, and the cancellation between these two effects turns 
out to be perfect. Although the gravitational field has to be zero 
at the center due to symmetry, it’s much more surprising that it 
cancels out perfectly in the whole interior region; this is a special 
mathematical characteristic of a 1/r interaction like gravity. 


Newton's apple example 17 
Over a period of 27.3 days, the moon travels the circumference 
of its orbit, so using data from Appendix 5, we can calculate its 
speed, and solve the circular orbit condition to determine the 
strength of the earth’s gravitational field at the moon’s distance 
from the earth, g = v?/r = 2.72 x 10-% m/s?, which is 3600 
times smaller than the gravitational field at the earth’s surface. 
The center-to-center distance from the moon to the earth is 60 
times greater than the radius of the earth. The earth is, to a very 
good approximation, a sphere made up of concentric shells, each 
with uniform density, so the shell theorem tells us that its external 
gravitational field is the same as if all its mass was concentrated 
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at its center. We already know that a gravitational energy that 
varies as —1/r is equivalent to a gravitational field proportional to 
1/r?, so it makes sense that a distance that is greater by a fac- 
tor of 60 corresponds to a gravitational field that is 60 x 60=3600 
times weaker. Note that the calculation didn’t require knowledge 
of the earth’s mass or the gravitational constant, which Newton 
didn’t know. 


In 1665, shortly after Newton graduated from Cambridge, the 
Great Plague forced the college to close for two years, and New- 
ton returned to the family farm and worked intensely on scientific 
problems. During this productive period, he carried out this calcu- 
lation, but it came out wrong, causing him to doubt his new theory 
of gravity. The problem was that during the plague years, he was 
unable to use the university’s library, so he had to use a figure for 
the radius of the moon’s orbit that he had memorized, and he for- 
got that the memorized value was in units of nautical miles rather 
than statute miles. Once he realized his mistake, he found that 
the calculation came out just right, and became confident that his 
theory was right after all. ° 


Weighing the earth example 18 
> Once Cavendish had found G = 6.67 x 10-1! J- m/kg? (p. 101, 
example 15), it became possible to determine the mass of the 
earth. By the shell theorem, the gravitational energy of a mass 
m at a distance r from the center of the earth is U = —GMm/r, 
where M is the mass of the earth. The gravitational field is related 
to this by mgdr = dU, or g = (1/m)dU/dr = GM/r?. Solving for 
M, we have 
M =gr?/G 
_ (9.8 m/s?)(6.4 x 10° m)? 
~ 6.67 x 10-11 J- m/kg? 
m2 ‘ kg? 
= 6.0 x 1074 ——_= 
: J-s? 
= 6.0 x 10°4 kg 


Gravity inside the earth example 19 
> The earth is somewhat more dense at greater depths, but as an 
approximation let’s assume it has a constant density throughout. 
How does its internal gravitational field vary with the distance r 
from the center? 


> Let’s write b for the radius of the earth. The shell theorem tell us 
that at a given location r, we only need to consider the mass Mz, 


°Some historians are suspicious that the story of the apple and the mistake 
in conversions may have been fabricated by Newton later in life. The conversion 
incident may have been a way of explaining his long delay in publishing his work, 
which led to a conflict with Leibniz over priority in the invention of calculus. 
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earth 


> 


= B moon 


|1/The actual trajectory’ of 
the Apollo 11 spacecraft, A, and 
the straight-line trajectory, B, 
assumed in the example. 


that is deeper than r. Under the assumption of constant density, 
this mass is related to the total mass of the earth by 
Mer _ e 
M ps , 


and by the same reasoning as in example 18, 


GMzr 
= #2 


g 


so 
7 GMr 


a. 
In other words, the gravitational field interpolates linearly between 
zero at r = 0 and its ordinary surface value at r = b. 


The following example applies the numerical techniques of sec- 
tion 2.2. 


From the earth to the moon example 20 
The Apollo 11 mission landed the first humans on the moon in 
1969. In this example, we’ll estimate the time it took to get to 
the moon, and compare our estimate with the actual time, which 
was 73.0708 hours from the engine burn that took the ship out of 
earth orbit to the engine burn that inserted it into lunar orbit. Dur- 
ing this time, the ship was coasting with the engines off, except 
for a small course-correction burn, which we neglect. More im- 
portantly, we do the calculation for a straight-line trajectory rather 
than the real S-shaped one, so the result can only be expected 
to agree roughly with what really happened. The following data 
come from the original press kit, which NASA has scanned and 
posted on the Web: 

initial altitude 3.363 x 10° m 

initial velocity 1.083 x 104 m/s 
The endpoint of the the straight-line trajectory is a free-fall im- 
pact on the lunar surface, which is also unrealistic (luckily for the 
astronauts). 


The ship’s energy is 


GM.m GMym 3 ge 
r In—T 


E= 


but since everything is proportional to the mass of the ship, m, we 
can divide it out 


E__GMe_GMm 1,» 


m- r a , 
and the energy variables in the program with names like e, k, and 
u are actually energies per unit mass. The program is a straight- 
forward modification of the function time3 on page 93. 
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import math 
def tmoon(vi,ri,rf,n): 
bigg=6.67e-11 # gravitational constant 
me=5.97e24 # mass of earth 
mm=7 .35e22 # mass of moon 
rm=3 .84e8 # earth-moon distance 
r=ri 
v=evi 


dr = (rf-ri)/n 
e=-bigg*me/ri-bigg*mm/(rm-ri)+.5*vi**2 
t=0 
for i in range(n): 
u_old = -bigg*me/r-bigg*mm/(rm-r) 
k_old = e - u_old 
v_old = math.sqrt(2.*k_old) 
r = rtdr 
u = -bigg*me/r-bigg*mm/(rm-r) 


k=e-u 
v = math.sqrt(2.*k) 
v_avg = .5*(v_oldtv) 
dt=dr/v_avg 
t=t+dt 

return t 


>>> re=6.378e6 # radius of earth 

>>> rm=1.74e6 # radius of moon 

>>> ri=ret3.363e5 # retinitial altitude 

>>> rf=3.8e8-rm # earth-moon distance minus rm 

>>> vi=1.083e4 # initial velocity 

>>> print (tmoon(vi,ri,rf,1000)/3600.) # convert seconds to hours 
59.654047441976552 


This is pretty decent agreement, considering the wildly inaccurate 
trajectory assumed. It’s interesting to see how much the duration 
of the trip changes if we increase the initial velocity by only ten 
percent: 


>>> vi=1.2e4 
>>> print (tmoon(vi,ri,rf,1000)/3600.) 
18.177752636111677 


The most important reason for using the lower speed was that 
if something had gone wrong, the ship would have been able to 
whip around the moon and take a “free return” trajectory back to 
the earth, without having to do any further burns. At a higher 
speed, the ship would have had so much kinetic energy that in 
the absence of any further engine burns, it would have escaped 
from the earth-moon system. The Apollo 13 mission had to take a 
free return trajectory after an explosion crippled the spacecraft. 
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m/The WMAP probe’s map 
of the cosmic microwave back- 
ground is like a “baby picture” of 
the universe. 


2.3.6 Evidence for repulsive gravity 


Until recently, physicists thought they understood gravity fairly 
well. Einstein had modified Newton’s theory, but certain character- 
istrics of gravitational forces were firmly established. For one thing, 
they were always attractive. If gravity always attracts, then it is 
logical to ask why the universe doesn’t collapse. Newton had an- 
swered this question by saying that if the universe was infinite in 
all directions, then it would have no geometric center toward which 
it would collapse; the forces on any particular star or planet ex- 
erted by distant parts of the universe would tend to cancel out by 
symmetry. More careful calculations, however, show that Newton’s 
universe would have a tendency to collapse on smaller scales: any 
part of the universe that happened to be slightly more dense than 
average would contract further, and this contraction would result 
in stronger gravitational forces, which would cause even more rapid 
contraction, and so on. 


When Einstein overhauled gravity, the same problem reared its 
ugly head. Like Newton, Einstein was predisposed to believe in a 
universe that was static, so he added a special repulsive term to his 
equations, intended to prevent a collapse. This term was not associ- 
ated with any interaction of mass with mass, but represented merely 
an overall tendency for space itself to expand unless restrained by 
the matter that inhabited it. It turns out that Einstein’s solution, 
like Newton’s, is unstable. Furthermore, it was soon discovered 
observationally that the universe was expanding, and this was in- 
terpreted by creating the Big Bang model, in which the universe’s 
current expansion is the aftermath of a fantastically hot explosion.!° 
An expanding universe, unlike a static one, was capable of being ex- 
plained with Einstein’s equations, without any repulsion term. The 
universe’s expansion would simply slow down over time due to the 
attractive gravitational forces. After these developments, Einstein 
said woefully that adding the repulsive term, known as the cosmo- 
logical constant, had been the greatest blunder of his life. 


Section subsection 6.1.5 presents some of the evidence for the Big Bang. 
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This was the state of things until 1999, when evidence began to 
turn up that the universe’s expansion has been speeding up rather 
than slowing down! The first evidence came from using a telescope 
as a sort of time machine: light from a distant galaxy may have 
taken billions of years to reach us, so we are seeing it as it was far 
in the past. Looking back in time, astronomers saw the universe 
expanding at speeds that were lower, rather than higher. At first 
they were mortified, since this was exactly the opposite of what 
had been expected. The statistical quality of the data was also not 
good enough to constitute ironclad proof, and there were worries 
about systematic errors. The case for an accelerating expansion has 
however been supported by high-precision mapping of the dim, sky- 
wide afterglow of the Big Bang, known as the cosmic microwave 
background. 


So now Einstein’s “greatest blunder” has been resurrected. Since 
we don’t actually know whether or not this self-repulsion of space 
has a constant strength, the term “cosmological constant” has lost 
currency. Nowadays physicists usually refer to the phenomenon as 
“dark energy.” Picking an impressive-sounding name for it should 
not obscure the fact that we know absolutely nothing about the 
nature of the effect or why it exists. 


2.4 Atomic Phenomena 


Variety is the spice of life, not of science. So far this chapter has 
focused on heat energy, kinetic energy, and gravitational energy, 
but it might seem that in addition to these there is a bewildering 
array of other forms of energy. Gasoline, chocolate bars, batteries, 
melting water — in each case there seems to be a whole new type 
of energy. The physicist’s psyche rebels against the prospect of a 
long laundry list of types of energy, each of which would require 
its own equations, concepts, notation, and terminology. The point 
at which we’ve arrived in the study of energy is analogous to the 
period in the 1960’s when a half a dozen new subatomic particles 
were being discovered every year in particle accelerators. It was an 
embarrassment. Physicists began to speak of the “particle zoo,” 
and it seemed that the subatomic world was distressingly complex. 
The particle zoo was simplified by the realization that most of the 
new particles being whipped up were simply clusters of a previously 
unsuspected set of fundamental particles (which were whimsically 
dubbed quarks, a made-up word from a line of poetry by James 
Joyce, “Three quarks for Master Mark.”) The energy zoo can also 
be simplified, and it’s the purpose of this section to demonstrate the 
hidden similarities between forms of energy as seemingly different 
as heat and motion. 
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a/A_ vivid demonstration that 
heat is a form of motion. A small 
amount of boiling water is poured 
into the empty can, which rapidly 
fills up with hot steam. The can 
is then sealed tightly, and soon 
crumples. 


SOOOCOOOL 
00000000 
OGOO0@ 00000 
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b/Random motion of atoms 
in a gas, a liquid, and a solid. 


2.4.1 Heat is kinetic energy. 


What is heat really? Is it an invisible fluid that your bare feet 
soak up from a hot sidewalk? Can one ever remove all the heat from 
an object? Is there a maximum to the temperature scale? 


The theory of heat as a fluid seemed to explain why colder ob- 
jects absorbed heat from hotter ones, but once it became clear that 
heat was a form of energy, it began to seem unlikely that a material 
substance could transform itself into and out of all those other forms 
of energy like motion or light. For instance, a compost pile gets hot, 
and we describe this as a case where, through the action of bacteria, 
chemical energy stored in the plant cuttings is transformed into heat 
energy. The heating occurs even if there is no nearby warmer object 
that could have been leaking “heat fluid” into the pile. 


An alternative interpretation of heat was suggested by the theory 
that matter is made of atoms. Since gases are thousands of times less 
dense than solids or liquids, the atoms (or clusters of atoms called 
molecules) in a gas must be far apart. In that case, what is keeping 
all the air molecules from settling into a thin film on the floor of the 
room in which you are reading this book? The simplest explanation 
is that they are moving very rapidly, continually ricocheting off of 
the floor, walls, and ceiling. Though bizarre, the cloud-of-bullets 
image of a gas did give a natural explanation for the surprising 
ability of something as tenuous as a gas to exert huge forces. 


The experiment shown in figure a, for instance, can be explained 
as follows. The high temperature of the steam is interpreted as a 
high average speed of random motions of its molecules. Before the 
lid was put on the can, the rapidly moving steam molecules pushed 
their way out of the can, forcing the slower air molecules out of the 
way. As the steam inside the can thinned out, a stable situation 
was soon achieved, in which the force from the less dense steam 
molecules moving at high speed balanced against the force from the 
more dense but slower air molecules outside. The cap was put on, 
and after a while the steam inside the can began to cool off. The 
force from the cooler, thin steam no longer matched the force from 
the cool, dense air outside, and the imbalance of forces crushed the 
can. 


This type of observation leads naturally to the conclusion that 
hotter matter differs from colder in that its atoms’ random motion is 
more rapid. In a liquid, the motion could be visualized as people in a 
milling crowd shoving past each other more quickly. In a solid, where 
the atoms are packed together, the motion is a random vibration of 
each atom as it knocks against its neighbors. 


We thus achieve a great simplification in the theory of heat. Heat 
is simply a form of kinetic energy, the total kinetic energy of random 
motion of all the atoms in an object. With this new understanding, 
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it becomes possible to answer at one stroke the questions posed 
at the beginning of the section. Yes, it is at least theoretically 
possible to remove all the heat from an object. The coldest possible 
temperature, known as absolute zero, is that at which all the atoms 
have zero velocity, so that their kinetic energies, K = (1/2)mv7, are 
all zero. No, there is no maximum amount of heat that a certain 
quantity of matter can have, and no maximum to the temperature 
scale, since arbitrarily large values of v can create arbitrarily large 
amounts of kinetic energy per atom. 


The kinetic theory of heat also provides a simple explanation of 
the true nature of temperature. Temperature is a measure of the 
amount of energy per molecule, whereas heat is the total amount of 
energy possessed by all the molecules in an object. 


There is an entire branch of physics, called thermodynamics, 
that deals with heat and temperature and forms the basis for tech- 
nologies such as refrigeration. Thermodynamics is discussed in more 
detail in chapter 5, and I’ve provided here only a brief overview of 
the thermodynamic concepts that relate directly to energy. 


2.4.2 All energy comes from particles moving or interacting. 


If I stretch the spring in figure c and then release it, it snaps taut 
again. The creation of some kinetic energy shows that there must 
have been some other form of energy that was destroyed. What was 
it? 

We could just invent a new type of energy called “spring energy,” 
study its behavior, and call it quits, but that would be ugly. Are 
we going to have to invent a new forms of energy like this, over 
and over? No: the title of this book doesn’t lie, and physics really 
is fundamentally simple. As shown in figure d, when we bend or 
stretch an object, we’re really changing the distances between the 
atoms, resulting in a change in electrical energy. Electrical energy 
isn’t really our topic right now — that’s what most of the second 
half of this book is about — but conceptually it’s very similar to 
gravitational energy. Like gravitational energy, it depends on 1/r, 
although there are some interesting new phenomena, such as the 
existence of both attraction and repulsion, which doesn’t occur with 
gravity because gravitational mass can’t be negative. The real point 
is that all the apparently dissimilar forms of energy in figure d turn 
out to be due to electrical interactions among atoms. Even if we 
wish to include nuclear reactions (figure e) in the picture, there still 
turn out to be only four fundamental types of energy: 


kinetic energy (including heat) 
gravitational energy 

electrical and magnetic energy 
nuclear energy 


Astute students have often asked me how light fits into this pic- 


c/The spring’s energy is re- 
ally due to electrical interactions 
among atoms. 


RE Iokes 
Seep em 33988 


hemical 
° 
O . reactions o®, 


d/All these energy  transfor- 
mations turn out at the atomic 
level to be due to changes in the 


distances between atoms that 
interact electrically. 
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nuclear 
reactions 


e/This figure looks — similar 
to the previous ones, but the 
scale is a million times smaller. 
The little balls are the neutrons 
and protons that make up the 
tiny nucleus at the center of a 
uranium atom. When the nucleus 
splits (fissions), the source of the 
kinetic energy is partly electrical 
and partly nuclear. 


ture. This is a very good question, and in fact it could be argued 
that it is the basic question that led to Einstein’s theory of relativ- 
ity as well as the modern quantum picture of nature. Since these 
are topics for the second half of the book, we’ll have to be content 
with half an answer at this point. For now, we may think of light 
energy as a form of kinetic energy, but one calculated not according 
to (1/2)mv? but by some other equation. (We know that (1/2)mv? 
would not make sense, because light has no mass, and furthermore, 
high-energy beams of light do not differ in speed from low-energy 
ones. ) 


Temperature during boiling example 21 

> If you stick a thermometer in a pan of water, and watch the 
temperature as you bring the water to a boil, you’ll notice an in- 
teresting fact. The temperature goes up until the boiling point is 
reached, but then stays at 100°C during the whole time the water 
is being boiled off. The temperature of the steam is also 100°C. 
Why does the temperature “stick” like this? What’s happening to 
all the energy that the stove’s burner is putting into the pan? 


> As shown in figure d, boiling requires an increase in electrical 
energy, because the atoms coming out as gas are moving away 
from the other atoms, which attract them electrically. It is only this 
electrical energy that is increasing, not the atoms’ kinetic energy, 
which is what the thermometer can measure. 


Diffusion example 22 
> A drop of food coloring in a cup of water will gradually spread 
out, even if you don’t do any mixing with a spoon. This is called 
diffusion. Why would this happen, and what effect would temper- 
ature have? What would happen with solids or gases? 


> Figure b shows that the atoms in a liquid mingle because of 
their random thermal motion. Diffusion is slow (typically on the 
order of a centimeter a minute), despite the high speeds of the 
atoms (typically hundreds of miles per hour). This is due to the 
randomness of the motion: a particular atom will take a long time 
to travel any significant distance, because it doesn’t travel in a 
straight line. 


Based on this picture, we expect that the speed of diffusion should 
increase as a function of temperature, and experiments show that 
this is true. 


Diffusion also occurs in gases, which is why you can smell things 
even when the air is still. The soeeds are much faster, because 
the typical distance between collisions is much longer than in a 
liquid. 

We can see from figure b that diffusion won't occur in solids, be- 
cause each atom vibrates around an equilibrium position. 
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Discussion Questions 


A _— I’mnot making this up. XS Energy Drink has ads that read like this: 
All the “Energy” ... Without the Sugar! Only 8 Calories!” Comment on 
this. 


2.5 Oscillations 


Let’s revisit the example of the stretched spring from the previ- 
ous section. We know that its energy is a form of electrical energy 
of interacting atoms, which is nice conceptually but doesn’t help 
us to solve problems, since we don’t know how the energy, U, de- 
pends on the length of the spring. All we know is that there’s an 
equilibrium (figure a/1), which is a local minimum of the function 
U. An extremely important problem which arises in this connec- 
tion is how to calculate oscillatory motion around an equilibrium, 
as in a/4-13. Even if we did special experiments to find out how the 
spring’s energy worked, it might seem like we’d have to go through 
just as much work to deal with any other kind of oscillation, such 
as a sapling swinging back and forth in the breeze. 


Surprisingly, it’s possible to analyze this type of oscillation in a 
very general and elegant manner, as long as the analysis is limited 
to small oscillations. We’ll talk about the mass on the spring for 
concreteness, but there will be nothing in the discussion at all that 
is restricted to that particular physical system. First, let’s choose 
a coordinate system in which x = 0 corresponds to the position of 
the mass where the spring is in equilibrium, and since interaction 
energies like U are only well defined up to an additive constant, we’ll 
simply define it to be zero at equilibrium: 


U(0) =0 


Since x = 0 is an equilibrium, U(x) must have a local minimum 
there, and a differentiable function (which we assume U is) has a 
zero derivative at a local minimum: 


dU 
ae 


There are still infinitely many functions that could satisfy these 
criteria, including the three shown in figure b, which are x?/2, 
ax? /2(1+27), and (e” + e~** — 2)/18. Note, however, how all three 
functions are virtually identical right near the minimum. That’s 
because they all have the same curvature. More specifically, each 
function has its second derivative equal to 1 at x = 0, and the sec- 
ond derivative is a measure of curvature. We write k for the second 
derivative of the energy at an equilibrium point, 


eu 
dx? 


(0) =k 
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a/The spring has a minimum- 
energy length, 1, and energy is 
required in order to compress or 
stretch it, 2 and 3. A mass at- 
tached to the spring will oscillate 
around the equilibrium, 4-13. 
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b/ Three functions with the 
same curvature at x=0. 


CHT MINTI 


>| >| 

AA 
c/The amplitude would usu- 
ally be defined as the distance 
from equilibrium to one extreme 
of the motion, i.e., half the total 
travel. 


Physically, k is a measure of stiffness. For example, the heavy-duty 
springs in a car’s shock absorbers would have a high value of k. It 
is often referred to as the spring constant, but we’re only using a 
spring as an example here. As shown in figure b, any two functions 
that have U(0) = 0, dU/dx = 0, and d?U/dx? = k, with the same 
value of k, are virtually indistinguishable for small values of x, so if 
we want to analyze small oscillations, it doesn’t even matter which 
function we assume. For simplicity, we'll just use U(a) = (1/2)ka? 
from now on. 


Now we're ready to analyze the mass-on-a-spring system, while 
keeping in mind that it’s really only a representative example of 
a whole class of similar oscillating systems. We expect that the 
motion is going to repeat itself over and over again, and since we’re 
not going to include frictional heating in our model, that repetition 
should go on forever without dying out. The most interesting thing 
to know about the motion would be the period, T, which is the 
amount of time required for one complete cycle of the motion. We 
might expect that the period would depend on the spring constant, 
k, the mass, m, and and the amplitude, A, defined in figure c.!4 


In examples like the brachistochrone and the Apollo 11 mission, 
it was generally necessary to use numerical techniques to determine 
the amount of time required for a certain motion. Once again, let’s 
dust off the time3 function from page 93 and modify it for our 
purposes. For flexibility, we’ll define the function U(x) as a separate 
Python function. We really want to calculate the time required for 
the mass to come back to its starting point, but that would be 
awkward to set up, since our function works by dividing up the 
distance to be traveled into tiny segments. By symmetry, the time 
required to go from one end to the other equals the time required 
to come back to the start, so we’ll just calculate the time for half a 
cycle and then double it when we return the result at the end of the 
function. The test at lines 16-19 is necessary because otherwise at 
the very end of the motion we can end up trying to take the square 
root of a negative number due to rounding errors. 


11Many kinds of oscillations are possible, so there is no standard definition of 
the amplitude. For a pendulum, the natural definition would be in terms of an 
angle. For a radio transmitter, we’d use some kind of electrical units. 
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import math 
def u(k,x): 
return .5*k*x**2 


def osc(m,k,a,n): 

x=a 

v=0 

dx = -2.*a/n 

t=0 

e = u(k,x)+.5*m*v**2 

for i in range(n): 
x_old = x 
v_old =v 
x = xt+dx 
kinetic = e-u(k,x) 
if kinetic<o. 

v=0. 

print "warning, K=",kinetic,"<0" 
else : 

v = -math.sqrt(2.*kinetic/m) 
v_avg = (v+v_old)/2. 
dt=dx/v_avg 
t=t+dt 

return 2.¥*t 


>>> print(osc(1.,1.,1.,100000) ) 
warning, K= -1.43707268307e-12 <0 
6 .2831854132667919 


The first thing to notice is that with this particular set of inputs 
(m=1 kg, k = 1 J/m?, and A = 1 m), the program has done an ex- 
cellent job of computing 27 = 6.2831853.... This is Mother Nature 
giving us a strong hint that the problem has an algebraic solution, 
not just a numerical one. The next interesting thing happens when 
we change the amplitude from 1 m to 2 m: 


>>> print(osc(1.,1.,2.,100000) ) 
warning, K= -5.7482907323e-12 <0 
6 .2831854132667919 


Even though the mass had to travel double the distance in each 
direction, the period is the same to within the numerical accuracy 
of the calculation! 


With these hints, it seems like we should start looking for an 
algebraic solution. For guidance, here’s a graph of x as a function 
of t, as calculated by the osc function with n=10. 
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d/Example 23. The rod piv- 
ots on the hinge at the bottom. 


ae 
3 


a 

This looks like a cosine function, so let’s see if a x = Acos (wt+0) 
is a solution to the conservation of energy equation — it’s not un- 
common to try to “reverse-engineer” the cryptic results of a numer- 
ical calculation like this. The symbol w = 27/T (Greek omega), 
called angular frequency, is a standard symbol for the number of 
radians per second of oscillation. Except for the factor of 27, it is 
identical to the ordinary frequency f = 1/T, which has units of s~! 
or Hz (Hertz). The phase angle 6 is to allow for the possibility that 
t = 0 doesn’t coincide with the beginning of the motion. The energy 
is 


E=kK+U 


1 1 
= rue a she 


t fda\? . Te 
=5"(Z) ad 


m[—Aw sin (wt + 5)]? + sh [A cos (wt + 6)]? 


A? [mw sin ?(wt + 6) + k cos? (wt + 6)] 


According to conservation of energy, this has to be a constant. Using 
the identity sin? + cos? = 1, we can see that it will be a constant if 
we have mw” = k, or w = \/k/m, ie., T = 20,/m/k. Note that the 
period is independent of amplitude. 


A spring and a lever example 23 
> What is the period of small oscillations of the system shown in 
the figure? Neglect the mass of the lever and the spring. Assume 
that the spring is so stiff that gravity is not an important effect. 
The spring is relaxed when the lever is vertical. 


> This is a little tricky, because the spring constant k, although 
it is relevant, is not the k we should be putting into the equation 
T = 2n,/m/k. The k that goes in there has to be the second 
derivative of U with respect to the position, x, of the mass that’s 
moving. The energy U stored in the spring depends on how far 
the tip of the lever is from the center. This distance equals (L/b)x, 
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so the energy in the spring is 


ye ete 


and the k we have to put in T = 27,/m/k is 
dx2 sb 
The result is 


The leverage of the lever makes it as if the spring was stronger, 
and decreases the period of the oscillations by a factor of b/L. 


Water in a U-shaped tube example 24 
> What is the period of oscillation of the water in figure e? 


> In example 13 on p. 89, we found U(y) = pgAy?, so the “spring 
constant,” which really isn’t a spring constant here at all, is 

_ dU 

tay 

= 209A 
This is an interesting example, because k can be calculated with- 
out any approximations, but the kinetic energy requires an ap- 
proximation, because we don’t know the details of the pattern 
of flow of the water. It could be very complicated. There will 
be a tendency for the water near the walls to flow more slowly 
due to friction, and there may also be swirling, turbulent motion. e / Water in a U-shaped tube. 
However, if we make the approximation that all the water moves 
with the same velocity as the surface, dy/dt, then the mass-on- 
a-spring analysis applies. Letting L be the total length of the filled 
part of the tube, the mass is pLA, and we have 


T = 2n\/m/k 


oLA 
2pgA 


IL 


This chapter is summarized on page 891. Notation and terminology 
are tabulated on pages 885-886. 
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Problems 
The symbols V, “, etc. are explained on page 125. 


1 Experiments show that the power consumed by a boat’s engine 
is approximately proportional to the third power of its speed. (We 
assume that it is moving at constant speed.) 
(a) When a boat is cruising at constant speed, what type of energy 
transformation do you think is being performed? 
(b) If you upgrade to a motor with double the power, by what factor 
is your boat’s maximum cruising speed increased? 

> Solution, p. 877 


2 Object A has a kinetic energy of 13.4 J. Object B has a mass 
that is greater by a factor of 3.77, but is moving more slowly by a 
factor of 2.34. What is object B’s kinetic energy? 

> Solution, p. 877 


3 My 1.25 kW microwave oven takes 126 seconds to bring 250 
g of water from room temperature to a boil. What percentage of 
the power is being wasted? Where might the rest of the energy be 
going? > Solution, p. 877 & 


4 The multiflash photograph shows a collision between two pool 
balls. The ball that was initially at rest shows up as a dark image 
in its initial position, because its image was exposed several times 
before it was struck and began moving. By making measurements 
on the figure, determine numerically whether or not energy appears 
to have been conserved in the collision. What systematic effects 
would limit the accuracy of your test? [From an example in PSSC 
Physics. } 


Problem 4. 
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5 A grasshopper with a mass of 110 mg falls from rest from a 

height of 310 cm. On the way down, it dissipates 1.1 mJ of heat due 

to air resistance. At what speed, in m/s, does it hit the ground? 
> Solution, p. 877 & 


6 A ball rolls up a ramp, turns around, and comes back down. 
When does it have the greatest gravitational energy? The greatest 
kinetic energy? [Based on a problem by Serway and Faughn.] & 


7 (a) You release a magnet on a tabletop near a big piece of iron, 
and the magnet leaps across the table to the iron. Does the magnetic 
energy increase, or decrease? Explain. (b) Suppose instead that you 
have two repelling magnets. You give them an initial push towards 
each other, so they decelerate while approaching each other. Does 
the magnetic energy increase, or decrease? Explain. z 


8 Estimate the kinetic energy of an Olympic sprinter. 


9 You are driving your car, and you hit a brick wall head on, 
at full speed. The car has a mass of 1500 kg. The kinetic energy 
released is a measure of how much destruction will be done to the car 
and to your body. Calculate the energy released if you are traveling 
at (a) 40 mi/hr, and again (b) if you’re going 80 mi/hr. What is 
counterintuitive about this, and what implication does this have for 
driving at high speeds? v 


10 A closed system can be a bad thing — for an astronaut 
sealed inside a space suit, getting rid of body heat can be difficult. 
Suppose a 60-kg astronaut is performing vigorous physical activity, 
expending 200 W of power. If none of the heat can escape from her 
space suit, how long will it take before her body temperature rises 
by 6°C (11°F), an amount sufficient to kill her? Assume that the 
amount of heat required to raise her body temperature by 1°C is 
the same as it would be for an equal mass of water. Express your 
answer in units of minutes. vo 


11 The following table gives the amount of energy required in 


order to heat, melt, or boil a gram of water. 
heat 1 g of ice by 1°C 2.05 J 


melt 1 g of ice 333 J 
heat 1 g of liquid by 19°C 4.19 J 
boil 1 g of water 2500 J 


heat 1 g of steam by 1°C 2.01 J 
(a) How much energy is required in order to convert 1.00 g of ice at 


-20 °C into steam at 137 °C? v 
(b) What is the minimum amount of hot water that could melt 1.00 
g of ice? vio 


12 Anya climbs to the top of a tree, while Ivan climbs half-way 
to the top. They both drop pennies to the ground. Compare the 
kinetic energies and velocities of the pennies on impact, using ratios. 


_ 
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Problem 16. 


m2 


m4 


Problem 17. 


Problem 18. 


Problem 19. 


13 Anya and Ivan lean over a balcony side by side. Anya throws 
a penny downward with an initial speed of 5 m/s. Ivan throws 
a penny upward with the same speed. Both pennies end up on 
the ground below. Compare their kinetic energies and velocities on 
impact. 


14 (a) A circular hoop of mass m and radius r spins like a wheel 
while its center remains at rest. Let w (Greek letter omega) be the 
number of radians it covers per unit time, i.e., w = 27/T, where 
the period, T, is the time for one revolution. Show that its kinetic 
energy equals (1/2)mw?r?. 
(b) Show that the answer to part a has the right units. (Note 
that radians aren’t really units, since the definition of a radian is a 
unitless ratio of two lengths.) 
(c) If such a hoop rolls with its center moving at velocity v, its 
kinetic energy equals (1/2)mv?, plus the amount of kinetic energy 
found in part a. Show that a hoop rolls down an inclined plane with 
half the acceleration that a frictionless sliding block would have. 
[P= 


15 On page 83, I used the chain rule to prove that the acceler- 
ation of a free-falling object is given by a = —g. In this problem, 
you'll use a different technique to prove the same thing. Assume 
that the acceleration is a constant, a, and then integrate to find 
v and y, including appropriate constants of integration. Plug your 
expressions for v and y into the equation for the total energy, and 
show that a = —g is the only value that results in constant energy. 


~ 


16 The figure shows two unequal masses, m1 and m2, connected 
by a string running over a pulley. Find the acceleration. 
> Hint, p. 863 V 


17 What ratio of masses will balance the pulley system shown 
in the figure? > Hint, p. 863 — 


18 (a) For the apparatus shown in the figure, find the equilib- 
rium angle # in terms of the two masses. v 
(b) Interpret your result in the case of M >> m (M much greater 
than m). Does it make sense physically? 

(c) For what combinations of masses would your result give non- 
sense? Interpret this physically. > Hint, p. 863 & 


19 In the system shown in the figure, the pulleys on the left and 
right are fixed, but the pulley in the center can move to the left or 
right. The two hanging masses are identical, and the pulleys and 
ropes are all massless. Find the upward acceleration of the mass on 
the left, in terms of g only. > Hint, p. 863 V o 


20 Two atoms will interact via electrical forces between their 
protons and electrons. One fairly good approximation to the elec- 
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trical energy is the Lennard-Jones formula, 


vey=«(()°-2(9)] 


where r is the center-to-center distance between the atoms and k is 
a positive constant. Show that (a) there is an equilibrium point at 
r=a, 

(b) the equilibrium is stable, and 

(c) the energy required to bring the atoms from their equilibrium 
separation to infinity is k. > Hint, p. 863 © 


21 The International Space Station orbits at an altitude of about 
360 to 400 km. What is the gravitational field of the earth at this 
altitude? Vv 


22 (a) A geosynchronous orbit is one in which the satellite orbits 
above the equator, and has an orbital period of 24 hours, so that it 
is always above the same point on the spinning earth. Calculate the 
altitude of such a satellite. Vv 
(b) What is the gravitational field experienced by the satellite? Give 
your answer as a percentage in relation to the gravitational field at 
the earth’s surface. > Hint, p. 863 V 


~~ 


23 Astronomers calculating orbits of planets often work in a 
nonmetric system of units, in which the unit of time is the year, 
the unit of mass is the sun’s mass, and the unit of distance is the 
astronomical unit (A.U.), defined as half the long axis of the earth’s 
orbit. In these units, find an exact expression for the gravitational 
constant, G. Vv Oo 


24 The star Lalande 21185 was found in 1996 to have two planets 
in roughly circular orbits, with periods of 6 and 30 years. What is 
the ratio of the two planets’ orbital radii? Vv oO 


25 A projectile is moving directly away from a planet of mass 
M at exactly escape velocity. (a) Find r, the distance from the 
projectile to the center of the planet, as a function of time, t, and 
also find v(t). 
(b) Check the units of your answer. 
(c) Does v show the correct behavior as ¢ approaches infinity? 

> Hint, p. 863 & 


26 The purpose of this problem is to estimate the height of the 
tides. The main reason for the tides is the moon’s gravity, and we’ll 
neglect the effect of the sun. Also, real tides are heavily influenced 
by landforms that channel the flow of water, but we’ll think of the 
earth as if it was completely covered with oceans. Under these 
assumptions, the ocean surface should be a surface of constant U/m. 
That is, a thimbleful of water, m, should not be able to gain or lose 
any gravitational energy by moving from one point on the ocean 
surface to another. If only the spherical earth’s gravity was present, 
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. Earth's orbit “ 


Mars’ orbit 


then we’d have U/m = —GM_/r, and a surface of constant U/m 
would be a surface of constant r, i.e., the ocean’s surface would be 
spherical. Taking into account the moon’s gravity, the main effect is 
to shift the center of the sphere, but the sphere also becomes slightly 
distorted into an approximately ellipsoidal shape. (The shift of the 
center is not physically related to the tides, since the solid part of 
the earth tends to be centered within the oceans; really, this effect 
has to do with the motion of the whole earth through space, and 
the way that it wobbles due to the moon’s gravity.) Determine the 
amount by which the long axis of the ellipsoid exceeds the short 
axis. > Hint, p. 863 & 


27 You are considering going on a space voyage to Mars, in which 
your route would be half an ellipse, tangent to the Earth’s orbit at 
one end and tangent to Mars’ orbit at the other. Your spacecraft’s 
engines will only be used at the beginning and end, not during the 
voyage. How long would the outward leg of your trip last? (The 
orbits of Earth and Mars are nearly circular, and Mars’s is bigger 
by a factor of 1.52.) vo 


28 When you buy a helium-filled balloon, the seller has to inflate 
it from a large metal cylinder of the compressed gas. The helium 
inside the cylinder has energy, as can be demonstrated for example 
by releasing a little of it into the air: you hear a hissing sound, 
and that sound energy must have come from somewhere. The total 
amount of energy in the cylinder is very large, and if the valve is 
inadvertently damaged or broken off, the cylinder can behave like 
bomb or a rocket. 


Suppose the company that puts the gas in the cylinders prepares 
cylinder A with half the normal amount of pure helium, and cylinder 
B with the normal amount. Cylinder B has twice as much energy, 
and yet the temperatures of both cylinders are the same. Explain, at 
the atomic level, what form of energy is involved, and why cylinder 
B has twice as much. & 


29 Explain in terms of conservation of energy why sweating 
cools your body, even though the sweat is at the same temperature 
as your body. Describe the forms of energy involved in this energy 
transformation. Why don’t you get the same cooling effect if you 
wipe the sweat off with a towel? Hint: The sweat is evaporating. 


30 [This problem is now problem 3-73.| & 


31 All stars, including our sun, show variations in their light 
output to some degree. Some stars vary their brightness by a factor 
of two or even more, but our sun has remained relatively steady dur- 
ing the hundred years or so that accurate data have been collected. 
Nevertheless, it is possible that climate variations such as ice ages 
are related to long-term irregularities in the sun’s light output. If 
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the sun was to increase its light output even slightly, it could melt 
enough Antarctic ice to flood all the world’s coastal cities. The total 
sunlight that falls on Antarctica amounts to about 1 x 10!° watts. 
In the absence of natural or human-caused climate change, this heat 
input to the poles is balanced by the loss of heat via winds, ocean 
currents, and emission of infrared light, so that there is no net melt- 
ing or freezing of ice at the poles from year to year. Suppose that 
the sun changes its light output by some small percentage, but there 
is no change in the rate of heat loss by the polar caps. Estimate the 
percentage by which the sun’s light output would have to increase 
in order to melt enough ice to raise the level of the oceans by 10 me- 
ters over a period of 10 years. (This would be enough to flood New 
York, London, and many other cities.) Melting 1 kg of ice requires 
S510? a. B 


32 The figure shows the oscillation of a microphone in response 
to the author whistling the musical note “A.” The horizontal axis, 
representing time, has a scale of 1.0 ms per square. Find the period 
T, the frequency f, and the angular frequency w. Vv 


33 (a) A mass m is hung from a spring whose spring constant is 
k;. Write down an expression for the total interaction energy of the 
system, U, and find its equilibrium position. > Hint, p. 863 
(b) Explain how you could use your result from part a to determine 
an unknown spring constant. & 


34 A certain mass, when hung from a certain spring, causes 
the spring to stretch by an amount h compared to its equilibrium 
length. If the mass is displaced vertically from this equilibrium, it 
will oscillate up and down with a period Tosc.. Give a numerical 
comparison between Tos- and Tyqi, the time required for the mass 
to fall from rest through a height h, when it isn’t attached to the 
spring. (You will need the result of problem 33). vo 


35 Find the period of vertical oscillations of the mass m. The 
spring, pulley, and ropes have negligible mass. 
> Hint, p. 863 V 


36 The equilibrium length of each spring in the figure is b, so 
when the mass m is at the center, neither spring exerts any force 
on it. When the mass is displaced to the side, the springs stretch; 
their spring constants are both k. 

(a) Find the energy, U, stored in the springs, as a function of y, the 
distance of the mass up or down from the center. Vv 
(b) Show that the period of small up-down oscillations is infinite. 


= 
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37 Two springs with spring constants kj and kg are put to- 
gether end-to-end. Let x; be the amount by which the first spring 
is stretched relative to its equilibrium length, and similarly for x. If 
the combined double spring is stretched by an amount 0 relative to 
its equilibrium length, then 6 = x; + x. Find the spring constant, 
K, of the combined spring in terms of k; and ko. 

> Hint, p. 864 > Answer, p. 874. V 


~ 


38 A mass m on a spring oscillates around an equilibrium at 
x = 0. Any function U(x) with an equilibrium at x = 0 can be 
approximated as U(a) = (1/2)kx?, and if the energy is symmet- 
ric with respect to positive and negative values of x, then the next 
level of improvement in such an approximation would be U(x) = 
(1/2)kx? + bx*. The general idea here is that any smooth function 
can be approximated locally by a polynomial, and if you want a bet- 
ter approximation, you can use a polynomial with more terms in it. 
When you ask your calculator to calculate a function like sin or e”, 
it’s using a polynomial approximation with 10 or 12 terms. Physi- 
cally, a spring with a positive value of b gets stiffer when stretched 
strongly than an “ideal” spring with b = 0. A spring with a nega- 
tive 6 is like a person who cracks under stress — when you stretch 
it too much, it becomes more elastic than an ideal spring would. 
We should not expect any spring to give totally ideal behavior no 
matter no matter how much it is stretched. For example, there has 
to be some point at which it breaks. 


Do a numerical simulation of the oscillation of a mass on a spring 
whose energy has a nonvanishing b. Is the period still independent 
of amplitude? Is the amplitude-independent equation for the period 
still approximately valid for small enough amplitudes? Does the 
addition of a positive x* term tend to increase the period, or decrease 
it? Include a printout of your program and its output with your 
homework paper. z 


39 An idealized pendulum consists of a pointlike mass m on the 
end of a massless, rigid rod of length L. Its amplitude, 6, is the angle 
the rod makes with the vertical when the pendulum is at the end 
of its swing. Write a numerical simulation to determine the period 
of the pendulum for any combination of m, L, and 6. Examine the 
effect of changing each variable while manipulating the others. = 


40 A ball falls from a height h. Without air resistance, the 
time it takes to reach the floor is \/2h/g. A numerical version 
of this calculation was given in program time2 on page 92. Now 
suppose that air resistance is not negligible. For a smooth sphere of 
radius r, moving at speed v through air of density p, the amount of 
energy dQ dissipated as heat as the ball falls through a height dy is 
given (ignoring signs) by dQ = (1/4)pv2r2dy. Modify the program 
to incorporate this effect, and find the resulting change in the fall 
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time in the case of a 21 g ball of radius 1.0 cm, falling from a height 
of 1.0 m. The density of air at sea level is about 1.2 kg/m?. Turn in 
a printout of both your program and its output. Answer: 0.34 ms. 


— 


41 The factorial of an integer n, written n!, is defined as the 
product of all the positive integers less than or equal to n. For 
example, 3! = 1x2x3=6. Write a Python program to compute the 
factorial of a number. Test it with a small number whose factorial 
you can check by hand. Then use it to compute 30!. (Python 
computes integer results with unlimited precision, so you won’t get 
any problems with rounding or overflows.) Turn in a printout of 
your program and its output, including the test. z 


Key to symbols: 
easy . typical challenging Mdifficult © very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 
Exercise 2A: Reasoning with Ratios and Powers 
Equipment: 

ping-pong balls and paddles 

two-meter sticks 


You have probably bounced a ping pong ball straight up and down in the air. The time between 
hits is related to the height to which you hit the ball. If you take twice as much time between 
hits, how many times higher do you think you will have to hit the ball? Write down your 
hypothesis:___________-__ 


Your instructor will first beat out a tempo of 240 beats per minute (four beats per second), 
which you should try to match with the ping-pong ball. Measure the height to which the ball 


Now try it at 120 beats per minute:________ 
Compare your hypothesis and your results with the rest of the class. 
Exercise 2B: The Shell Theorem 


This exercise is an approximate numerical test of the shell theorem. There are seven masses 
A-G, each being one kilogram. Masses A-E, each one meter from the center, form a shape like 
two Egyptian pyramids joined at their bases; this is a rough approximation to a six-kilogram 
spherical shell of mass. Mass G is five meters from the center of the main group. The class will 
divide into six groups and split up the work required in order to calculate the total gravitational 
energy of mass G. 


A 
© 
: a 
1 & 
Cc “— onge? ©; sedodenns te peniest cede tee denspe shades tse © 
Ee 6 
© 
B 


1. Each group should write its results on the board in units of picojoules, retaining six significant 
figures of precision. 


2. The class will add the results and compare with the result that would be obtained with the 
shell theorem. 
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Exercise 2C: Vibrations 

Equipment: 
e air track and carts of two different masses 
e springs 


e weights 


spring 
GOTO RO OO) ORONO) 


Place the cart on the air track and attach springs so that it can vibrate. 


1. Test whether the period of vibration depends on amplitude. Try at least one moderate 
amplitude, for which the springs do not go slack, at least one amplitude that is large enough so 
that they do go slack, and one amplitude that’s the very smallest you can possibly observe. 


2. Try a cart with a different mass. Does the period change by the expected factor, based on 
the equation T = 27,\/m/k? 

3. In homework problem 33 on page 123, you showed that a spring’s spring constant can be 
determined by hanging a weight from it. Use this technique to find the spring constant of each 
of the two springs. The equivalent spring constant of these two springs, attached to the cart in 
this way, can be found by adding their spring constants. 


4. Test the equation T = 27,/m/k numerically. 
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Pa TSE Lo... oN Forces transfer momentum to the 
2, URRY Mea er girl. 


Chapter 3 
Conservation of Momentum 


I think, therefore I am. 


T hope that posterity will judge me kindly, not only as to the things 
which I have explained, but also to those which I have intentionally 
omitted so as to leave to others the pleasure of discovery. 


René Descartes 
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a/ Systems consisting of material 
particles that interact through an 


energy U(r). Top: The galaxy 
M100. Here the “particles” are 
stars. Middle: The pool balls 
don’t interact until they come to- 
gether and become compressed; 
the energy U(r) has a sharp 
upturn when the center-to-center 
distance r gets small enough 
for the balls to be in contact. 
Bottom: A uranium nucleus 
undergoing fission. The energy 
U(r) has a repulsive contribution 
from the electrical interactions 
of the protons, plus an attractive 
one due to the strong nuclear 
interaction. (M100: Hubble Space 
Telescope image.) 


3.1 Momentum In One Dimension 


3.1.1 Mechanical momentum 


In the martial arts movie Crouching Tiger, Hidden Dragon, those 
who had received mystical enlightenment are able to violate the laws 
of physics. Some of the violations are obvious, such as their ability 
to fly, but others are a little more subtle. The rebellious young 
heroine/antiheroine Jen Yu gets into an argument while sitting at a 
table in a restaurant. A young tough, Iron Arm Lu, comes running 
toward her at full speed, and she puts up one arm and effortlessly 
makes him bounce back, without even getting out of her seat or 
bracing herself against anything. She does all this between bites. 


Although kinetic energy doesn’t depend on the direction of mo- 
tion, we’ve already seen on page 89 how conservation of energy com- 
bined with Galilean relativity allows us to make some predictions 
about the direction of motion. One of the examples was a demon- 
stration that it isn’t possible for a hockey puck to spontaneously re- 
verse its direction of motion. In the scene from the movie, however, 
the woman’s assailant isn’t just gliding through space. He’s inter- 
acting with her, so the previous argument doesn’t apply here, and 
we need to generalize it to more than one object. We consider the 
case of a physical system composed of pointlike material particles, 
in which every particle interacts with every other particle through 
an energy U(r) that depends only on the distance r between them. 
This still allows for a fairly general mechanical system, by which I 
mean roughly a system made of matter, not light. The characters 
in the movie are made of protons, neutrons, and electrons, so they 
would constitute such a system if the interactions among all these 
particles were of the form U(r).! We might even be able to get 
away with thinking of each person as one big particle, if it’s a good 
approximation to say that every part of each person’s whole body 
moves in the same direction at the same speed. 


The basic insight can be extracted from the special case where 
there are only two particles interacting, and they only move in one 
dimension, as in the example shown in figure b. Conservation of 
energy says 


Kyi t+ Ko + Ui = Kig + Kop + Us 


For simplicity, let’s assume that the interactions start after the time 
we’re calling initial, and end before the instant we choose as final. 
This is true in figure b, for example. Then U; = Uy, and we can 
subtract the interaction energies from both sides, giving. 


Kyi + Koi = Kip + Kop 


2,1 2 1 2 1 2 
ar = 322i = gery =F gatas 


‘Electrical and magnetic interactions don’t quite behave like this, which is a 
point we’ll take up later in the book. 
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As in the one-particle argument on page 89, the trick is to require 
conservation of energy not just in one particular frame of reference, 
but in every frame of reference. In a frame of reference moving at 
velocity u relative to the first one, the velocities all have u added 
onto them:? 


1 1 1 
57 (Ui by)? + 5ma(vai tu)” = aim (vif t+u)*+ a ia(voy-+u)” 
When we square a quantity like (v1; + u)°, we get the same vj, that 
occurred in the original frame of reference, plus two u-dependent 
terms, 2v;;u + u?. Subtracting the original conservation of energy 
equation from the version in the new frame of reference, we have 


MV + M2QVYyjU = M1V1 FU + MQU2FU . 
or, dividing by u, 


MVE + M2Qvai = M1 VF + M2QvIF 


This is a statement that when you add up mv for the whole system, 
that total remains constant over time. In other words, this is a 
conservation law. The quantity mv is called momentum, notated p 
for obscure historical reasons. Its units are kg - m/s. 


Unlike kinetic energy, momentum depends on the direction of 
motion, since the velocity is not squared. In one dimension, motion 
in the same direction as the positive x axis is represented with posi- 
tive values of v and p. Motion in the opposite direction has negative 
v and p. 


Jen Yu meets Iron Arm Lu example 1 
> Initially, Jen Yu is at rest, and lron Arm Lu is charging to the left, 
toward her, at 5 m/s. Jen Yu’s mass is 50 kg, and Lu’s is 100 kg. 
After the collision, the movie shows Jen Yu still at rest, and Lu 
rebounding at 5 m/s to the right. Is this consistent with the laws 
of physics, or would it be impossible in real life? 


> This is perfectly consistent with conservation of mass (50 kg+100 
kg=50 kg+100 kg), and also with conservation of energy, since 
neither person’s kinetic energy changes, and there is therefore 
no change in the total energy. (We don’t have to worry about 
interaction energies, because the two points in time we're con- 
sidering are ones at which the two people aren't interacting.) To 
analyze whether the scene violates conservation of momentum, 
we have to pick a coordinate system. Let’s define positive as 


?We can now see that the derivation would have been equally valid for U; 4 
Uy. The two observers agree on the distance between the particles, so they 
also agree on the interaction energies, even though they disagree on the kinetic 
energies. 
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b/A_ collision between’ two 
pool balls is seen in two different 
frames of reference. The solid 
ball catches up with the striped 
ball. Velocities are shown with 
arrows. The second observer is 
moving to the left at velocity u 
compared to the first observer, 
so all the velocities in the second 
frame have u added onto them. 
The two observers must agree on 
conservation of energy. 
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c/The ion drive engine of 
the NASA Deep Space 1 probe, 
shown under construction (top) 
and being tested in a vacuum 
chamber (bottom) prior to its 
October 1998 launch. Intended 
mainly as a test vehicle for 
new technologies, the craft 
nevertheless also carried out a 
scientific program that included 
a rendezvous with a comet in 
2004.(NASA) 


‘A cannon 


‘lon drive 


being to the right. The initial momentum is (50 kg)(O m/s)+(100 
kg)(—5 m/s)=—500 kg - m/s, and the final momentum is (50 kg)(0 
m/s)+(100 kg)(5 m/s)=500 kg - m/s. This is a change of 1000 
kg - m/s, which is impossible if the two people constitute a closed 
system. 


One could argue that they’re not a closed system, since Lu might 
be exchanging momentum with the floor, and Jen Yu might be 
exchanging momentum with the seat of her chair. This is a rea- 
sonable objection, but in the following section we'll see that there 
are physical reasons why, in this situation, the force of friction 
would be relatively weak, and would not be able to transfer that 
much momentum in a fraction of a second. 


This example points to an intuitive interpretation of conserva- 


tion of momentum, which is that interactions are always mutual. 
That is, Jen Yu can’t change Lu’s momentum without having her 
own momentum changed as well. 


example 2 
> A cannon of mass 1000 kg fires a 10-kg shell at a velocity of 
200 m/s. At what speed does the cannon recoil? 


> The law of conservation of momentum tells us that 


Pcannon,i + Pshell,i = Pcannon,f + Pshell,f 


Choosing a coordinate system in which the cannon points in the 
positive direction, the given information is 


Pcannon, i =0 
pshell,i =0 
Pshell,t = 2000 kg - m/s 


We must have Peannon,f = —2000 kg - m/s, so the recoil velocity of 
the cannon is 2 m/s. 


example 3 
> The experimental solar-powered ion drive of the Deep Space 

1 space probe expels its xenon gas exhaust at a speed of 30,000 
m/s, ten times faster than the exhaust velocity for a typical chemical- 
fuel rocket engine. Roughly how many times greater is the maxi- 
mum speed this spacecraft can reach, compared with a chemical- 
fueled probe with the same mass of fuel (“reaction mass”) avail- 
able for pushing out the back as exhaust? 


> Momentum equals mass multiplied by velocity. Both spacecraft 
are assumed to have the same amount of reaction mass, and the 
ion drive’s exhaust has a velocity ten times greater, so the mo- 
mentum of its exhaust is ten times greater. Before the engine 
starts firing, neither the probe nor the exhaust has any momen- 
tum, so the total momentum of the system is zero. By conserva- 
tion of momentum, the total momentum must also be zero after 
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all the exhaust has been expelled. If we define the positive di- 
rection as the direction the spacecraft is going, then the negative 
momentum of the exhaust is canceled by the positive momen- 
tum of the spacecraft. The ion drive allows a final speed that is 
ten times greater. (This simplified analysis ignores the fact that 
the reaction mass expelled later in the burn is not moving back- 
ward as fast, because of the forward speed of the already-moving 
spacecraft.) 


3.1.2 Nonmechanical momentum 


So far, it sounds as though conservation of momentum can be 
proved mathematically, unlike conservation of mass and energy, which 
are entirely based on observations. The proof, however, was only for 
a mechanical system, with interactions of the form U(r). Conser- 
vation of momentum can be extended to other systems as well, but 
this generalization is based on experiments, not mathematical proof. 
Light is the most important example of momentum that doesn’t 
equal mv — light doesn’t have mass at all, but it does have mo- 
mentum. For example, a flashlight left on for an hour would absorb 
about 10-° kg - m/s of momentum as it recoiled. 


‘Halley's comet example 4 
Momentum is not always equal to mv. Halley’s comet, shown in 
figure d, has a very elongated elliptical orbit, like those of many 
other comets. About once per century, its orbit brings it close 
to the sun. The comet’s head, or nucleus, is composed of dirty 
ice, so the energy deposited by the intense sunlight gradually re- 
moves ice from the surface and turns it into water vapor. The 
bottom photo shows a view of the water coming off of the nucleus 
from the European Giotto space probe, which passed within 596 
km of the comet’s head on March 13, 1986. 


The sunlight does not just carry energy, however. It also carries 
momentum. Once the steam comes off, the momentum of the 
sunlight impacting on it pushes it away from the sun, forming a 
tail as shown in in the top image. The tail always points away 
from the sun, so when the comet is receding from the sun, the tail 


is in front. By analogy with matter, for which momentum equals d/Halley’s comet. Top: A 
mv, you would expect that massless light would have zero mo- photograph made from earth. 
mentum, but the equation p = mv is not the correct one for light, Bottom: A view of the nucleus 
and light does have momentum. (Some comets also have a sec- from the Giotto space probe. (W. 


ond tail, which is propelled by electrical forces rather than by the Eile ang eurapean Spate AgENEY) 


momentum of sunlight.) 


The reason for bringing this up is not so that you can plug 
numbers into formulas in these exotic situations. The point is that 
the conservation laws have proven so sturdy exactly because they 
can easily be amended to fit new circumstances. The momentum of 
light will be a natural consequence of the discussion of the theory 
of relativity in chapter 7. 
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3.1.3 Momentum compared to kinetic energy 


Momentum and kinetic energy are both measures of the quantity 
of motion, and a sideshow in the Newton-Leibniz controversy over 
who invented calculus was an argument over whether mv (i.e., mo- 
mentum) or mv? (i.e., kinetic energy without the 1/2 in front) was 
the “true” measure of motion. The modern student can certainly 
be excused for wondering why we need both quantities, when their 
complementary nature was not evident to the greatest minds of the 
1700s. The following table highlights their differences. 


Kinetic energy... 


Momentum... 


doesn’t depend on direc- 
tion. 


depends on direction. 


is always positive, and can- 
not cancel out. 


cancels with momentum in 
the opposite direction. 


can be traded for forms of 
energy that do not involve 


is always conserved in a 
closed system. 


motion. Kinetic energy is 
not a conserved quantity 


by itself. 
is quadrupled if the veloc- | is doubled if the velocity is 
ity is doubled. doubled. 


Here are some examples that show the different behaviors of the 
two quantities. 


A spinning top example 5 
A spinning top has zero total momentum, because for every mov- 
ing point, there is another point on the opposite side that cancels 
its momentum. It does, however, have kinetic energy. 


Momentum and kinetic energy in firing a rifle example 6 
The rifle and bullet have zero momentum and zero kinetic energy 
to start with. When the trigger is pulled, the bullet gains some mo- 
mentum in the forward direction, but this is canceled by the rifle’s 
backward momentum, so the total momentum is still zero. The 
kinetic energies of the gun and bullet are both positive numbers, 
however, and do not cancel. The total kinetic energy is allowed to 
increase, because kinetic energy is being traded for other forms 
of energy. Initially there is chemical energy in the gunpowder. 
This chemical energy is converted into heat, sound, and kinetic 
energy. The gun’s “backward” kinetic energy does not refrigerate 
the shooter’s shoulder! 


The wobbly earth example 7 
As the moon completes half a circle around the earth, its motion 
reverses direction. This does not involve any change in kinetic 
energy. The reversed velocity does, however, imply a reversed 
momentum, so conservation of momentum in the closed earth- 
moon system tells us that the earth must also reverse its momen- 
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tum. In fact, the earth wobbles in a little “orbit” about a point below 
its surface on the line connecting it and the moon. The two bod- 
ies’ momenta always point in opposite directions and cancel each 
other out. 


The earth and moon get a divorce example 8 
Why can’t the moon suddenly decide to fly off one way and the 
earth the other way? It is not forbidden by conservation of mo- 
mentum, because the moon’s newly acquired momentum in one 
direction could be canceled out by the change in the momen- 
tum of the earth, supposing the earth headed off in the opposite 
direction at the appropriate, slower speed. The catastrophe is for- 
bidden by conservation of energy, because their energies would 
have to increase greatly. 


Momentum and kinetic energy of a glacier example 9 
A cubic-kilometer glacier would have a mass of about 10'* kg. If 
it moves at a speed of 10~° m/s, then its momentum is 107 kg - 
m/s. This is the kind of heroic-scale result we expect, perhaps 
the equivalent of the space shuttle taking off, or all the cars in LA 
driving in the same direction at freeway speed. Its kinetic energy, 
however, is only 50 J, the equivalent of the calories contained 
in a poppy seed or the energy in a drop of gasoline too small 
to be seen without a microscope. The surprisingly small kinetic 
energy is because kinetic energy is proportional to the square of 
the velocity, and the square of a small number is an even smaller 
number. 


Discussion Questions 


A lf all the air molecules in the room settled down in a thin film on the 
floor, would that violate conservation of momentum as well as conserva- 
tion of energy? 


B A refrigerator has coils in back that get hot, and heat is molecular 
motion. These moving molecules have both energy and momentum. Why 
doesn’t the refrigerator need to be tied to the wall to keep it from recoiling 
from the momentum it loses out the back? 


3.1.4 Collisions in one dimension 


Physicists employ the term “collision” in a broader sense than in 
ordinary usage, applying it to any situation where objects interact 
for a certain period of time. A bat hitting a baseball, a cosmic ray 
damaging DNA, and a gun and a bullet going their separate ways 
are all examples of collisions in this sense. Physical contact is not 
even required. A comet swinging past the sun on a hyperbolic orbit 
is considered to undergo a collision, even though it never touches 
the sun. All that matters is that the comet and the sun interacted 
gravitationally with each other. 


The reason for broadening the term “collision” in this way is 
that all of these situations can be attacked mathematically using 
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e / Example 8. 


f/ This Hubble Space  Tele- 
scope photo shows a_ small 
galaxy (yellow blob in the lower 
right) that has collided with a 
larger galaxy (spiral near the 
center), producing a wave of star 
formation (blue track) due to the 
shock waves passing through 
the galaxies’ clouds of gas. This 
is considered a collision in the 
physics sense, even though it is 
statistically certain that no star in 
either galaxy ever struck a star 
in the other — the stars are very 
small compared to the distances 
between them.(NASA) 
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the same conservation laws in similar ways. In our first example, 
conservation of momentum is all that is required. 


Getting rear-ended example 10 
> Ms. Chang is rear-ended at a stop light by Mr. Nelson, and sues 
to make him pay her medical bills. He testifies that he was only 
going 55 km per hour when he hit Ms. Chang. She thinks he was 
going much faster than that. The cars skidded together after the 
impact, and measurements of the length of the skid marks show 
that their joint velocity immediately after the impact was 30 km 
per hour. Mr. Nelson’s Nissan has a mass of 1400 kg, and Ms. 
Chang ’s Cadillac is 2400 kg. Is Mr. Nelson telling the truth? 


> Since the cars skidded together, we can write down the equation 
for conservation of momentum using only two velocities, v for Mr. 
Nelson’s velocity before the crash, and v’ for their joint velocity 
afterward: 

MyV = myv’ + mov! 


Solving for the unknown, v, we find 


= 80 km/hr 


He is lying. 


The above example was simple because both cars had the same 
velocity afterward. In many one-dimensional collisions, however, the 
two objects do not stick. If we wish to predict the result of such a 
collision, conservation of momentum does not suffice, because both 
velocities after the collision are unknown, so we have one equation 
in two unknowns. 


Conservation of energy can provide a second equation, but its 
application is not as straightforward, because kinetic energy is only 
the particular form of energy that has to do with motion. In many 
collisions, part of the kinetic energy that was present before the 
collision is used to create heat or sound, or to break the objects 
or permanently bend them. Cars, in fact, are carefully designed to 
crumple in a collision. Crumpling the car uses up energy, and that’s 
good because the goal is to get rid of all that kinetic energy in a 
relatively safe and controlled way. At the opposite extreme, a super- 
ball is “super” because it emerges from a collision with almost all its 
original kinetic energy, having only stored it briefly as interatomic 
electrical energy while it was being squashed by the impact. 


Collisions of the superball type, in which almost no kinetic en- 
ergy is converted to other forms of energy, can thus be analyzed 
more thoroughly, because they have Kr = K;, as opposed to the 
less useful inequality Ky < K; for a case like a tennis ball bouncing 
on grass. 
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Pool balls colliding head-on example 11 
> Two pool balls collide head-on, so that the collision is restricted 
to one dimension. Pool balls are constructed so as to lose as little 
kinetic energy as possible in a collision, so under the assumption 
that no kinetic energy is converted to any other form of energy, 
what can we predict about the results of such a collision? 


> Pool balls have identical masses, so we use the same symbol 
m for both. Conservation of energy and no loss of kinetic energy 
give us the two equations 


MV4j + IMVo; = MV4¢ + MVoF¢ 


1 1 
2 2 
=Mv;; + =Mv5 


US eee 


c 2 2 

The masses and the factors of 1/2 can be divided out, and we 
eliminate the cumbersome subscripts by replacing the symbols 
V4;,--. with the symbols A, B, C, and D: 


A+B=C+D 
AaB aCe 


A little experimentation with numbers shows that given values of A 
and B, it is impossible to find C and D that satisfy these equations 
unless C and D equal A and B, or C and D are the same as A 
and B but swapped around. A formal proof of this fact is given 
in the sidebar. In the special case where ball 2 is initially at rest, 
this tells us that ball 1 is stopped dead by the collision, and ball 
2 heads off at the velocity originally possessed by ball 1. This 
behavior will be familiar to players of pool. 


Often, as in example 11, the details of the algebra are the least 
interesting part of the problem, and considerable physical insight 
can be gained simply by counting the number of unknowns and 
comparing to the number of equations. Suppose a beginner at pool 
notices a case where her cue ball hits an initially stationary ball 
and stops dead. “Wow, what a good trick,” she thinks. “I bet I 
could never do that again in a million years.” But she tries again, 
and finds that she can’t help doing it even if she doesn’t want to. 
Luckily she has just learned about collisions in her physics course. 
Once she has written down the equations for conservation of energy 
and no loss of kinetic energy, she really doesn’t have to complete the 
algebra. She knows that she has two equations in two unknowns, 
so there must be a well-defined solution. Once she has seen the 
result of one such collision, she knows that the same thing must 
happen every time. The same thing would happen with colliding 
marbles or croquet balls. It doesn’t matter if the masses or velocities 
are different, because that just multiplies both equations by some 
constant factor. 
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Gory Details of the Proof in 
Example 11 


The equation A+B = C+D says 
that the change in one ball’s ve- 
locity is equal and opposite to 
the change in the other’s. We 
invent a symbol x = C — A for 
the change in ball 1’s velocity. 
The second equation can then 
be rewritten as A? + B? = (A+ 
x)? + (B— x)*. Squaring out the 
quantities in parentheses and then 
simplifying, we get 0 = Ax—Bx+ 
x. The equation has the trivial 
solution x = 0, i.e., neither ball’s 
velocity is changed, but this is 
physically impossible because the 
balls can’t travel through each 
other like ghosts. Assuming x # 
0, we can divide by x and solve 
for x = B—A. This means that 
ball 1 has gained an amount of 
velocity exactly sufficient to mat- 
ch ball 2’s initial velocity, and vice- 
versa. The balls must have swap- 
ped velocities. 
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The discovery of the neutron 


This was the type of reasoning employed by James Chadwick in 
his 1932 discovery of the neutron. At the time, the atom was imag- 
ined to be made out of two types of fundamental particles, protons 
and electrons. The protons were far more massive, and clustered 
together in the atom’s core, or nucleus. Electrical attraction caused 
the electrons to orbit the nucleus in circles, in much the same way 
that gravity kept the planets from cruising out of the solar system. 
Experiments showed, for example, that twice as much energy was 
required to strip the last electron off of a helium atom as was needed 
to remove the single electron from a hydrogen atom, and this was 
explained by saying that helium had two protons to hydrogen’s one. 
The trouble was that according to this model, helium would have 
two electrons and two protons, giving it precisely twice the mass of 
a hydrogen atom with one of each. In fact, helium has about four 
times the mass of hydrogen. 


Chadwick suspected that the helium nucleus possessed two addi- 
tional particles of a new type, which did not participate in electrical 
interactions at all, ie., were electrically neutral. If these particles 
had very nearly the same mass as protons, then the four-to-one mass 
ratio of helium and hydrogen could be explained. In 1930, a new 
type of radiation was discovered that seemed to fit this description. 
It was electrically neutral, and seemed to be coming from the nuclei 
of light elements that had been exposed to other types of radia- 
tion. At this time, however, reports of new types of particles were 
a dime a dozen, and most of them turned out to be either clusters 
made of previously known particles or else previously known parti- 
cles with higher energies. Many physicists believed that the “new” 
particle that had attracted Chadwick’s interest was really a pre- 
viously known particle called a gamma ray, which was electrically 
neutral. Since gamma rays have no mass, Chadwick decided to try 
to determine the new particle’s mass and see if it was nonzero and 
approximately equal to the mass of a proton. 


Unfortunately a subatomic particle is not something you can 
just put on a scale and weigh. Chadwick came up with an ingenious 
solution. The masses of the nuclei of the various chemical elements 
were already known, and techniques had already been developed for 
measuring the speed of a rapidly moving nucleus. He therefore set 
out to bombard samples of selected elements with the mysterious 
new particles. When a direct, head-on collision occurred between 
a mystery particle and the nucleus of one of the target atoms, the 
nucleus would be knocked out of the atom, and he would measure 
its velocity. 


Suppose, for instance, that we bombard a sample of hydrogen 
atoms with the mystery particles. Since the participants in the 
collision are fundamental particles, there is no way for kinetic energy 
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to vacuum 
pump 


= 


polonium — beryllium 


g / Chadwick’s subatomic pool table. A disk of the naturally occur- 
ring metal polonium provides a source of radiation capable of kicking 
neutrons out of the beryllium nuclei. The type of radiation emitted by 
the polonium is easily absorbed by a few mm of air, so the air has to be 
pumped out of the left-hand chamber. The neutrons, Chadwick’s mystery 
particles, penetrate matter far more readily, and fly out through the wall 
and into the chamber on the right, which is filled with nitrogen or hydrogen 
gas. When a neutron collides with a nitrogen or hydrogen nucleus, it 
kicks it out of its atom at high speed, and this recoiling nucleus then rips 
apart thousands of other atoms of the gas. The result is an electrical 
pulse that can be detected in the wire on the right. Physicists had already 
calibrated this type of apparatus so that they could translate the strength 
of the electrical pulse into the velocity of the recoiling nucleus. The 
whole apparatus shown in the figure would fit in the palm of your hand, in 
dramatic contrast to today’s giant particle accelerators. 


to be converted into heat or any other form of energy, and Chadwick 
thus had two equations in three unknowns: 


equation #1: conservation of momentum 

equation #2: no loss of kinetic energy 

unknown #1: mass of the mystery particle 
unknown #2: initial velocity of the mystery particle 
unknown #3: final velocity of the mystery particle 


The number of unknowns is greater than the number of equa- 
tions, so there is no unique solution. But by creating collisions with 
nuclei of another element, nitrogen, he gained two more equations 
at the expense of only one more unknown: 


equation #3: conservation of momentum in the new collision 
equation #4: no loss of kinetic energy in the new collision 


unknown #4: final velocity of the mystery particle in the new 
collision 


He was thus able to solve for all the unknowns, including the 
mass of the mystery particle, which was indeed within 1% of the 
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mass of a proton. He named the new particle the neutron, since it 
is electrically neutral. 


Discussion Questions 


A Good pool players learn to make the cue ball spin, which can cause 
it not to stop dead in a head-on collision with a stationary ball. If this does 
not violate the laws of physics, what hidden assumption was there in the 
example in the text where it was proved that the cue ball must stop? 


3.1.5 The center of mass 


Figures h and j show two examples where a motion that appears 
complicated actually has a very simple feature. In both cases, there 
is a particular point, called the center of mass, whose motion is 
surprisingly simple. The highjumper flexes his body as he passes 
over the bar, so his motion is intrinsically very complicated, and 
yet his center of mass’s motion is a simple parabola, just like the 
parabolic arc of a pointlike particle. The wrench’s center of mass 
travels in a straight line as seen from above, which is what we’d 
expect for a pointlike particle flying through the air. 


h/ The highjumper’s body 
passes over the bar, but his 
center of mass passes under 
it.(Dunia Young) 


The highjumper and the wrench are both complicated systems, 


j/In this multiple-flash photograph, we see the wrench from above 
as it flies through the air, rotating as it goes. Its center of mass, marked 
with the black cross, travels along a straight line, unlike the other points 
on the wrench, which execute loops. (PSSC Physics) 


each consisting of zillions of subatomic particles. To understand 
what’s going on, let’s instead look at a nice simple system, two pool 
balls colliding. We assume the balls are a closed system (i.e., their 
interaction with the felt surface is not important) and that their 
rotation is unimportant, so that we’ll be able to treat each one as a 
single particle. By symmetry, the only place their center of mass can 
be is half-way in between, at an x coordinate equal to the average 
of the two balls’ positions, Lem = (#1 + £2) /2. 


Figure i makes it appear that the center of mass, marked with 
i / Two pool balls collide. an xX, moves with constant velocity to the right, regardless of the 
collision, and we can easily prove this using conservation of momen- 
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tum: 


Vem = Gem (at 
1 
- rica + v2) 


1 
= am + mv2) 


Ptotal 
™Meetotal 


Since momentum is conserved, the last expression is constant, which 
proves that ven, is constant. 


Rearranging this a little, we have piotai = MiotalUem- In other 
words, the total momentum of the system is the same as if all its 
mass was concentrated at the center of mass point. 


Sigma notation 


When there is a large, potentially unknown number of particles, 
we can write sums like the ones occurring above using symbols like 
“4...” but that gets awkward. It’s more convenient to use the 
Greek uppercase sigma, ©, to indicate addition. For example, the 
sum 1? + 2? + 3? + 4? = 30 could be written as 


nm 
~—/=30 
j=l 


read “the sum from j = 1 to n of j?.” The variable 7 is a dummy 


variable, just like the dz in an integral that tells you you’re integrat- 
ing with respect to x, but has no significance outside the integral. 
The j below the sigma tells you what variable is changing from one 
term of the sum to the next, but 7 has no significance outside the 
sum. 


As an example, let’s generalize the proof of piotat = MtotalVem tO 
the case of an arbitrary number n of identical particles moving in 
one dimension, rather than just two particles. The center of mass 
is at 


n 
1 
fem =—> vj ) 
n 4 
j=l 


where 2x1 is the mass of the first particle, and so on. The velocity of 
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k/No matter what point you 
hang the pear from, the string 
lines up with the pear’s center 
of mass. The center of mass 
can therefore be defined as the 
intersection of all the lines made 
by hanging the pear in this way. 
Note that the X in the figure 
should not be interpreted as 
implying that the center of mass 
is on the surface — it is actually 
inside the pear. 


1/The circus performers hang 
with the ropes passing through 
their centers of mass. 
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center of mass Bg. 
~ , 


y, 
4 
ZO 


m/An _ improperly balanced 
wheel has a center of mass that 
is not at its geometric center. 
When you get a new tire, the 
mechanic clamps little weights to 
the rim to balance the wheel. 


n/This toy was_ intentionally 
designed so that the mushroom- 
shaped piece of metal on top 
would throw off the center of 
mass. When you wind it up, the 
mushroom spins, but the center 
of mass doesn’t want to move, 
so the rest of the toy tends to 
counter the mushroom’s motion, 
causing the whole thing to jump 
around. 


the center of mass is 


Ucm = ean, / At 


n 


™Meetotal 


What about a system containing objects with unequal masses, 
or containing more than two objects? The reasoning above can be 
generalized to a weighted average: 


a1 MX; 


9 i = 
= pa Be) 


‘The solar system's center of mass example 12 
In the discussion of the sun’s gravitational field on page 99, | 
mentioned in a footnote that the sun doesn’t really stay in one 
place while the planets orbit around it. Actually, motion is relative, 
so it’s meaningless to ask whether the sun is absolutely at rest, 
but it is meaningful to ask whether it moves in a straight line at 
constant velocity. We can now see that since the solar system 
is a closed system, its total momentum must be constant, and 
Ptotal = total Vem then tells us that it’s the solar system’s center 
of mass that has constant velocity, not the sun. The sun wob- 
bles around this point irregularly due to its interactions with the 
planets, Jupiter in particular. 


‘The earth-moon system example 13 
The earth-moon system is much simpler than the solar system 
because it contains only two objects. Where is the center of mass 
of this system? Let x=0 be the earth’s center, so that the moon 
lies at x = 3.8 x 10° km. Then 


2 
jet NX 


2 
Deja MN 
M4 X1 + MeXo 
~ m + Mo 


Xem = 


v] 


and letting 1 be the earth and 2 the moon, we have 


Mearth X 0 + MmoonXmoon 


Xem = 
Mearth + Mmoon 


= 4600 km , 


or about three quarters of the way from the earth’s center to its 
surface. 
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The center of mass as an average example 14 
> Explain how we know that the center of mass of each object is 
at the location shown in figure o. 


POIeC 


> The center of mass is a sort of average, so the height of the 
centers of mass in 1 and 2 has to be midway between the two 
squares, because that height is the average of the heights of the 
two squares. Example 3 is a combination of examples 1 and 
2, SO we can find its center of mass by averaging the horizontal 
positions of their centers of mass. In example 4, each square 
has been skewed a little, but just as much mass has been moved 
up as down, so the average vertical position of the mass hasn’t 
changed. Example 5 is clearly not all that different from example 
4, the main difference being a slight clockwise rotation, so just as 
in example 4, the center of mass must be hanging in empty space, 
where there isn’t actually any mass. Horizontally, the center of 
mass must be between the heels and toes, or else it wouldn't be 
possible to stand without tipping over. 


Momentum and Galilean relativity example 15 
The principle of Galilean relativity states that the laws of physics 
are supposed to be equally valid in all inertial frames of refer- 
ence. If we first calculate some momenta in one frame of refer- 
ence and find that momentum is conserved, and then rework the 
whole problem in some other frame of reference that is moving 
with respect to the first, the numerical values of the momenta will 
all be different. Even so, momentum will still be conserved. All 
that matters is that we work a single problem in one consistent 
frame of reference. 


One way of proving this is to apply the equation Pigtail = Motal Vem: 
If the velocity of one frame relative to the other is u, then the only 
effect of changing frames of reference is to change Vem from its 
original value to Vem + u. This adds a constant onto the momen- 
tum, which has no effect on conservation of momentum. 


self-check A 

The figure shows a gymnast holding onto the inside of a big wheel. 
From inside the wheel, how could he make it roll one way or the other? 
> Answer, p. 867 


o/ Example 14. 


p / Self-check A. 
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q/The same collision of two 
pools balls, but now seen in 
the center of mass frame of 
reference. 


es Vat 


r/The sun’s’ frame of refer- 
ence. 


__ 
Vit 


s/Thec.m. frame. 


3.1.6 The center of mass frame of reference 


A particularly useful frame of reference in many cases is the 
frame that moves along with the center of mass, called the center 
of mass (c.m.) frame. In this frame, the total momentum is zero. 
The following examples show how the center of mass frame can be 
a powerful tool for simplifying our understanding of collisions. 


A collision of pool balls viewed in the c.m. frame example 16 
If you move your head so that your eye is always above the point 
halfway in between the two pool balls, as in figure q, you are 
viewing things in the center of mass frame. In this frame, the 
balls come toward the center of mass at equal speeds. By sym- 
metry, they must therefore recoil at equal speeds along the lines 
on which they entered. Since the balls have essentially swapped 
paths in the center of mass frame, the same must also be true in 
any other frame. This is the same result that required laborious 
algebra to prove previously without the concept of the center of 
mass frame. 


The slingshot effect example 17 
It is a counterintuitive fact that a spacecraft can pick up speed 
by swinging around a planet, if it arrives in the opposite direction 
compared to the planet’s motion. Although there is no physical 
contact, we treat the encounter as a one-dimensional collision, 
and analyze it in the center of mass frame. Since Jupiter is so 
much more massive than the spacecraft, the center of mass is 
essentially fixed at Jupiter's center, and Jupiter has zero velocity 
in the center of mass frame, as shown in figure 3.1.6. The c.m. 
frame is moving to the left compared to the sun-fixed frame used 
in figure 3.1.6, so the spacecraft’s initial velocity is greater in this 
frame than in the sun’s frame. 


Things are simpler in the center of mass frame, because it is more 
symmetric. In the sun-fixed frame, the incoming leg of the en- 
counter is rapid, because the two bodies are rushing toward each 
other, while their separation on the outbound leg is more grad- 
ual, because Jupiter is trying to catch up. In the c.m. frame, 
Jupiter is sitting still, and there is perfect symmetry between the 
incoming and outgoing legs, so by symmetry we have v4; = —V4;. 
Going back to the sun-fixed frame, the spacecraft’s final velocity 
is increased by the frames’ motion relative to each other. In the 
sun-fixed frame, the spacecraft’s velocity has increased greatly. 


Einstein's motorcycle example 18 
We've assumed we were dealing with a system of material ob- 
jects, for which the equation p = mv was true. What if our system 
contains only light rays, or a mixture of light and matter? As a col- 
lege student, Einstein kept worrying about was what a beam of 
light would look like if you could ride alongside it on a motorcycle. 
In other words, he imagined putting himself in the light beam’s 
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center of mass frame. Chapter 7 discusses Einstein’s resolution 
of this problem, but the basic point is that you can’t ride the mo- 
torcycle alongside the light beam, because material objects can’t 
go as fast as the speed of light. A beam of light has no center of 
mass frame of reference. 


Discussion Questions 


A Make up a numerical example of two unequal masses moving in one 
dimension at constant velocity, and verify the equation Pyorai = Mtotal Vem 
over a time interval of one second. 


B A more massive tennis racquet or baseball bat makes the ball fly 
off faster. Explain why this is true, using the center of mass frame. For 
simplicity, assume that the racquet or bat is simply sitting still before the 
collision, and that the hitter’s hands do not make any force large enough 
to have a significant effect over the short duration of the impact. 


3.2 Force In One Dimension 


3.2.1 Momentum transfer 


For every conserved quantity, we can define an associated rate 
of flow. An open system can have mass transferred in or out of it, 
and we can measure the rate of mass flow, dm/dt in units of kg/s. 
Energy can flow in or out, and the rate of energy transfer is the 
power, P = dE/dt, measured in watts.2 The rate of momentum 
transfer is called force, 


_ dp 


i= 


[definition of force] 

The units of force are kgm/s*, which can be abbreviated as newtons, 
1 N=kg-m/s?. Newtons are unfortunately not as familiar as watts. 
A newton is about how much force you’d use to pet a dog. The 
most powerful rocket engine ever built, the first stage of the Saturn 
V that sent astronauts to the moon, had a thrust of about 30 million 
newtons. In one dimension, positive and negative signs indicate the 
direction of the force — a positive force is one that pushes or pulls 
in the direction of the positive x axis. 


‘Walking into a lamppost example 19 
> Starting from rest, you begin walking, bringing your momentum 
up to 100 kg-m/s. You walk straight into a lamppost. Why is the 
momentum change of —100 kg-m/s so much more painful than 
the change of +100 kg-m/s when you started walking? 


> The forces are not really constant, but for this type of qualitative 
discussion we can pretend they are, and approximate dp/dt as 
Ap/At. It probably takes you about 1 s to speed up initially, so the 
ground’s force on you is F = Ap/At = 100 N. Your impact with 
the lamppost, however, is over in the blink of an eye, say 1/10 s or 


3Recall that uppercase P is power, while lowercase p is momentum. 


power = rate of 
transferring energy 


force = rate of 


a/Power and force are 
rates at which energy 
momentum are transferred. 
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transferring momentum 


the 
and 
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b/ The 
so as to reduce F = Ap/At. 


airbag increases At 


less. Dividing by this much smaller At gives a much larger force, 
perhaps thousands of newtons (with a negative sign). 


This is also the principle of airbags in cars. The time required 
for the airbag to decelerate your head is fairly long: the time it 
takes your face to travel 20 or 30 cm. Without an airbag, your face 
would have hit the dashboard, and the time interval would have 
been the much shorter time taken by your skull to move a couple of 
centimeters while your face compressed. Note that either way, the 
same amount of momentum is transferred: the entire momentum of 
your head. 


Force is defined as a derivative, and the derivative of a sum is 
the sum of the derivatives. Therefore force is additive: when more 
than one force acts on an object, you add the forces to find out 
what happens. An important special case is that forces can cancel. 
Consider your body sitting in a chair as you read this book. Let 
the positive x axis be upward. The chair’s upward force on you 
is represented with a positive number, which cancels out with the 
earth’s downward gravitational force, which is negative. The total 
rate of momentum transfer into your body is zero, and your body 
doesn’t change its momentum. 


‘Finding momentum from force example 20 
> An object of mass ™ starts at rest at t = f. A force varying as 
F = bt~?, where b is a constant, begins acting on it. Find the 
greatest speed it will ever have. 


> 
dp 
Pe at 
dp = Fdt 
p= | Fat+p. 
ae Po ) 


where Pp is a constant of integration. The given initial condition 
is that p = 0 at t = 6, so we find that pp = b/t. The negative 
term gets closer to zero with increasing time, so the maximum 
momentum is achieved by letting t approach infinity. That is, the 
object will never stop speeding up, but it will also never surpass a 
certain speed. In the limit t + oo, we identify p) as the momen- 
tum that the object will approach asymptotically. The maximum 
velocity is V = Po/M= b/mMto. 


Discussion Question 


A Many collisions, like the collision of a bat with a baseball, appear to 
be instantaneous. Most people also would not imagine the bat and ball as 
bending or being compressed during the collision. Consider the following 
possibilities: 
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(1) The collision is instantaneous. 

(2) The collision takes a finite amount of time, during which the ball and 
bat retain their shapes and remain in contact. 

(3) The collision takes a finite amount of time, during which the ball and 
bat are bending or being compressed. 


How can two of these be ruled out based on energy or momentum con- 
siderations? 


3.2.2 Newton’s laws 


Although momentum is the third conserved quantity we’ve en- 
countered, historically it was the first to be discovered. Isaac New- 
ton formulated a complete treatment of mechanical systems in terms 
of force and momentum. Newton’s theory was based on three laws 
of motion, which we now think of as consequences of conservation 
of mass, energy, and momentum. 


Newton’s laws in one dimension: 


Newton’s first law: If there is no force acting on an 
object, it stays in the same state of motion. 


Newton’s second law: The sum of all the forces acting 
on an object determines the rate at which its momen- 
tum changes, Fioray = dp/dt. 

Newton’s third law: Forces occur in opposite pairs. If 
object A interacts with object B, then A’s force on B 
and B’s force on A are related by F4p = —F pa. 


The second law is the definition of force, which we’ve already en- 
countered.4 The first law is a special case of the second law — 
if dp/dt is zero, then p = mv is a constant, and since mass is con- 
served, constant p implies constant v. The third law is a restatement 
of conservation of momentum: for two objects interacting, we have 
constant total momentum, so 0 = (pA +pp) = Feat Fas. 


‘a=F/m example 21 
Many modern textbooks restate Newton’s second law as a = 
F/m, i.€., aS an equation that predicts an object’s acceleration 
based on the force exerted on it. This is easily derived from New- 
ton’s original form as follows: a = dv/dt = (dp/dt)/m= F/m. 


‘Gravitational force related to g example 22 
As a special case of the previous example, consider an object 
in free fall, and let the x axis point down. Then a = +g, and 
F = ma = mg. For example, the gravitational force on a 1 kg 
mass at the earth’s surface is about 9.8 N. Even if other forces 
act on the object, and it isn’t in free fall, the gravitational force on 
it is still the same, and can still be calculated as mg. 


“This is with the benefit of hindsight. At the time, the word “force” already 
had certain connotations, and people thought they understood what it meant 
and how to measure it, e.g., by using a spring scale. From their point of view, 
F = dp/dt was not a definition but a testable — and controversial! — statement. 


c/lsaac Newton (1643-1727). 
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scale 


magnet 


magnet 


scale 


d/Two magnets exert forces 
on each other. 


Excuse me, ma'am, but it 
appears that the money in your 
purse would exactly cancel 
out my bar tab. 


al i 
Y eg 


e/l|t doesn’t make sense _ for 
the man to talk about the 
woman’s money canceling out his 
bar tab, because there is no good 
reason to combine his debts and 
her assets. 


oad 


f/Newton’s” third law does 
not mean that forces always can- 
cel out so that nothing can ever 
move. If these two ice skaters, 
initially at rest, push against each 
other, they will both move. 


Changing frames of reference example 23 
Suppose we change from one frame reference into another, which 
is moving relative to the first one at a constant velocity u. If an 
object of mass mis moving at velocity v (which need not be con- 
stant), then the effect is to change its momentum from mv in one 
frame to mv+mu in the other. Force is defined as the derivative of 
momentum with respect to time, and the derivative of a constant 
is zero, so adding the constant mu has no effect on the result. 
We therefore conclude that observers in different inertial frames 
of reference agree on forces. 


Using the third law correctly 


If you’ve already accepted Galilean relativity in your heart, then 
there is nothing really difficult about the first and second laws. The 
third law, however, is more of a conceptual challenge. The first 
hurdle is that it is counterintuitive. Is it really true that if a fighter 
jet collides with a mosquito, the mosquito’s force on the jet is just 
as strong as the jet’s force on the mosquito? Yes, it is true, but it 
is hard to believe at first. That amount of force simply has more of 
an effect on the mosquito, because it has less mass. 


A more humane and practical experiment is shown in figure d. A 
large magnet and a small magnet are weighed separately, and then 
one magnet is hung from the pan of the top balance so that it is 
directly above the other magnet. There is an attraction between the 
two magnets, causing the reading on the top scale to increase and the 
reading on the bottom scale to decrease. The large magnet is more 
“powerful” in the sense that it can pick up a heavier paperclip from 
the same distance, so many people have a strong expectation that 
one scale’s reading will change by a far different amount than the 
other. Instead, we find that the two changes are equal in magnitude 
but opposite in direction, so the upward force of the top magnet on 
the bottom magnet is of the same magnitude as the downward force 
of the bottom magnet on the top magnet. 


To students, it often sounds as though Newton’s third law im- 
plies nothing could ever change its motion, since the two equal and 
opposite forces would always cancel. As illustrated in figure e, the 
fallacy arises from assuming that we can add things that it doesn’t 
make sense to add. It only makes sense to add up forces that are 
acting on the same object, whereas two forces related to each other 
by Newton’s third law are always acting on two different objects. 
If two objects are interacting via a force and no other forces are 
involved, then both objects will accelerate — in opposite directions, 
as shown in figure f! 


Here are some suggestions for avoiding misapplication of New- 
ton’s third law: 
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1. It always relates exactly two forces, not more. 


2. The two forces involve exactly two objects, in the pattern A 
on B, B on A. 


3. The two forces are always of the same type, e.g., friction and 
friction, or gravity and gravity. 


Directions of forces 


We’ve already seen that momentum, unlike energy, has a direc- 
tion in space. Since force is defined in terms of momentum, force 
also has a direction in space. For motion in one dimension, we 
have to pick a coordinate system, and given that choice, forces and 
momenta, will be positive or negative. We’ve already used signs to 
represent directions of forces in Newton’s third law, F4p = —Fpa. 


There is, however, a complication with force that we were able 
to avoid with momentum. If an object is moving on a line, we’re 
guaranteed that its momentum is in one of two directions: the two 
directions along the line. But even an object that stays on a line 
may still be subject to forces that act perpendicularly to the line. 
For example, suppose a coin is sliding to the right across a table, h, 
and let’s choose a positive x axis that points to the right. The coin’s 
motion is along a horizontal line, and its momentum is positive and 
decreasing. Because the momentum is decreasing, its time derivative 
dp/dt is negative. This derivative equals the horizontal force of 
friction F, and its negative sign tells us that this force on the coin 
is to the left. 


But there are also vertical forces on the coin. The Earth exerts 
a downward gravitational force Fh on it, and the table makes an 
upward force F3 that prevents the coin from sinking into the wood. 
In fact, without these vertical forces the horizontal frictional force 
wouldn’t exist: surfaces don’t exert friction against one another 
unless they are being pressed together. 


To avoid mathematical complication, we want to postpone the 
full three-dimensional treatment of force and momentum until sec- 
tion 3.4. For now, we'll limit ourselves to examples like the coin, in 
which the motion is confined to a line, and any forces perpendicular 
to the line cancel each other out. 


Discussion Questions 


A Criticize the following incorrect statement: 

“If an object is at rest and the total force on it is zero, it stays at rest. 
There can also be cases where an object is moving and keeps on moving 
without having any total force on it, but that can only happen when there’s 
no friction, like in outer space.” 


g/A swimmer doing the breast 
stroke pushes backward against 
the water. By Newton’s third law, 
the water pushes forward on him. 


h/A coin slides across a ta- 
ble. Even for motion in one 
dimension, some of the forces 
may not lie along the line of the 
motion. 
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100 


t (s) 
1.84 
2.86 
3.80 
4.67 
5.53 
6.38 
7.23 
8.10 
8.96 
9.83 


B_ The table gives laser timing data for Ben Johnson’s 100 m dash at the 
1987 World Championship in Rome. (His world record was later revoked 
because he tested positive for steroids.) How does the total force on him 
change over the duration of the race? 


Cc You hit a tennis ball against a wall. Explain any and all incorrect 
ideas in the following description of the physics involved: “According to 
Newton’s third law, there has to be a force opposite to your force on the 
ball. The opposite force is the ball’s mass, which resists acceleration, and 
also air resistance.” 


D Tam Anh grabs Sarah by the hand and tries to pull her. She tries 
to remain standing without moving. A student analyzes the situation as 
follows. “If Tam Anh’s force on Sarah is greater than her force on him, 
he can get her to move. Otherwise, she'll be able to stay where she is.” 
What's wrong with this analysis? 


Q 


3.2.3. What force is not 

Violin teachers have to endure their beginning students’ screech- 
ing. A frown appears on the woodwind teacher’s face as she watches 
her student take a breath with an expansion of his ribcage but none 
in his belly. What makes physics teachers cringe is their students’ 
verbal statements about forces. Below I have listed several dicta 
about what force is not. 


Force is not a property of one object. 


A great many of students’ incorrect descriptions of forces could 
be cured by keeping in mind that a force is an interaction of two 
objects, not a property of one object. 


Incorrect statement: “That magnet has a lot of force.” 


X If the magnet is one millimeter away from a steel ball bearing, they 
may exert a very strong attraction on each other, but if they were a 
meter apart, the force would be virtually undetectable. The magnet’s 
strength can be rated using certain electrical units (ampere — meters’), 
but not in units of force. 


Force is not a measure of an object’s motion. 
If force is not a property of a single object, then it cannot be 
used as a measure of the object’s motion. 


Incorrect statement: “The freight train rumbled down the tracks with 
awesome force.” 


X Force is not a measure of motion. If the freight train collides with a 
stalled cement truck, then some awesome forces will occur, but if it hits 
a fly the force will be small. 


Force is not energy. 


Incorrect statement: “How can my chair be making an upward force on 
my rear end? It has no power!” 


X Power is a concept related to energy, e.g., a 100-waitt lightbulb uses 
up 100 joules per second of energy. When you sit in a chair, no energy 
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is used up, so forces can exist between you and the chair without any 
need for a source of power. 


Force is not stored or used up. 


Because energy can be stored and used up, people think force 
also can be stored or used up. 


Incorrect statement: “If you don't fill up your tank with gas, you'll run 
out of force.” 


X Energy is what you'll run out of, not force. 


Forces need not be exerted by living things or machines. 


Transforming energy from one form into another usually requires 
some kind of living or mechanical mechanism. The concept is not 
applicable to forces, which are an interaction between objects, not 
a thing to be transferred or transformed. 


Incorrect statement: “How can a wooden bench be making an upward 
force on my rear end? It doesn’t have any springs or anything inside it.” 


X No springs or other internal mechanisms are required. If the bench 
didn’t make any force on you, you would obey Newton’s second law and 
fall through it. Evidently it does make a force on you! 


A force is the direct cause of a change in motion. 


I can click a remote control to make my garage door change from 
being at rest to being in motion. My finger’s force on the button, 
however, was not the force that acted on the door. When we speak 
of a force on an object in physics, we are talking about a force that 
acts directly. Similarly, when you pull a reluctant dog along by its 
leash, the leash and the dog are making forces on each other, not 
your hand and the dog. The dog is not even touching your hand. 


self-check B 
Which of the following things can be correctly described in terms of 
force? 


(1) A nuclear submarine is charging ahead at full steam. 
(2) A nuclear submarine’s propellers spin in the water. 


(3) A nuclear submarine needs to refuel its reactor periodically. > 
Answer, p. 867 


Discussion Questions 


A Criticize the following incorrect statement: “If you shove a book 
across a table, friction takes away more and more of its force, until finally 
it stops.” 


B You hit a tennis ball against a wall. Explain any and all incorrect 
ideas in the following description of the physics involved: “The ball gets 
some force from you when you hit it, and when it hits the wall, it loses part 
of that force, so it doesn’t bounce back as fast. The muscles in your arm 
are the only things that a force can come from.” 


Section 3.2 Force In One Dimension 


151 


3.2.4 Forces between solids 


Conservation laws are more fundamental than Newton’s laws, 
and they apply where Newton’s laws don’t, e.g., to light and to the 
internal structure of atoms. However, there are certain problems 
that are much easier to solve using Newton’s laws. As a trivial 
example, if you drop a rock, it could conserve momentum and energy 
by levitating, or by falling in the usual manner.° With Newton’s 
laws, however, we can reason that a = F'/m, so the rock must 
respond to the gravitational force by accelerating. 


Less trivially, suppose a person is hanging onto a rope, and we 
want to know if she will slip. Unlike the case of the levitating rock, 
here the no-motion solution could be perfectly reasonable if her grip 
is strong enough. We know that her hand’s interaction with the rope 
is fundamentally an electrical interaction between the atoms in the 
surface of her palm and the nearby atoms in the surface of the rope. 
For practical problem-solving, however, this is a case where we’re 
better off forgetting the fundamental classification of interactions 
at the atomic level and working with a more practical, everyday 
classification of forces. In this practical scheme, we have three types 
of forces that can occur between solid objects in contact: 


A normal force, F,, is perpendicular to the surface of con- 
tact, and prevents objects from pass- 
ing through each other by becoming as 
strong as necessary (up to the point 
where the objects break). “Normal” 
means perpendicular. 


Static friction, Fs, is parallel to the surface of contact, and 
prevents the surfaces from starting to 
slip by becoming as strong as neces- 
sary, up to a maximum value of Fs maz: 
“Static” means not moving, i.e., not 
slipping. 

Kinetic friction, F;,, is parallel to the surface of contact, and 
tends to slow down any slippage once 
it starts. “Kinetic” means moving, i.e., 


slipping. 
self-check C 
Can a frictionless surface exert a normal force? Can a frictional force 
exist without a normal force? > Answer, p. 867 


If you put a coin on this page, which is horizontal, gravity pulls 
down on the coin, but the atoms in the paper and the coin repel each 
other electrically, and the atoms are compressed until the repulsion 
becomes strong enough to stop the downward motion of the coin. 
We describe this complicated and invisible atomic process by saying 


>This pathological solution was first noted on page 83, and discussed in more 
detail on page 855. 
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that the paper makes an upward normal force on the coin, and the 
coin makes a downward normal force on the paper. (The two normal 
forces are related by Newton’s third law. In fact, Newton’s third law 
only relates forces that are of the same type.) 


If you now tilt the book a little, static friction keeps the coin 
from slipping. The picture at the microscopic level is even more 
complicated than the previous description of the normal force. One 
model is to think of the tiny bumps and depressions in the coin 
as settling into the similar irregularities in the paper. This model 
predicts that rougher surfaces should have more friction, which is 
sometimes true but not always. Two very smooth, clean glass sur- 
faces or very well finished machined metal surfaces may actually 
stick better than rougher surfaces would, the probable explanation 
being that there is some kind of chemical bonding going on, and the 
smoother surfaces allow more atoms to be in contact. 


Finally, as you tilt the book more and more, there comes a point 
where static friction reaches its maximum value. The surfaces be- 
come unstuck, and the coin begins to slide over the paper. Kinetic 
friction slows down this slipping motion significantly. In terms of en- 
ergy, kinetic friction is converting mechanical energy into heat, just 
like when you rub your hands together to keep warm. One model 
of kinetic friction is that the tiny irregularities in the two surfaces 
bump against each other, causing vibrations whose energy rapidly 
converts to heat and sound — you can hear this sound if you rub 
your fingers together near your ear. 


For dry surfaces, experiments show that the following equations 
usually work fairly well: 


Ps max & bs Fy ; 


and 


Fy = Unk , 


where ps, the coefficient of static friction, and juz, the coefficient of 
kinetic friction, are constants that depend on the properties of the 
two surfaces, such as what they’re made of and how rough they are. 


self-check D 
1. When a baseball player slides in to a base, is the friction static, or 
kinetic? 


2. A mattress stays on the roof of a slowly accelerating car. Is the 
friction static, or kinetic? 


3. Does static friction create heat? Kinetic friction? | > Answer, p. 867 


ee) 


= 


aaa A 
i/ Static friction: the tray doesn’t 
slip on the waiter’s fingers. 


j / Kinetic friction: the car skids. 


Section 3.2 Force In One Dimension 153 


k/1. The cliffs normal force on 
the climber’s feet. 2. The track’s 
static frictional force on the wheel 
of the accelerating dragster. 3. 
The ball’s normal force on the 
bat. 


Maximum acceleration of a car example 24 
> Rubber on asphalt gives n, ~ 0.4 and us ~ 0.6. What is the 
upper limit on a car’s acceleration on a flat road, assuming that 
the engine has plenty of power and that air friction is negligible? 


> This isn’t a flying car, so we don’t expect it to accelerate verti- 
cally. The vertical forces acting on the car should cancel out. The 
earth makes a downward gravitational force on the car whose 
absolute value is mg, so the road apparently makes an upward 
normal force of the same magnitude, F, = mg. 


Now what about the horizontal motion? As is always true, the co- 
efficient of static friction is greater than the coefficient of kinetic 
friction, so the maximum acceleration is obtained with static fric- 
tion, i.e., the driver should try not to burn rubber. The maximum 
force of static friction is Fs max = UsFn = usmg. The maximum 
acceleration is a = F;/m = usg ~ 6 m/s?. This is true regard- 
less of how big the tires are, since the experimentally determined 
relationship Fs max = UsFn is independent of surface area. 


self-check E 
Find the direction of each of the forces in figure k. > Answer, p. 867 


Lp 
U 


LS 


Locomotives example 25 
Looking at a picture of a locomotive, |, we notice two obvious 
things that are different from an automobile. Where a car typi- 
cally has two drive wheels, a locomotive normally has many — 
ten in this example. (Some also have smaller, unpowered wheels 
in front of and behind the drive wheels, but this example doesn’t.) 
Also, cars these days are generally built to be as light as possi- 
ble for their size, whereas locomotives are very massive, and no 
effort seems to be made to keep their weight low. (The steam 
locomotive in the photo is from about 1900, but this is true even 
for modern diesel and electric trains.) 


The reason locomotives are built to be so heavy is for traction. 
The upward normal force of the rails on the wheels, Fry, cancels 
the downward force of gravity, Fy, so ignoring plus and minus 
signs, these two forces are equal in absolute value, Fyy = Fy. 
Given this amount of normal force, the maximum force of static 
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friction is Fs = usFy = usFy. This static frictional force, of the 
rails pushing forward on the wheels, is the only force that can 
accelerate the train, pull it uphill, or cancel out the force of air 
resistance while cruising at constant speed. The coefficient of 
static friction for steel on steel is about 1/4, so no locomotive can 
pull with a force greater than about 1/4 of its own weight. If the 
engine is capable of supplying more than that amount of force, the 
result will be simply to break static friction and spin the wheels. 


The reason this is all so different from the situation with a car is 
that a car isn’t pulling something else. If you put extra weight in 
a Car, you improve the traction, but you also increase the inertia 
of the car, and make it just as hard to accelerate. In a train, the 
inertia is almost all in the cars being pulled, not in the locomotive. 


The other fact we have to explain is the large number of driv- 
ing wheels. First, we have to realize that increasing the num- 
ber of driving wheels neither increases nor decreases the total 
amount of static friction, because static friction is independent of 
the amount of surface area in contact. (The reason four-wheel- 
drive is good in a car is that if one or more of the wheels is slip- 
ping on ice or in mud, the other wheels may still have traction. 
This isn’t typically an issue for a train, since all the wheels experi- 
ence the same conditions.) The advantage of having more driving 
wheels on a train is that it allows us to increase the weight of the 
locomotive without crushing the rails, or damaging bridges. 


3.2.5 Fluid friction 


Try to drive a nail into a waterfall and you will be confronted 
with the main difference between solid friction and fluid friction. 
Fluid friction is purely kinetic; there is no static fluid friction. The 
nail in the waterfall may tend to get dragged along by the water 
flowing past it, but it does not stick in the water. The same is true 
for gases such as air: recall that we are using the word “fluid” to 
include both gases and liquids. 


Unlike kinetic friction between solids, fluid friction increases 
rapidly with velocity. It also depends on the shape of the object, 
which is why a fighter jet is more streamlined than a Model T. For 
objects of the same shape but different sizes, fluid friction typically 
scales up with the cross-sectional area of the object, which is one 
of the main reasons that an SUV gets worse mileage on the freeway 
than a compact car. 


|/ Example 25. 


m/The wheelbases of the 
Hummer H3 and the Toyota Prius 
are surprisingly similar, differing 
by only 10%. The main difference 
in shape is that the Hummer is 
much taller and wider. It presents 
a much greater cross-sectional 
area to the wind, and this is the 
main reason that it uses about 2.5 
times more gas on the freeway. 
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A barge 


Discussion Question 
A Criticize the following analysis: “A book is sitting on a table. | shove 


it, overcoming static friction. Then it slows down until it has less force than 
static friction, and it stops.” 


3.2.6 Analysis of forces 

Newton’s first and second laws deal with the total of all the 
forces exerted on a specific object, so it is very important to be able 
to figure out what forces there are. Once you have focused your 
attention on one object and listed the forces on it, it is also helpful 
to describe all the corresponding forces that must exist according 
to Newton’s third law. We refer to this as “analyzing the forces” in 
which the object participates. 


example 26 


A barge is being pulled along a canal by teams of horses on the shores. Analyze all the forces in which the 


barge participates. 


force acting on barge 


force related to it by Newton's third law 


ropes’ forward normal forces on barge barge’s backward normal force on ropes 


water’s backward fluid friction force on barge | barge’s forward fluid friction force on water 


on barge 


planet earth’s downward gravitational force | barge’s upward gravitational force on earth 


water's upward “floating” force on barge barge’s downward “floating” force on water 


Here I’ve used the word “floating” force as an example of a sensible invented term for a type of force not 
classified on the tree in the previous section. A more formal technical term would be “hydrostatic force.” 

Note how the pairs of forces are all structured as “A’s force on B, B’s force on A”: ropes on barge and barge 
on ropes; water on barge and barge on water. Because all the forces in the left column are forces acting on 
the barge, all the forces in the right column are forces being exerted by the barge, which is why each entry in 


the column begins with “barge.” 


Often you may be unsure whether you have missed one of the 
forces. Here are three strategies for checking your list: 


1. See what physical result would come from the forces you’ve 
found so far. Suppose, for instance, that you’d forgotten the 
“floating” force on the barge in the example above. Looking 
at the forces you’d found, you would have found that there 
was a downward gravitational force on the barge which was 
not canceled by any upward force. The barge isn’t supposed 
to sink, so you know you need to find a fourth, upward force. 


2. Whenever one solid object touches another, there will be 
a normal force, and possibly also a frictional force; check for 
both. 


3. Make a drawing of the object, and draw a dashed boundary 
line around it that separates it from its environment. Look for 
points on the boundary where other objects come in contact 
with your object. This strategy guarantees that you'll find 
every contact force that acts on the object, although it won’t 
help you to find non-contact forces. 
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The following is another example in which we can profit by check- 
ing against our physical intuition for what should be happening. 


‘ Rappelling 


example 27 


As shown in the figure below, Cindy is rappelling down a cliff. Her downward motion is at constant speed, and 
she takes little hops off of the cliff, as shown by the dashed line. Analyze the forces in which she participates 


at a moment when her feet are on the cliff and she is pushing off. 


force acting on Cindy force related to it by Newton's third law 
planet earth’s downward gravitational force | Cindy’s upward gravitational force on earth 
on Cindy 

ropes upward frictional force on Cindy (her | Cindy’s downward frictional force on the rope 
hand) 

cliff’s rightward normal force on Cindy Cindy’s leftward normal force on the cliff 


The two vertical forces cancel, which is what they should be doing if she is to go down at a constant rate. The 
only horizontal force on her is the cliff’s force, which is not canceled by any other force, and which therefore 
will produce an acceleration of Cindy to the right. This makes sense, since she is hopping off. (This solution 
is a little oversimplified, because the rope is slanting, so it also applies a small leftward force to Cindy. As she 


flies out to the right, the slant of the rope will increase, pulling her back in more strongly.) 


1 
' 
> 


I believe that constructing the type of table described in this 
section is the best method for beginning students. Most textbooks, 
however, prescribe a pictorial way of showing all the forces acting on 
an object. Such a picture is called a free-body diagram. It should 
not be a big problem if a future physics professor expects you to 
be able to draw such diagrams, because the conceptual reasoning 
is the same. You simply draw a picture of the object, with arrows 
representing the forces that are acting on it. Arrows representing 
contact forces are drawn from the point of contact, noncontact forces 
from the center of mass. Free-body diagrams do not show the equal 
and opposite forces exerted by the object itself. 


Discussion Questions 


A When you fire a gun, the exploding gases push outward in all direc- 
tions, causing the bullet to accelerate down the barrel. What Newton’s- 
third-law pairs are involved? [Hint: Remember that the gases themselves 
are an object.] 


B In the example of the barge going down the canal, | referred to 
a “floating” or “hydrostatic” force that keeps the boat from sinking. If you 
were adding a new branch on the force-classification tree to represent this 
force, where would it go? 
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Cc A pool ball is rebounding from the side of the pool table. Analyze 
the forces in which the ball participates during the short time when it is in 
contact with the side of the table. 


D The earth’s gravitational force on you, i.e., your weight, is always 
equal to mg, where m is your mass. So why can you get a shovel to go 
deeper into the ground by jumping onto it? Just because you’re jumping, 
that doesn’t mean your mass or weight is any greater, does it? 


3.2.7 Transmission of forces by low-mass objects 


You’re walking your dog. The dog wants to go faster than you 
do, and the leash is taut. Does Newton’s third law guarantee that 
your force on your end of the leash is equal and opposite to the dog’s 
force on its end? If they’re not exactly equal, is there any reason 
why they should be approximately equal? 


If there was no leash between you, and you were in direct contact 
with the dog, then Newton’s third law would apply, but Newton’s 
third law cannot relate your force on the leash to the dog’s force 
on the leash, because that would involve three separate objects. 
Newton’s third law only says that your force on the leash is equal 
and opposite to the leash’s force on you, 


Pyt = =F rags 


and that the dog’s force on the leash is equal and opposite to its 
force on the dog 
Far = —Fra. 


Still, we have a strong intuitive expectation that whatever force we 
make on our end of the leash is transmitted to the dog, and vice- 
versa. We can analyze the situation by concentrating on the forces 
that act on the leash, Fy, and Fy,. According to Newton’s second 
law, these relate to the leash’s mass and acceleration: 


Far + Fyt = ™mrpar. 


The leash is far less massive then any of the other objects involved, 
and if mz is very small, then apparently the total force on the leash 
is also very small, Fy, + Fy, © 0, and therefore 


Far © —Fyt 


Thus even though Newton’s third law does not apply directly to 
these two forces, we can approximate the low-mass leash as if it was 
not intervening between you and the dog. It’s at least approximately 
as if you and the dog were acting directly on each other, in which 
case Newton’s third law would have applied. 


In general, low-mass objects can be treated approximately as if 
they simply transmitted forces from one object to another. This can 
be true for strings, ropes, and cords, and also for rigid objects such 
as rods and sticks. 
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n/\f we imagine dividing a taut rope up into small segments, then 
any segment has forces pulling outward on it at each end. If the rope 
is of negligible mass, then all the forces equal +7 or —7T, where 7, the 
tension, is a single number. 


If you look at a piece of string under a magnifying glass as you 
pull on the ends more and more strongly, you will see the fibers 
straightening and becoming taut. Different parts of the string are 
apparently exerting forces on each other. For instance, if we think of 
the two halves of the string as two objects, then each half is exerting 
a force on the other half. If we imagine the string as consisting 
of many small parts, then each segment is transmitting a force to 
the next segment, and if the string has very little mass, then all 
the forces are equal in magnitude. We refer to the magnitude of 
the forces as the tension in the string, JT. Although the tension 
is measured in units of Newtons, it is not itself a force. There are 
many forces within the string, some in one direction and some in the 
other direction, and their magnitudes are only approximately equal. 
The concept of tension only makes sense as a general, approximate 
statement of how big all the forces are. 


If a rope goes over a pulley or around some other object, then the 
tension throughout the rope is approximately equal so long as there 
is not too much friction. A rod or stick can be treated in much the 
same way as a string, but it is possible to have either compression 
or tension. 


Since tension is not a type of force, the force exerted by a rope 
on some other object must be of some definite type such as static 
friction, kinetic friction, or a normal force. If you hold your dog’s 
leash with your hand through the loop, then the force exerted by the 
leash on your hand is a normal force: it is the force that keeps the 
leash from occupying the same space as your hand. If you grasp a 
plain end of a rope, then the force between the rope and your hand 
is a frictional force. 


A more complex example of transmission of forces is the way 
a car accelerates. Many people would describe the car’s engine as 
making the force that accelerates the car, but the engine is part of 
the car, so that’s impossible: objects can’t make forces on them- 
selves. What really happens is that the engine’s force is transmitted 
through the transmission to the axles, then through the tires to the 
road. By Newton’s third law, there will thus be a forward force from 
the road on the tires, which accelerates the car. 


o/The Golden Gate Bridge’s 
roadway is held up by the tension 
in the vertical cables. 
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Discussion Question 


A When you step on the gas pedal, is your foot’s force being transmitted 
in the sense of the word used in this section? 


3.2.8 Work 
Energy transferred to a particle 


To change the kinetic energy, K = (1/2)mv?, of a particle mov- 
ing in one dimension, we must change its velocity. That will entail a 
change in its momentum, p = mv, as well, and since force is the rate 
of transfer of momentum, we conclude that the only way to change 
a particle’s kinetic energy is to apply a force.® A force in the same 
direction as the motion speeds it up, increasing the kinetic energy, 
while a force in the opposite direction slows it down. 


Consider an infinitesimal time interval during which the particle 
moves an infinitesimal distance dz, and its kinetic energy changes 
by dk. In one dimension, we represent the direction of the force 
and the direction of the motion with positive and negative signs for 
F and dz, so the relationship among the signs can be summarized 
as follows: 


F>0O|dr>0)dk >0 
F<0|drx<0)/dk >0 
F>0O|dr<0)dk <0 
F<0|dr>0)dk <0 


This looks exactly like the rule for determining the sign of a product, 
and we can easily show using the chain rule that this is indeed a 
multiplicative relationship: 


= waar a [chain rule] 

= (mv)(a)(1/v)dx 

=madz 

= Fda [Newton’s second law] 


®The converse isn’t true, because kinetic energy doesn’t depend on the di- 
rection of motion, but momentum does. We can change a particle’s momentum 
without changing its energy, as when a pool ball bounces off a bumper, reversing 
the sign of p. 
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We can verify that force multiplied by distance has units of energy: 


ke- 

_ ke-m/s \ 
Ss 

= kg-m?/s? 

=J 


N-m 


A TV picture tube example 28 
> At the back of a typical TV’s picture tube, electrical forces accel- 
erate each electron to an energy of 5 x 10~'® J over a distance 
of about 1 cm. How much force is applied to a single electron? 
(Assume the force is constant.) What is the corresponding accel- 
eration? 


> Integrating 


dK = Fdx ; 
we find 
Ky — Kj = F(xX¢ — Xi) 
or 
AK = FAx 
The force is 
F=AK/Ax 
5x10" J 
~ 0.01 m 
=5x10-'4N 


This may not sound like an impressive force, but it’s enough to 
supply an electron with a spectacular acceleration. Looking up 
the mass of an electron on p. 887, we find 


a=F/m 
=5 x 10'6 m/s? 
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p/A_ simplified drawing of an 
airgun. 


q/The black box does work 
by reeling in its cable. 


An air gun example 29 
> An airgun, figure p, uses compressed air to accelerate a pellet. 
As the pellet moves from x; to Xo, the air decompresses, so the 
force is not constant. Using methods from chapter 5, one can 
show that the air’s force on the pellet is given by F = bx’”®. A 
typical high-end airgun used for competitive target shooting has 


x1 =0.046m 

%=0.41m , 
and 

b=4.4N-m’ 


What is the kinetic energy of the pellet when it leaves the muzzle? 
(Assume friction is negligible.) 


> Since the force isn’t constant, it would be incorrect to do F = 
AK/Ax. Integrating both sides of the equation dK = Fdx, we 
have 


AK 


I 
7 
roy 
x 


In general, when energy is transferred by a force,’ we use the 


term work to refer to the amount of energy transferred. This is dif- 
ferent from the way the word is used in ordinary speech. If you stand 
for a long time holding a bag of cement, you get tired, and everyone 
will agree that you’ve worked hard, but you haven’t changed the en- 
ergy of the cement, so according to the definition of the physics term, 
you haven’t done any work on the bag. There has been an energy 
transformation inside your body, of chemical energy into heat, but 
this just means that one part of your body did positive work (lost 
energy) while another part did a corresponding amount of negative 
work (gained energy). 


Work in general 


I derived the expression Fda for one particular type of kinetic- 
energy transfer, the work done in accelerating a particle, and then 
defined work as a more general term. Is the equation correct for 
other types of work as well? For example, if a force lifts a mass m 
against the resistance of gravity at constant velocity, the increase in 


"The part of the definition about “by a force” is meant to exclude the transfer 
of energy by heat conduction, as when a stove heats soup. 
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the mass’s gravitational energy is d(mgy) = mgdy = Fdy, so again 
the equation works, but this still doesn’t prove that the equation is 
always correct as a way of calculating energy transfers. 


Imagine a black box®, containing a gasoline-powered engine, 
which is designed to reel in a steel cable, exerting a certain force 
F. For simplicity, we imagine that this force is always constant, so 
we can talk about Az rather than an infinitesimal dz. If this black 
box is used to accelerate a particle (or any mass without internal 
structure), and no other forces act on the particle, then the original 
derivation applies, and the work done by the box is W = FAz. 
Since F’ is constant, the box will run out of gas after reeling in a 
certain amount of cable Ax. The chemical energy inside the box 
has decreased by —W, while the mass being accelerated has gained 
W worth of kinetic energy.? 


Now what if we use the black box to pull a plow? The energy in- 
crease in the outside world is of a different type than before; it takes 
the forms of (1) the gravitational energy of the dirt that has been 
lifted out to the sides of the furrow, (2) frictional heating of the dirt 
and the plowshare, and (3) the energy needed to break up the dirt 
clods (a form of electrical energy involving the attractions among 
the atoms in the clod). The box, however, only communicates with 
the outside world via the hole through which its cable passes. The 
amount of chemical energy lost by the gasoline can therefore only 
depend on F' and Az, so it is the same —W as when the box was 
being used to accelerate a mass, and thus by conservation of energy, 
the work done on the outside world is again W. 


This is starting to sound like a proof that the force-times-distance 
method is always correct, but there was one subtle assumption in- 
volved, which was that the force was exerted at one point (the end of 
the cable, in the black box example). Real life often isn’t like that. 
For example, a cyclist exerts forces on both pedals at once. Serious 
cyclists use toe-clips, and the conventional wisdom is that one should 
use equal amounts of force on the upstroke and downstroke, to make 
full use of both sets of muscles. This is a two-dimensional example, 
since the pedals go in circles. We’re only discussing one-dimensional 
motion right now, so let’s just pretend that the upstroke and down- 
stroke are both executed in straight lines. Since the forces are in 
opposite directions, one is positive and one is negative. The cyclist’s 
total force on the crank set is zero, but the work done isn’t zero. We 
have to add the work done by each stroke, W = Fi Aa, + FoArg. 
(I’m pretending that both forces are constant, so we don’t have to 
do integrals.) Both terms are positive; one is a positive number 


8 “Black box” is a traditional engineering term for a device whose inner work- 
ings we don’t care about. 

°For conceptual simplicity, we ignore the transfer of heat energy to the outside 
world via the exhaust and radiator. In reality, the sum of these energies plus the 
useful kinetic energy transferred would equal W. 


r/The wheel spinning in the 


air has Kem = O. The space 
shuttle has all its kinetic energy 
in the form of center of mass 
motion, K = Kgm. The rolling 
ball has some, but not all, of its 
energy in the form of center of 
mass motion, Kem < K.(Space 
Shuttle photo by NASA) 
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multiplied by a positive number, while the other is a negative times 
a negative. 


This might not seem like a big deal — just remember not to use 
the total force — but there are many situations where the total force 
is all we can measure. The ultimate example is heat conduction. 
Heat conduction is not supposed to be counted as a form of work, 
since it occurs without a force. But at the atomic level, there are 
forces, and work is done by one atom on another. When you hold a 
hot potato in your hand, the transfer of heat energy through your 
skin takes place with a total force that’s extremely close to zero. 
At the atomic level, atoms in your skin are interacting electrically 
with atoms in the potato, but the attractions and repulsions add 
up to zero total force. It’s just like the cyclist’s feet acting on the 
pedals, but with zillions of forces involved instead of two. There is 
no practical way to measure all the individual forces, and therefore 
we can’t calculate the total energy transferred. 


To summarize, )) Fjdx; is a correct way of calculating work, 
where F’; is the individual force acting on particle 7, which moves a 
distance dx;. However, this is only useful if you can identify all the 
individual forces and determine the distance moved at each point of 
contact. For convenience, [’ll refer to this as the work theorem. (It 
doesn’t have a standard name.) 


There is, however, something useful we can do with the total 
force. We can use it to calculate the part of the work done on an 
object that consists of a change in the kinetic energy it has due to 
the motion of its center of mass. The proof is essentially the same 
as the proof on p. 160, except that now we don’t assume the force 
is acting on a single particle, so we have to be a little more delicate. 
Let the object consist of n particles. Its total kinetic energy is 
K= ie (1/ 2)mjv;, but this is what we’ve already realized can’t 
be calculated using the total force. The kinetic energy it has due to 
motion of its center of mass is 


1 
2 
Kem = 2 MetotalVem 


Figure r shows some examples of the distinction between Ke, and 
K. Differentiating Kem, we have 


dKem = MtotalVemdvem 
dvem dt 


= MtotalUem —. ——da%em [chain rule] 
= Meouat oo datom [dé fdiicn = LO 


a APtotal 
dt 


= Fieigdton 


d&em [Ptotal = ™Mtotal eral 
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Pll call this the kinetic energy theorem — like the work theorem, it 
has no standard name. 


An ice skater pushing off from a wall example 30 
The kinetic energy theorem tells us how to calculate the skater'’s 
kinetic energy if we know the amount of force and the distance 
her center of mass travels while she is pushing off. 


The work theorem tells us that the wall does no work on the 
skater, since the point of contact isn’t moving. This makes sense, 
because the wall does not have any source of energy. 


Absorbing an impact without recoiling? example 31 
> Is it possible to absorb an impact without recoiling? For in- 
stance, if a ping-pong ball hits a brick wall, does the wall “give” at 
all? 


> There will always be a recoil. In the example proposed, the wall 
will surely have some energy transferred to it in the form of heat 
and vibration. The work theorem tells us that we can only have 
an energy transfer if the distance traveled by the point of contact 
is nonzero. 


Dragging a refrigerator at constant velocity example 32 
The fridge’s momentum is constant, so there is no net momen- 
tum transfer, and the total force on it must be zero: your force 
is canceling the floor’s kinetic frictional force. The kinetic energy 
theorem is therefore true but useless. It tells us that there is zero 
total force on the refrigerator, and that the refrigerator’s kinetic 
energy doesn’t change. 


The work theorem tells us that the work you do equals your hand’s 
force on the refrigerator multiplied by the distance traveled. Since 
we know the floor has no source of energy, the only way for the 
floor and refrigerator to gain energy is from the work you do. We 
can thus calculate the total heat dissipated by friction in the re- 
frigerator and the floor. 


Note that there is no way to find how much of the heat is dissi- 
pated in the floor and how much in the refrigerator. 


Accelerating a cart example 33 
If you push on a cart and accelerate it, there are two forces acting 
on the cart: your hand’s force, and the static frictional force of the 
ground pushing on the wheels in the opposite direction. 


Applying the work theorem to your force tells us how to calculate 
the work you do. 


Applying the work theorem to the floor’s force tells us that the 
floor does no work on the cart. There is no motion at the point 
of contact, because the atoms in the floor are not moving. (The 
atoms in the surface of the wheel are also momentarily at rest 
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when they touch the floor.) This makes sense, because the floor 
does not have any source of energy. 


The kinetic energy theorem refers to the total force, and because 
the floors backward force cancels part of your force, the total 
force is less than your force. This tells us that only part of your 
work goes into the kinetic energy associated with the forward mo- 
tion of the cart’s center of mass. The rest goes into rotation of the 
wheels. 


Discussion Questions 


A Criticize the following incorrect statement: “A force doesn’t do any 
work unless it’s causing the object to move.” 


B_ To stop your car, you must first have time to react, and then it takes 
some time for the car to slow down. Both of these times contribute to the 
distance you will travel before you can stop. The figure shows how the 
average stopping distance increases with speed. Because the stopping 
distance increases more and more rapidly as you go faster, the rule of 
one car length per 10 m.p.h. of speed is not conservative enough at high 
speeds. In terms of work and kinetic energy, what is the reason for the 
more rapid increase at high speeds? 


. \ . 
distance one car actual 


covered length stopping 
before per 10 distance 
reacting mph 


s / Discussion question B. 
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3.2.9 Simple machines 


Conservation of energy provided the necessary tools for analyz- 
ing some mechanical systems, such as the seesaw on page 85 and the 
pulley arrangements of the homework problems on page ??, but we 
could only analyze those machines by computing the total energy 
of the system. That approach wouldn’t work for systems like the 
biceps/forearm machine on page 85, or the one in figure t, where 
the energy content of the person’s body is impossible to compute 
directly. Even though the seesaw and the biceps/forearm system 
were clearly just two different forms of the lever, we had no way 
to treat them both on the same footing. We can now successfully 
attack such problems using the work and kinetic energy theorems. 


Constant tension around a pulley example 34 
> In figure t, what is the relationship between the force applied by 
the person’s hand and the force exerted on the block? 


> If we assume the rope and the pulley are ideal, i.e., frictionless 
and massless, then there is no way for them to absorb or release 
energy, so the work done by the hand must be the same as the 
work done on the block. Since the hand and the block move the 
same distance, the work theorem tells us the two forces are the 
same. 


Similar arguments provide an alternative justification for the state- 
ment made in section 3.2.7 that show that an idealized rope ex- 
erts the same force, the tension, anywhere it’s attached to some- 
thing, and the same amount of force is also exerted by each seg- 
ment of the rope on the neighboring segments. Going around an 
ideal pulley also has no effect on the tension. 


This is an example of a simple machine, which is any mechanical 
system that manipulates forces to do work. This particular ma- 
chine reverses the direction of the motion, but doesn’t change the 
force or the speed of motion. 


A mechanical advantage example 35 
The idealized pulley in figure u has negligible mass, so its kinetic 
energy is zero, and the kinetic energy theorem tells us that the 
total force on it is zero. We know, as in the preceding example, 
that the two forces pulling it to the right are equal to each other, 
so the force on the left must be twice as strong. This simple 
machine doubles the applied force, and we refer to this ratio as 
a mechanical advantage (M.A.) of 2. There’s no such thing as 
a free lunch, however; the distance traveled by the load is cut in 
half, and there is no increase in the amount of work done. 


Inclined plane and wedge example 36 
In figure v, the force applied by the hand is equal to the one ap- 
plied to the load, but there is a mechanical advantage compared 
to the force that would have been required to lift the load straight 


Len™ 


t/The force is transmitted to 


the block. 


u/A — mechanical 
of 2. 


fre*, 


advantage 


v/ An inclined plane. 
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w / A wedge. 


x / Archimedes’ screw 


up. The distance traveled up the inclined plane is greater by a 
factor of 1/sin 8, so by the work theorem, the force is smaller by 
a factor of sin 8, and we have M.A.=1/sin 8. The wedge, w, is 
similar. 


‘Archimedes’ screw example 37 
In one revolution, the crank travels a distance 27b, and the water 
rises by a height h. The mechanical advantage is 27b/h. 


3.2.10 Force related to interaction energy 


In section 2.3, we saw that there were two equivalent ways of 
looking at gravity, the gravitational field and the gravitational en- 
ergy. They were related by the equation dU = mgdr, so if we knew 
the field, we could find the energy by integration, U = { mgdr, and 
if we knew the energy, we could find the field by differentiation, 
g = (1/m)dU/dr. 


The same approach can be applied to other interactions, for 
example a mass on a spring. The main difference is that only in 
gravitational interactions does the strength of the interaction de- 
pend on the mass of the object, so in general, it doesn’t make sense 
to separate out the factor of m as in the equation dU = mgdr. Since 
F = mq is the gravitational force, we can rewrite the equation in 
the more suggestive form dU = F'dr. This form no longer refers to 
gravity specifically, and can be applied much more generally. The 
only remaining detail is that I’ve been fairly cavalier about positive 
and negative signs up until now. That wasn’t such a big problem 
for gravitational interactions, since gravity is always attractive, but 
it requires more careful treatment for nongravitational forces, where 
we don’t necessarily know the direction of the force in advance, and 
we need to use positive and negative signs carefully for the direction 
of the force. 


In general, suppose that forces are acting on a particle — we 
can think of them as coming from other objects that are “off stage” 
— and that the interaction between the particle and the off-stage 
objects can be characterized by an interaction energy, U, which 
depends only on the particle’s position, x. Using the kinetic energy 
theorem, we have dk = Fdz. (It’s not necessary to write Kem, since 
a particle can’t have any other kind of kinetic energy.) Conservation 
of energy tells us dk + dU = 0, so the relationship between force 


and interaction energy is dU = —F dz, or 
dU : 
P= taps [relationship between force and interaction energy] 
x 
‘Force exerted by a spring example 38 


> A mass is attached to the end of a spring, and the energy of the 
spring is U = (1/2)kx, where x is the position of the mass, and 
X = 0 is defined to be the equilibrium position. What is the force 
the spring exerts on the mass? Interpret the sign of the result. 
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> Differentiating, we find 


If x is positive, then the force is negative, i.e., it acts so as to bring 
the mass back to equilibrium, and similarly for x < 0 we have 
F>0. 


Most books do the F = —kx form before the U = (1/2)kx? form, 
and call it Hooke’s law. Neither form is really more fundamental 
than the other — we can always get from one to the other by 
integrating or differentiating. 


Newton's law of gravity example 39 
> Given the equation U = —Gm,mp2/r for the energy of gravita- 
tional interactions, find the corresponding equation for the grav- 
itational force on mass mo. Interpret the positive and negative 
signs. 


> We have to be a little careful here, because we've been taking 
r to be positive by definition, whereas the position, x, of mass ms 
could be positive or negative, depending on which side of m, it’s 
on. 


For positive x, we have r = x, and differentiation gives 


= —Gm4 m2 /x? 


As in the preceding example, we have F < 0 when x is positive, 
because the object is being attracted back toward x = 0. 


When x is negative, the relationship between r and x becomes 
r = —x, and the result for the force is the same as before, but with 
a minus sign. We can combine the two equations by writing 


|F| = Gmyme/r? ’ 


and this is the form traditionally known as Newton’s law of grav- 
ity. As in the preceding example, the U and F equations contain 
equivalent information, and neither is more fundamental than the 
other. 


Equilibrium example 40 
| previously described the condition for equilibrium as a local max- 
imum or minimum of U. A differentiable function has a zero deriva- 
tive at its extrema, and we can now relate this directly to force: 
zero force acts on an object when it is at equilibrium. 
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a/An_ x-versus-t graph for a 
swing pushed at resonance. 


b/A swing pushed at twice 
its resonant frequency. 


| i 


c/The F-versus-t graph for 
an impulsive driving force. 


d/A_ sinusoidal driving force. 


3.3. Resonance 


Resonance is a phenomenon in which an oscillator responds most 
strongly to a driving force that matches its own natural frequency 
of vibration. For example, suppose a child is on a playground swing 
with a natural frequency of 1 Hz. That is, if you pull the child 
away from equilibrium, release her, and then stop doing anything 
for a while, she’ll oscillate at 1 Hz. If there was no friction, as we 
assumed in section 2.5, then the sum of her gravitational and kinetic 
energy would remain constant, and the amplitude would be exactly 
the same from one oscillation to the next. However, friction is going 
to convert these forms of energy into heat, so her oscillations would 
gradually die out. To keep this from happening, you might give 
her a push once per cycle, i.e., the frequency of your pushes would 
be 1 Hz, which is the same as the swing’s natural frequency. As 
long as you stay in rhythm, the swing responds quite well. If you 
start the swing from rest, and then give pushes at 1 Hz, the swing’s 
amplitude rapidly builds up, as in figure a, until after a while it 
reaches a steady state in which friction removes just as much energy 
as you put in over the course of one cycle. 


self-check F 

In figure a, compare the amplitude of the cycle immediately following 
the first push to the amplitude after the second. Compare the energies 
as well. > Answer, p. 867 


What will happen if you try pushing at 2 Hz? Your first push 
puts in some momentum, p, but your second push happens after 
only half a cycle, when the swing is coming right back at you, with 
momentum —p! The momentum transfer from the second push is 
exactly enough to stop the swing. The result is a very weak, and 
not very sinusoidal, motion, b. 


Making the math easy 
This is a simple and physically transparent example of resonance: 
the swing responds most strongly if you match its natural rhythm. 
However, it has some characteristics that are mathematically ugly 
and possibly unrealistic. The quick, hard pushes are known as im- 
pulse forces, c, and they lead to an x-t graph that has nondifferen- 
tiable kinks. Impulsive forces like this are not only badly behaved 
mathematically, they are usually undesirable in practical terms. In 
a car engine, for example, the engineers work very hard to make 
the force on the pistons change smoothly, to avoid excessive vibra- 
tion. Throughout the rest of this section, we’ll assume a driving 
force that is sinusoidal, d, i.e., one whose F’-t graph is either a sine 
function or a function that differs from a sine wave in phase, such 
as a cosine. The force is positive for half of each cycle and negative 
for the other half, i.e., there is both pushing and pulling. Sinusoidal 
functions have many nice mathematical characteristics (we can dif- 
ferentiate and integrate them, and the sum of sinusoidal functions 
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that have the same frequency is a sinusoidal function), and they are 
also used in many practical situations. For instance, my garage door 
zapper sends out a sinusoidal radio wave, and the receiver is tuned 
to resonance with it. 


A second mathematical issue that I glossed over in the swing 
example was how friction behaves. In section 3.2.4, about forces 
between solids, the empirical equation for kinetic friction was inde- 
pendent of velocity. Fluid friction, on the other hand, is velocity- 
dependent. For a child on a swing, fluid friction is the most im- 
portant form of friction, and is approximately proportional to v?. 
In still other situations, e.g., with a low-density gas or friction be- 
tween solid surfaces that have been lubricated with a fluid such as 
oil, we may find that the frictional force has some other dependence 
on velocity, perhaps being proportional to v, or having some other 
complicated velocity dependence that can’t even be expressed with 
a simple equation. It would be extremely complicated to have to 
treat all of these different possibilities in complete generality, so for 
the rest of this section, we’ll assume friction proportional to velocity 


F=—bv ; 


simply because the resulting equations happen to be the easiest to 
solve. Even when the friction doesn’t behave in exactly this way, 
many of our results may still be at least qualitatively correct. 


3.3.1 Damped, free motion 
Numerical treatment 


An oscillator that has friction is referred to as damped. Let’s use 
numerical techniques to find the motion of a damped oscillator that 
is released away from equilibrium, but experiences no driving force 
after that. We can expect that the motion will consist of oscillations 
that gradually die out. 


In section 2.5, we simulated the undamped case using our tried 
and true Python function based on conservation of energy. Now, 
however, that approach becomes a little awkward, because it in- 
volves splitting up the path to be traveled into n tiny segments, but 
in the presence of damping, each swing is a little shorter than the 
last one, and we don’t know in advance exactly how far the oscilla- 
tion will get before turning around. An easier technique here is to 
use force rather than energy. Newton’s second law, a = F'/m, gives 
a = (—ka — bv)/m, where we’ve made use of the result of example 
38 for the force exerted by the spring. This becomes a little prettier 
if we rewrite it in the form 


ma+ bv + kx = 0 : 


which gives symmetric treatment to three terms involving x and its 
first and second derivatives, v and a. Now instead of calculating 
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the time At = Az/v required to move a predetermined distance 
Az, we pick At and determine the distance traveled in that time, 
Az = vAt. Also, we can no longer update v based on conservation of 
energy, since we don’t have any easy way to keep track of how much 
mechanical energy has been changed into heat energy. Instead, we 
recalculate the velocity using Av = aAt. 


1 import math 

2  k=39.4784 # chosen to give a period of 1 second 
3 m=1. 

4 b=0.211 # chosen to make the results simple 
5 x=1. 

6 v=0. 

7 t=0. 

8 dt=.01 

9  n=1000 

10 for j in range(n): 

11 x=x+v¥dt 

12 a= (-k*x-b*v) /m 

13 if (v>0) and (vta*dt<0) 

14 print("turnaround at t=",t,", x=",x) 

15 v=evtaxdt 

16 t=tt+dt 
turnaround at t= 0.99 , x= 0.899919262445 
turnaround at t= 1.99 , x= 0.809844934046 
turnaround at t= 2.99 , x= 0.728777519477 
turnaround at t= 3.99 , x= 0.655817260033 
turnaround at t= 4.99 , x= 0.590154191135 
turnaround at t= 5.99 , x= 0.531059189965 
turnaround at t= 6.99 , x= 0.477875914756 
turnaround at t= 7.99 , x= 0.430013546991 
turnaround at t= 8.99 , x= 0.386940256644 
turnaround at t= 9.99 , x= 0.348177318484 


The spring constant, k = 47 = 39.4784 N/m, is designed so 
that if the undamped equation f = (1/27),/k/m was still true, the 
frequency would be 1 Hz. We start by noting that the addition of a 
small amount of damping doesn’t seem to have changed the period 
at all, or at least not to within the accuracy of the calculation.!° 
You can check for yourself, however, that a large value of b, say 5 
N-s/m, does change the period significantly. 


We release the mass from x« = 1 m, and after one cycle, it only 
comes back to about z = 0.9 m. I chose b = 0.211 N-s/m by fiddling 


10This subroutine isn’t as accurate a way of calculating the period as the 
energy-based one we used in the undamped case, since it only checks whether 
the mass turned around at some point during the time interval At. 


172 Chapter 3 Conservation of Momentum 


around until I got this result, since a decrease of exactly 10% is 
easy to discuss. Notice how the amplitude after two cycles is about 
0.81 m, ie., 1 m times 0.97: the amplitude has again dropped by 
exactly 10%. This pattern continues for as long as the simulation 
runs, e.g., for the last two cycles, we have 0.34818 /0.38694=0.89982, 
or almost exactly 0.9 again. It might have seemed capricious when I 
chose to use the unrealistic equation F’ = —bv, but this is the payoff. 
Only with —bv friction do we get this kind of mathematically simple 
exponential decay. 


Because the decay is exponential, it never dies out completely; 
this is different from the behavior we would have had with Coulomb 
friction, which does make objects grind completely to a stop at some 
point. With friction that acts like F = —bv, v gets smaller as the 
oscillations get smaller. The smaller and smaller force then causes 
them to die out at a rate that is slower and slower. 


Analytic treatment 


Taking advantage of this unexpectedly simple result, let’s find 
an analytic solution for the motion. The numerical output suggests 
that we assume a solution of the form 


x = Ae sin(w yt + 6) , 


where the unknown constants w+ and c will presumably be related to 
m, b, and k. The constant c indicates how quickly the oscillations die 
out. The constant wy is, as before, defined as 27 times the frequency, 
with the subscript f to indicate a free (undriven) solution. All 
our equations will come out much simpler if we use ws everywhere 
instead of fs from now on, and, as physicists often do, I’ll generally 
use the word “frequency” to refer to w when the context makes it 
clear what I’m talking about. The phase angle 6 has no real physical 
significance, since we can define t = 0 to be any moment in time we 


like. 


self-check G 
In figure f, which graph has the greater value of c? ~—> Answer, p. 867 


The factor A for the initial amplitude can also be omitted with- 
out loss of generality, since the equation we’re trying to solve, ma+ 
bu + kx = 0, is linear. That is, v and a are the first and second 
derivatives of x, and the derivative of Ax is simply A times the 
derivative of x. Thus, if x(t) is a solution of the equation, then 
multiplying it by a constant gives an equally valid solution. This is 
another place where we see that a damping force proportional to v is 
the easiest to handle mathematically. For a damping force propor- 
tional to v?, for example, we would have had to solve the equation 
ma + bv? + kx = 0, which is nonlinear. 


For the purpose of determining wy and c, the most general form 
we need to consider is therefore « = e~“ sin wrt , whose first and 


e/A damped sine wave, 


of 


the form x = Ae~“sin(wt + 8). 


f / Self-check G. 
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SS 


g/A damped sine wave is 
compared with an undamped 
one, with m and k kept the same 
and only b changed. 


second derivatives are v = e~% ( 


—csinw st +wcosw rt) and a = 
ee G sin wrt — 2w¢ccos wrt — ws sinw,t). Plugging these into the 
equation ma + bv + kx = 0 and setting the sine and cosine parts 
equal to zero gives, after some tedious algebra, 


oe 
~ 2m 
and 
0h b2 
“F m  4m2 


Intuitively, we expect friction to “slow down” the motion, as when 
we ride a bike into a big patch of mud. “Slow down,” however, could 
have more than one meaning here. It could mean that the oscillator 
would take more time to complete each cycle, or it could mean that 
as time went on, the oscillations would die out, thus giving smaller 
velocities. 


Our mathematical results show that both of these things hap- 
pen. The first equation says that c, which indicates how quickly the 
oscillations damp out, is directly related to b, the strength of the 
damping. 


The second equation, for the frequency, can be compared with 
the result from page 116 of \/k/m for the undamped system. Let’s 
refer to this now as wo, to distinguish it from the actual frequency 
wy of the free oscillations when damping is present. The result for 
wy will be less than w,, due to the presence of the b?/4m? term. This 
tells us that the addition of friction to the system does increase the 
time required for each cycle. However, it is very common for the 
b?/4m? term to be negligible, so that WF XY Wo. 


Figure g shows an example. The damping here is quite strong: 
after only one cycle of oscillation, the amplitude has already been 
reduced by a factor of 2, corresponding to a factor of 4 in energy. 
However, the frequency of the damped oscillator is only about 1% 
lower than that of the undamped one; after five periods, the ac- 
cumulated lag is just barely visible in the offsetting of the arrows. 
We can see that extremely strong damping — even stronger than 
this — would have been necessary in order to make wf & wo a poor 
approximation. 


3.3.2 The quality factor 


It’s usually impractical to measure b directly and determine c 
from the equation c = b/2m. For a child on a swing, measuring 
6 would require putting the child in a wind tunnel! It’s usually 
much easier to characterize the amount of damping by observing the 
actual damped oscillations and seeing how many cycles it takes for 
the mechanical energy to decrease by a certain factor. The unitless 
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quality factor, Q, is defined as Q = w,/2c, and in the limit of weak 
damping, where w & Wo, this can be interpreted as the number of 
cycles required for the mechanical energy to fall off by a factor of 
e2™ — 535.49... Using this new quantity, we can rewrite the equation 
for the frequency of damped oscillations in the slightly more elegant 
form wf = Wor/1 — 1/4Q?. 

self-check H 

What if we wanted to make a simpler definition of Q, as the number of 

oscillations required for the vibrations to die out completely, rather than 

the number required for the energy to fall off by this obscure factor? pb 

Answer, p. 867 


A graph example 41 
The damped motion in figure g has Q ~ 4.5, giving \/1 — 1/4Q? ~ 
0.99, as claimed at the end of the preceding subsection. 


Exponential decay in a trumpet example 42 
> The vibrations of the air column inside a trumpet have a Q of 
about 10. This means that even after the trumpet player stops 
blowing, the note will keep sounding for a short time. If the player 
suddenly stops blowing, how will the sound intensity 20 cycles 
later compare with the sound intensity while she was still blowing? 


> The trumpet’s Q is 10, so after 10 cycles the energy will have 
fallen off by a factor of 535. After another 10 cycles we lose an- 
other factor of 535, so the sound intensity is reduced by a factor 
of 535 x 535 = 2.9 x 10°. 


The decay of a musical sound is part of what gives it its charac- 
ter, and a good musical instrument should have the right Q, but the 
Q that is considered desirable is different for different instruments. 
A guitar is meant to keep on sounding for a long time after a string 
has been plucked, and might have a Q of 1000 or 10000. One of the 
reasons why a cheap synthesizer sounds so bad is that the sound 
suddenly cuts off after a key is released. 


3.3.3 Driven motion 


The driven case is extremely important in science, technology, 
and engineering. We have an external driving force F = F,, sinwt, 
where the constant F;, indicates the maximum strength of the force 
in either direction. The equation of motion is now 


[1] ma+bv+kr = Fy, sinwt 


[equation of motion for a driven oscillator] 


After the driving force has been applied for a while, we expect that 
the amplitude of the oscillations will approach some constant value. 
This motion is known as the steady state, and it’s the most inter- 
esting thing to find out; as we’ll see later, the most general type of 
motion is only a minor variation on the steady-state motion. For 


Summary of Notation 


k 
m 
b 


T 
f 
w 


Wo 


Fin 


spring constant 

mass of the oscillator 

sets the amount of damp- 
ing, F = —bv 

period 

frequency, 1/T 

(Greek letter omega), an- 
gular frequency, 27f, often 
referred to simply as “fre- 
quency” 

frequency the oscillator 
would have without damp- 
ing, y/k/m 

frequency of the free vi- 
brations 

sets the time scale for the 
exponential decay enve- 
lope e~* of the free vibra- 
tions 

strength of the driving 
force, which is assumed to 
vary sinusoidally with fre- 
quency w 

amplitude of the steady- 
state response 

phase angle of the steady- 
state response 
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h/ Dependence of the amplitude 
and phase angle on the driving 
frequency, for an undamped 
oscillator. The amplitudes were 
calculated with F,,, m, and wo, all 
set to 1. 


the steady-state motion, we’re going to look for a solution of the 
form 
x = Asin(wt + 6) 


In contrast to the undriven case, here it’s not possible to sweep A 
and 6 under the rug. The amplitude of the steady-state motion, A, 
is actually the most interesting thing to know about the steady-state 
motion, and it’s not true that we still have a solution no matter how 
we fiddle with A; if we have a solution for a certain value of A, then 
multiplying A by some constant would break the equality between 
the two sides of the equation of motion. It’s also no longer true that 
we can get rid of 6 simply be redefining when we start the clock; 
here 6 represents a difference in time between the start of one cycle 
of the driving force and the start of the corresponding cycle of the 
motion. 


The velocity and acceleration are v = wAcos(wt + 6) and a = 
—w?Asin(wt + 6), and if we plug these into the equation of motion, 
[1], and simplify a little, we find 


F, 
[2] (k — mw?) sin(wt + 6) + wbcos(wt + 5) = i sin wt 
The sum of any two sinusoidal functions with the same frequency 
is also a sinusoidal, so the whole left side adds up to a sinusoidal. 
By fiddling with A and 6 we can make the amplitudes and phases 
of the two sides of the equation match up. 


Steady state, no damping 


A and 6 are easy to find in the case where there is no damping 
at all. There are now no cosines in equation [2] above, only sines, so 
if we wish we can set 6 to zero, and we find A = Fi, /(k — mw?) = 
Fy, /m(w2 — w?). This, however, makes A negative for w > wo. The 
variable 6 was designed to represent this kind of phase relationship, 
so we prefer to keep A positive and set 6 = am for w > wo. Our 
results are then 


~ m|w? — w| 


and 


s={ 0, w<Wo 


T, W>Wo 


The most important feature of the result is that there is a reso- 
nance: the amplitude becomes greater and greater, and approaches 
infinity, as w approaches the resonant frequency wy. This is the phys- 
ical behavior we anticipated on page 170 in the example of pushing 
a child on a swing. If the driving frequency matches the frequency 
of the free vibrations, then the driving force will always be in the 
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right direction to add energy to the swing. At a driving frequency 
very different from the resonant frequency, we might get lucky and 
push at the right time during one cycle, but our next push would 
come at some random point in the next cycle, possibly having the 
effect of slowing the swing down rather than speeding it up. 


The interpretation of the infinite amplitude at w = wy is that 
there really isn’t any steady state if we drive the system exactly at 
resonance — the amplitude will just keep on increasing indefinitely. 
In real life, the amplitude can’t be infinite both because there is 
always some damping and because there will always be some differ- 
ence, however small, between w and wy. Even though the infinity is 
unphysical, it has entered into the popular consciousness, starting 
with the eccentric Serbian-American inventor and physicist Nikola 
Tesla. Around 1912, the tabloid newspaper The World Today credu- 
lously reported a story which Tesla probably fabricated — or wildly 
exaggerated — for the sake of publicity. Supposedly he created a 
steam-powered device “no larger than an alarm clock,” containing 
a piston that could be made to vibrate at a tunable and precisely 
controlled frequency. “He put his little vibrator in his coat-pocket 
and went out to hunt a half-erected steel building. Down in the 
Wall Street district, he found one — ten stories of steel framework 
without a brick or a stone laid around it. He clamped the vibrator 
to one of the beams, and fussed with the adjustment [presumably 
hunting for the building’s resonant frequency] until he got it. Tesla 
said finally the structure began to creak and weave and the steel- 
workers came to the ground panic-stricken, believing that there had 
been an earthquake. Police were called out. Tesla put the vibrator 
in his pocket and went away. Ten minutes more and he could have 
laid the building in the street. And, with the same vibrator he could 
have dropped the Brooklyn Bridge into the East River in less than 
an hour.” 


The phase angle 6 also exhibits surprising behavior. As the fre- 
quency is tuned upward past resonance, the phase abruptly shifts so 
that the phase of the response is opposite to that of the driving force. 
There is a simple interpretation for this. The system’s mechanical 
energy can only change due to work done by the driving force, since 
there is no damping to convert mechanical energy to heat. In the 
steady state, then, the power transmitted by the driving force over 
a full cycle of motion must average out to zero. In general, the work 
theorem dE = Fdzx can always be divided by dt on both sides to 
give the useful relation P = Fv. If Fv is to average out to zero, 
then F' and v must be out of phase by +77/2, and since v is ahead of 
x by a phase angle of 7/2, the phase angle between x and F’ must 
be zero or 7. 


Given that these are the two possible phases, why is there a 
difference in behavior between w < wo and w > wo? At the low- 
frequency limit, consider w = 0, i.e., a constant force. A constant 
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force will simply displace the oscillator to one side, reaching an 
equilibrium that is offset from the usual one. The force and the 
response are in phase, e.g., if the force is to the right, the equilibrium 
will be offset to the right. This is the situation depicted in the 
amplitude graph of figure h at w = 0. The response, which is not 
zero, is simply this static displacement of the oscillator to one side. 


At high frequencies, on the other hand, imagine shaking the 
poor child on the swing back and forth with a force that oscillates 
at 10 Hz. This is so fast that there is essentially no time for the 
force F' = —kax from gravity and the chain to act from one cycle 
to the next. The problem becomes equivalent to the oscillation of 
a free object. If the driving force varies like sin(wt), with 6 = 0, 
then the acceleration is also proportional to the sine. Integrating, 
we find that the velocity goes like minus a cosine, and a second 
integration gives a position that varies as minus the sine — opposite 
in phase to the driving force. Intuitively, this mathematical result 
corresponds to the fact that at the moment when the object has 
reached its maximum displacement to the right, that is the time 
when the greatest force is being applied to the left, in order to turn 
it around and bring it back toward the center. 


A practice mute for a violin example 43 
The amplitude of the driven vibrations, A = Fm/(m|w? —w?|), con- 
tains an inverse proportionality to the mass of the vibrating object. 
This is simply because a given force will produce less accelera- 
tion when applied to a more massive object. An application is 
shown in figure 43. 


In a stringed instrument, the strings themselves don’t have enough 
surface area to excite sound waves very efficiently. In instruments 
of the violin family, as the strings vibrate from left to right, they 
cause the bridge (the piece of wood they pass over) to wiggle 
clockwise and counterclockwise, and this motion is transmitted to 
the top panel of the instrument, which vibrates and creates sound 
waves in the air. 


A string player who wants to practice at night without bothering 
the neighbors can add some mass to the bridge. Adding mass to 
the bridge causes the amplitude of the vibrations to be smaller, 
and the sound to be much softer. A similar effect is seen when 
an electric guitar is used without an amp. The body of an electric 
guitar is so much more massive than the body of an acoustic 
guitar that the amplitude of its vibrations is very small. 
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Steady state, with damping 


The extension of the analysis to the damped case involves some 
lengthy algebra, which I’ve outlined on page 857 in appendix 2. The 
results are shown in figure j. It’s not surprising that the steady state 
response is weaker when there is more damping, since the steady 
state is reached when the power extracted by damping matches the 
power input by the driving force. The maximum amplitude, at the 
peak of the resonance curve, is approximately proportional to Q. 


self-check | 

From the final result of the analysis on page 857, substitute w = Wo, 
and satisfy yourself that the result is proportional to Q. Why is Ares «x Q 
only an approximation? > Answer, p. 868 


What is surprising is that the amplitude is strongly affected by 
damping close to resonance, but only weakly affected far from it. In 
other words, the shape of the resonance curve is broader with more 
damping, and even if we were to scale up a high-damping curve so 
that its maximum was the same as that of a low-damping curve, it 
would still have a different shape. The standard way of describing 
the shape numerically is to give the quantity Aw, called the full 
width at half-mazimum, or FWHM, which is defined in figure k. 
Note that the y axis is energy, which is proportional to the square 
of the amplitude. Our previous observations amount to a statement 
that Aw is greater when the damping is stronger, i.e., when the Q 
is lower. It’s not hard to show from the equations on page 857 that 
for large Q, the FWHM is given approximately by 


Aw © Wo/Q 


Another thing we notice in figure j is that for small values of Q 
the frequency Wes of the maximum A is less than wo.!! At even 


"The relationship is Wmaz A/Wo = \/1 — 1/2Q?, which is similar in form to 
the equation for the frequency of the free vibration, wr/wo = \/1—1/4Q?. A 
subtle point here is that although the maximum of A and the maximum of A? 
must occur at the same frequency, the maximum energy does not occur, as we 
might expect, at the same frequency as the maximum of A?. This is because 
the interaction energy is proportional to A? regardless of frequency, but the 
kinetic energy is proportional to A?w?. The maximum energy actually occurs 
are precisely wo. 


i/Example 43: a viola without 
a mute (left), and with a mute 
(right). The mute doesn’t touch 
the strings themselves. 


energy of 
steady- 
state 
vibrations 


k/ The definition of Aw, the 
full width at half maximum. 
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j/ Dependence of the amplitude 
and phase angle on the driving 
frequency. The undamped case 
is Q = co, and the other curves 
represent Q=1, 3, and 10. Fm, m, 
and wo, are all set to 1. 


lower values of Q, like Q = 1, the A —w curve doesn’t even have a 
maximum near w > 0. 


An opera singer breaking a wineglass example 44 
In order to break a wineglass by singing, an opera singer must 
first tap the glass to find its natural frequency of vibration, and 
then sing the same note back, so that her driving force will pro- 
duce a response with the greatest possible amplitude. If she’s 
shopping for the right glass to use for this display of her prowess, 
she should look for one that has the greatest possible Q, since 
the resonance curve has a higher maximum for higher values of 
Q. 
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‘Collapse of the Nimitz Freeway example 45 
Figure | shows a section of the Nimitz Freeway in Oakland, CA, 
that collapsed during an earthquake in 1989. An earthquake con- 
sists of many low-frequency vibrations that occur simultaneously, 
which is why it sounds like a rumble of indeterminate pitch rather 
than a low hum. The frequencies that we can hear are not even 
the strongest ones; most of the energy is in the form of vibrations 
in the range of frequencies from about 1 Hz to 10 Hz. 


All the structures we build are resting on geological layers of dirt, 
mud, sand, or rock. When an earthquake wave comes along, the 
topmost layer acts like a system with a certain natural frequency 
of vibration, sort of like a cube of jello on a plate being shaken 
from side to side. The resonant frequency of the layer depends 
on how stiff it is and also on how deep it is. The ill-fated section 
of the Nimitz freeway was built on a layer of mud, and analysis by 
geologist Susan E. Hough of the U.S. Geological Survey shows 
that the mud layer’s resonance was centered on about 2.5 Hz, 
and had a width covering a range from about 1 Hz to 4 Hz. 


When the earthquake wave came along with its mixture of fre- 
quencies, the mud responded strongly to those that were close to 
its own natural 2.5 Hz frequency. Unfortunately, an engineering 
analysis after the quake showed that the overpass itself had a res- 
onant frequency of 2.5 Hz as well! The mud responded strongly to 
the earthquake waves with frequencies close to 2.5 Hz, and the 
bridge responded strongly to the 2.5 Hz vibrations of the mud, 
causing sections of it to collapse. 


Physical reason for the relationship between Q and the FWHM 


What is the reason for this surprising relationship between the 
damping and the width of the resonance? Fundamentally, it has to 
do with the fact that friction causes a system to lose its “memory” 
of its previous state. If the Pioneer 10 space probe, coasting through 
the frictionless vacuum of interplanetary space, is detected by aliens 
a million years from now, they will be able to trace its trajectory 
backwards and infer that it came from our solar system. On the 
other hand, imagine that I shove a book along a tabletop, it comes 
to rest, and then someone else walks into the room. There will be 
no clue as to which direction the book was moving before it stopped 
— friction has erased its memory of its motion. Now consider the 
playground swing driven at twice its natural frequency, figure m, 
where the undamped case is repeated from figure b on page 170. In 
the undamped case, the first push starts the swing moving with mo- 
mentum p, but when the second push comes, if there is no friction 
at all, it now has a momentum of exactly —p, and the momentum 
transfer from the second push is exactly enough to stop it dead. 
With moderate damping, however, the momentum on the rebound 
is not quite —p, and the second push’s effect isn’t quite as disas- 


1/The collapsed 
the Nimitz Freeway 
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undamped 


moderately damped 


strongly damped 


m/An_ x-versus-t graph _ of 
the steady-state motion of a 
swing being pushed at twice 
its resonant frequency by an 
impulsive force. 


trous. With very strong damping, the swing comes essentially to 
rest long before the second push. It has lost all its memory, and the 
second push puts energy into the system rather than taking it out. 
Although the detailed mathematical results with this kind of impul- 
sive driving force are different,!? the general results are the same 
as for sinusoidal driving: the less damping there is, the greater the 
penalty you pay for driving the system off of resonance. 


High-Q speakers example 46 
Most good audio speakers have Q = 1, but the resonance curve 
for a higher-Q oscillator always lies above the corresponding curve 
for one with a lower Q, so people who want their car stereos to 
be able to rattle the windows of the neighboring cars will often 
choose speakers that have a high Q. Of course they could just 
use speakers with stronger driving magnets to increase F,,, but 
the speakers might be more expensive, and a high-Q speaker 
also has less friction, so it wastes less energy as heat. 


One problem with this is that whereas the resonance curve of a 
low-Q speaker (its “response curve” or “frequency response” in 
audiophile lingo) is fairly flat, a higher-Q speaker tends to em- 
phasize the frequencies that are close to its natural resonance. 
In audio, a flat response curve gives more realistic reproduction 
of sound, so a higher quality factor, Q, really corresponds to a 
lower-quality speaker. 


Another problem with high-Q speakers is discussed in example 
49 on page 184. 


Changing the pitch of a wind instrument example 47 
> A saxophone player normally selects which note to play by 
choosing a certain fingering, which gives the saxophone a cer- 
tain resonant frequency. The musician can also, however, change 
the pitch significantly by altering the tightness of her lips. This 
corresponds to driving the horn slightly off of resonance. If the 
pitch can be altered by about 5% up or down (about one musi- 
cal half-step) without too much effort, roughly what is the Q of a 
saxophone? 


> Five percent is the width on one side of the resonance, so the 
full width is about 10%, Af/f ~ 0.1. The equation Aw = w,/Qis 
defined in terms of angular frequency, w = 27 f, and we’ve been 
given our data in terms of ordinary frequency, f. The factors of 27 


!2For example, the graphs calculated for sinusoidal driving have resonances 
that are somewhat below the natural frequency, getting lower with increasing 
damping, until for Q < 1 the maximum response occurs at w = 0. In figure m, 
however, we can see that impulsive driving at w = 2w. produces a steady state 
with more energy than at w = wo. 
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end up canceling out, however: 


Wo 
~ Aw 
2nfo 
~ OnAf 
fo 
a 3 
10 


In other words, once the musician stops blowing, the horn will 
continue sounding for about 10 cycles before its energy falls off 
by a factor of 535. (Blues and jazz saxophone players will typi- 
cally choose a mouthpiece that gives a low Q, so that they can 
produce the bluesy pitch-slides typical of their style. “Legit,” i.e., 
classically oriented players, use a higher-Q setup because their 
style only calls for enough pitch variation to produce a vibrato, 
and the higher Q makes it easier to play in tune.) 


Q of a radio receiver example 48 
> A radio receiver used in the FM band needs to be tuned in to 
within about 0.1 MHz for signals at about 100 MHz. What is its 
Q? 


> As in the last example, we’re given data in terms of fs, not ws, 
but the factors of 27 cancel. The resulting Q is about 1000, which 
is extremely high compared to the Q values of most mechanical 
systems. 


Transients 


What about the motion before the steady state is achieved? 
When we computed the undriven motion numerically on page 171, 
the program had to initialize the position and velocity. By changing 
these two variables, we could have gotten any of an infinite number 
of simulations.!? The same is true when we have an equation of mo- 
tion with a driving term, ma+bu+ka = F,, sinwt (p. 175, equation 
[1]). The steady-state solutions, however, have no adjustable param- 
eters at all — A and 6 are uniquely determined by the parameters 
of the driving force and the oscillator itself. If the oscillator isn’t 
initially in the steady state, then it will not have the steady-state 
motion at first. What kind of motion will it have? 


The answer comes from realizing that if we start with the so- 
lution to the driven equation of motion, and then add to it any 
solution to the free equation of motion, the result, 


x = Asin(wt + 6) + A’e~“ sin(wyt + 0’) ; 


13Tf you’ve learned about differential equations, you'll know that any second- 
order differential equation requires the specification of two boundary conditions 
in order to specify solution uniquely. 
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is also a solution of the driven equation. Here, as before, wy is the 
frequency of the free oscillations (we & wo for small Q), w is the 
frequency of the driving force, A and 6 are related as usual to the 
parameters of the driving force, and A’ and 6’ can have any values 
at all. Given the initial position and velocity, we can always choose 
A’ and 6’ to reproduce them, but this is not something one often 
has to do in real life. What’s more important is to realize that the 
second term dies out exponentially over time, decaying at the same 
rate at which a free vibration would. For this reason, the A’ term is 
called a transient. A high-Q oscillator’s transients take a long time 
to die out, while a low-Q oscillator always settles down to its steady 
state very quickly. 


Boomy bass example 49 
In example 46 on page 182, I’ve already discussed one of the 
drawbacks of a high-Q speaker, which is an uneven response 
curve. Another problem is that in a high-Q speaker, transients 
take a long time to die out. The bleeding-eardrums crowd tend 
to focus mostly on making their bass loud, so it’s usually their 
woofers that have high Qs. The result is that bass notes, “ring” 
after the onset of the note, a phenomenon referred to as “boomy 
bass.” 


Overdamped motion 


The treatment of free, damped motion on page 173 skipped 
over a subtle point: in the equation wp = /k/m-—b?/4m? = 
Wo\/1—1/4Q?, Q < 1/2 results in an answer that is the square 
root of a negative number. For example, suppose we had k = 0, 
which corresponds to a neutral equilibrium. A physical example 
would be a mass sitting in a tub of syrup. If we set it in motion, 
it won’t oscillate — it will simply slow to a stop. This system has 
@ = 0. The equation of motion in this case is ma+ bv = 0, or, more 
suggestively, 

dv 


dt 


One can easily verify that this has the solution v = (constant)e 
and integrating, we find x = (constant)e~"'/™ + (constant). In other 
words, the reason wf comes out to be mathematical nonsense!“ is 
that we were incorrect in assuming a solution that oscillated at a 
frequency wf. The actual motion is not oscillatory at all. 


m— + bv =0 


—bt/m 
Ps 


In general, systems with Q < 1/2, called overdamped systems, 
do not display oscillatory motion. Most cars’ shock absorbers are 
designed with Q = 1/2, since it’s undesirable for the car to undulate 
up and down for a while after you go over a bump. (Shocks with ex- 
tremely low values of Q are not good either, because such a system 
takes a very long time to come back to equilibrium.) It’s not par- 


™ Actually, if you know about complex numbers and Euler’s theorem, it’s not 
quite so nonsensical. 
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ticularly important for our purposes, but for completeness I’ll note, 
as you can easily verify, that the general solution to the equation of 
motion for 0 < Q < 1/2 is of the form x = Ae~* + Be~“, while 
Q = 1/2, called the critically damped case, gives x = (A+ Bt)e~. 
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a/The car can change its x 
and y motions by one square 
every turn. 


3.4 Motion In Three Dimensions 


3.4.1 The Cartesian perspective 


When my friends and I were bored in high school, we used to play 
a paper-and-pencil game which, although we never knew it, was Very 
Educational — in fact, it pretty much embodies the entire world- 
view of classical physics. To play the game, you draw a racetrack on 
graph paper, and try to get your car around the track before anyone 
else. The default is for your car to continue at constant speed in a 
straight line, so if it moved three squares to the right and one square 
up on your last turn, it will do the same this turn. You can also 
control the car’s motion by changing its Ax and Ay by up to one 
unit. If it moved three squares to the right last turn, you can have 
it move anywhere from two to four squares to the right this turn. 


b/ French mathematician René Descartes invented analytic geometry; 
Cartesian (xyz) coordinates are named after him. He did work in phi- 
losophy, and was particularly interested in the mind-body problem. He 
was a skeptic and an antiaristotelian, and, probably for fear of religious 
persecution, spent his adult life in the Netherlands, where he fathered 
a daughter with a Protestant peasant whom he could not marry. He kept 
his daughter’s existence secret from his enemies in France to avoid giving 
them ammunition, but he was crushed when she died of scarlatina at age 
5. A pious Catholic, he was widely expected to be sainted. His body was 
buried in Sweden but then reburied several times in France, and along 
the way everything but a few fingerbones was stolen by peasants who 
expected the body parts to become holy relics. 


The fundamental way of dealing with the direction of an ob- 
ject’s motion in physics is to use conservation of momentum, since 
momentum depends on direction. Up until now, we’ve only done 
momentum in one dimension. How does this relate to the racetrack 
game? In the game, the motion of a car from one turn to the next 
is represented by its Ax and Ay. In one dimension, we would only 
need Az, which could be related to the velocity, Ax/At, and the 
momentum, mAz/At. In two dimensions, the rules of the game 
amount to a statement that if there is no momentum transfer, then 
both mAa/At and mAy/At stay the same. In other words, there 
are two flavors of momentum, and they are separately conserved. 
All of this so far has been done with an artificial division of time 
into “turns,” but we can fix that by redefining everything in terms 
of derivatives, and for motion in three dimensions rather than two, 
we augment x and y with z: 


v= deat vy = dy/dt Uz = dz/dt 
and 


Px = MVz Py = MvVy Pz = Mvz 
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c/ Bullets are dropped and shot at the same time. 


We call these the zx, y, and z components of the velocity and the 
momentum. 


There is both experimental and theoretical evidence that the 
x, y, and z momentum components are separately conserved, and 
that a momentum transfer (force) along one axis has no effect on 
the momentum components along the other two axes. On page 
89, for example, I argued that it was impossible for an air hockey 
puck to make a 180-degree turn spontaneously, because then in the 
frame moving along with the puck, it would have begun moving 
after starting from rest. Now that we’re working in two dimensions, 
we might wonder whether the puck could spontaneously make a 90- 
degree turn, but exactly the same line of reasoning shows that this 
would be impossible as well, which proves that the puck can’t trade 
x-momentum for y-momentum. A more general proof of separate 
conservation will be given on page 212, after some of the appropriate 
mathematical techniques have been introduced. 


As an example of the experimental evidence for separate con- 
servation of the momentum components, figure c shows correct and 
incorrect predictions of what happens if you shoot a rifle and arrange 
for a second bullet to be dropped from the same height at exactly 
the same moment when the first one left the barrel. Nearly every- 
one expects that the dropped bullet will reach the dirt first, and 
Aristotle would have agreed, since he believed that the bullet had 
to lose its horizontal motion before it could start moving vertically. 
In reality, we find that the vertical momentum transfer between the 
earth and the bullet is completely unrelated to the horizontal mo- 
mentum. The bullet ends up with p, < 0, while the planet picks 
up an upward momentum p, > 0, and the total momentum in the 
y direction remains zero. Both bullets hit the ground at the same 
time. This is much simpler than the Aristotelian version! 


The Pelton waterwheel example 50 
> There is a general class of machines that either do work on a 
gas or liquid, like a boat’s propeller, or have work done on them 
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d/Two- surfaces’ that could 
be used to extract energy from a 
stream of water. 


ihe. 


= 


> 


sunlight 


e/An_ asteroid absorbs  visi- 
ble light from the sun, and gets 
rid of the energy by radiating 
infrared light. 


by a gas or liquid, like the turbine in a hydroelectric power plant. 
Figure d shows two types of surfaces that could be attached to 
the circumference of an old-fashioned waterwheel. Compare the 
force exerted by the water in the two cases. 


> Let the x axis point to the right, and the y axis up. In both 
cases, the stream of water rushes down onto the surface with 
momentum py; = —Po, where the subscript / stands for “initial,” 
i.e., before the collision. 


In the case of surface 1, the streams of water leaving the surface 
have no momentum in the y direction, and their momenta in the x 
direction cancel. The final momentum of the water is zero along 
both axes, so its entire momentum, —/,, has been transferred to 
the waterwheel. 


When the water leaves surface 2, however, its momentum isn’t 
zero. If we assume there is no friction, it’s py + = +p, with the 
positive sign indicating upward momentum. The change in the 
water’s momentum is py ¢ — Py, = 2, and the momentum trans- 
ferred to the waterwheel is —2po. 


Force is defined as the rate of transfer of momentum, so surface 
2 experiences double the force. A waterwheel constructed in this 
way is known as a Pelton waterwheel. 


The Yarkovsky effect example 51 
We think of the planets and asteroids as inhabiting their orbits 
permanently, but it is possible for an orbit to change over periods 
of millions or billions of years, due to a variety of effects. For 
asteroids with diameters of a few meters or less, an important 
mechanism is the Yarkovsky effect, which is easiest to understand 
if we consider an asteroid spinning about an axis that is exactly 
perpendicular to its orbital plane. 


The illuminated side of the asteroid is relatively hot, and radiates 
more infrared light than the dark (night) side. Light has momen- 
tum, and a total force away from the sun is produced by combined 
effect of the sunlight hitting the asteroid and the imbalance be- 
tween the momentum radiated away on the two sides. This force, 
however, doesn’t cause the asteroid’s orbit to change over time, 
since it simply cancels a tiny fraction of the sun’s gravitational 
attraction. The result is merely a tiny, undetectable violation of 
Kepler’s law of periods. 


Consider the sideways momentum transfers, however. In figure e, 
the part of the asteroid on the right has been illuminated for half a 
spin-period (half a “day”) by the sun, and is hot. It radiates more 
light than the morning side on the left. This imbalance produces 
a total force in the x direction which points to the left. If the aster- 
oid’s orbital motion is to the left, then this is a force in the same 
direction as the motion, which will do positive work, increasing 
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the asteroid’s energy and boosting it into an orbit with a greater 
radius. On the other hand, if the asteroid’s spin and orbital motion 
are in opposite directions, the Yarkovsky push brings the asteroid 
spiraling in closer to the sun. 


Calculations show that it takes on the order of 10” to 108 years for 
the Yarkovsky effect to move an asteroid out of the asteroid belt 
and into the vicinity of earth’s orbit, and this is about the same as 
the typical age of a meteorite as estimated by its exposure to cos- 
mic rays. The Yarkovsky effect doesn’t remove all the asteroids 
from the asteroid belt, because many of them have orbits that 
are stabilized by gravitational interactions with Jupiter. However, 
when collisions occur, the fragments can end up in orbits which 
are not stabilized in this way, and they may then end up reaching 
the earth due to the Yarkovsky effect. The cosmic-ray technique 
is really telling us how long it has been since the fragment was 
broken out of its parent. 


Discussion Questions 


A The following is an incorrect explanation of a fact about target shoot- 
ing: 

“Shooting a high-powered rifle with a high muzzle velocity is different from 
shooting a less powerful gun. With a less powerful gun, you have to aim 
quite a bit above your target, but with a more powerful one you don’t have 
to aim so high because the bullet doesn’t drop as fast.” 


What is the correct explanation? 


a 


f / Discussion question A. 


B You have thrown a rock, and it is flying through the air in an arc. If 
the earth’s gravitational force on it is always straight down, why doesn’t it 
just go straight down once it leaves your hand? 


Cc Consider the example of the bullet that is dropped at the same 
moment another bullet is fired from a gun. What would the motion of the 
two bullets look like to a jet pilot flying alongside in the same direction as 
the shot bullet and at the same horizontal speed? 


3.4.2 Rotational invariance 


The Cartesian approach requires that we choose x, y, and z 
axes. How do we choose them correctly? The answer is that it had 
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better not matter which directions the axes point (provided they’re 
perpendicular to each other), or where we put the origin, because if 
it did matter, it would mean that space was asymmetric. If there 
was a certain point in the universe that was the right place to put 
the origin, where would it be? The top of Mount Olympus? The 
United Nations headquarters? We find that experiments come out 
the same no matter where we do them, and regardless of which way 
the laboratory is oriented, which indicates that no location in space 
or direction in space is special in any way.!° 


This is closely related to the idea of Galilean relativity stated on 
page 62, from which we already know that the absolute motion of 
a frame of reference is irrelevant and undetectable. Observers using 
frames of reference that are in motion relative to each other will not 
even agree on the permanent identity of a particular point in space, 
so it’s not possible for the laws of physics to depend on where you are 
in space. For instance, if gravitational energies were proportional 
to mymz in one location but to (mymz)!! in another, then it 
would be possible to determine when you were in a state of absolute 
motion, because the behavior of gravitational interactions would 
change as you moved from one region to the other. 


Because of this close relationship, we restate the principle of 
Galilean relativity in a more general form. This extended principle 
of Galilean relativity states that the laws of physics are no different 
in one time and place than in another, and that they also don’t de- 
pend on your orientation or your motion, provided that your motion 
is in a straight line and at constant speed. 


The irrelevance of time and place could have been stated in chap- 
ter 1, but since this section is the first one in which we’re dealing 
with three-dimensional physics in full generality, the irrelevance of 
orientation is what we really care about right now. This property of 
the laws of physics is called rotational invariance. The word “invari- 
ance” means a lack of change, i.e., the laws of physics don’t change 
when we reorient our frame of reference. 


Rotational invariance of gravitational interactions example 52 
Gravitational energies depend on the quantity 1/r, which by the 
Pythagorean theorem equals 


, 
VJ Ax? + Ay? + Az? 


Rotating a line segment doesn’t change its length, so this expres- 
sion comes out the same regardless of which way we orient our 


Of course, you could tell in a sealed laboratory which way was down, but 
that’s because there happens to be a big planet nearby, and the planet’s grav- 
itational field reaches into the lab, not because space itself has a special down 
direction. Similarly, if your experiment was sensitive to magnetic fields, it might 
matter which way the building was oriented, but that’s because the earth makes 
a magnetic field, not because space itself comes equipped with a north direction. 
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coordinate axes. Even though Ax, Ay, and Az are different in 
differently oriented coordinate systems, r is the same. 


Kinetic energy example 53 
Kinetic energy equals (1/2) mv, but what does that mean in three 
dimensions, where we have vx, vy, and vz? If you were tempted 
to add the components and calculate K = (1/2)m(Vvy + vy + vz)?, 
figure g should convince you otherwise. Using that method, we'd 
have to assign a kinetic energy of zero to ball number 1, since 
its negative vy would exactly cancel its positive vx, whereas ball 
number 2’s kinetic energy wouldn’t be zero. This would violate g / Two balls roll down a cone and 
rotational invariance, since the balls would behave differently. onto a plane. 


The only possible way to generalize kinetic energy to three di- 
mensions, without violating rotational invariance, is to use an ex- 
pression that resembles the Pythagorean theorem, 


which results in 


Since the velocity components are squared, the positive and neg- 
ative signs don’t matter, and the two balls in the example behave 
the same way. 
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3.4.3 Vectors 


Remember the title of this book? It would have been possible to 
obtain the result of example 53 by applying the Pythagorean theo- 
rem to dz, dy, and dz, and then dividing by dt, but the rotational 
invariance approach is simpler, and is useful in a much broader 
context. Even with a quantity you presently know nothing about, 
say the magnetic field, you can infer that if the components of the 
magnetic field are B,, By, and B,, then the physically useful way 
to talk about the strength of the magnetic field is to define it as 


Be+ Be + B?. Nature knows your brain cells are precious, and 


doesn’t want you to have to waste them by memorizing mathemat- 
ical rules that are different for magnetic fields than for velocities. 


When mathematicians see that the same set of techniques is 
useful in many different contexts, that’s when they start making 
definitions that allow them to stop reinventing the wheel. The an- 
cient Greeks, for example, had no general concept of fractions. They 
couldn’t say that a circle’s radius divided by its diameter was equal 
to the number 1/2. They had to say that the radius and the diameter 
were in the ratio of one to two. With this limited number concept, 
they couldn’t have said that water was dripping out of a tank at 
a rate of 3/4 of a barrel per day; instead, they would have had to 
say that over four days, three barrels worth of water would be lost. 
Once enough of these situations came up, some clever mathemati- 
cian finally realized that it would make sense to define something 
called a fraction, and that one could think of these fraction thingies 
as numbers that lay in the gaps between the traditionally recognized 
numbers like zero, one, and two. Later generations of mathemati- 
cians introduced further subversive generalizations of the number 
concepts, inventing mathematical creatures like negative numbers, 
and the square root of two, which can’t be expressed as a fraction. 


In this spirit, we define a vector as any quantity that has both 
an amount and a direction in space. In contradistinction, a scalar 
has an amount, but no direction. Time and temperature are scalars. 
Velocity, acceleration, momentum, and force are vectors. In one di- 
mension, there are only two possible directions, and we can use pos- 
itive and negative numbers to indicate the two directions. In more 
than one dimension, there are infinitely many possible directions, so 
we can’t use the two symbols + and — to indicate the direction of 
a vector. Instead, we can specify the three components of the vec- 
tor, each of which can be either negative or positive. We represent 
vector quantities in handwriting by writing an arrow above them, 
so for example the momentum vector looks like this, p, but the ar- 
row looks ugly in print, so in books vectors are usually shown in 
bold-face type: p. A straightforward way of thinking about vectors 
is that a vector equation really represents three different equations. 
For instance, conservation of momentum could be written in terms 
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of the three components, 


Apy = 0 
Ap, = 0 
Ap, =0 ) 


or as a single vector equation,!® 


Ap = 0 


The following table summarizes some vector operations. 


operation definition 


|vector| vector? + vector? + vector? 


vector + vector | Add component by component. 
vector — vector | Subtract component by component. 
vector - scalar | Multiply each component by the scalar. 
vector / scalar | Divide each component by the scalar. 


The first of these is called the magnitude of the vector; in one dimen- 
sion, where a vector only has one component, it amounts to taking 
the absolute value, hence the similar notation. 


self-check J 

Translate the equations Fy=ma,, Fy=may, and Fz=maz into a single 
equation in vector notation. > Answer, p. 868 
An explosion example 54 


> Astronomers observe the planet Mars as the Martians fight a 
nuclear war. The Martian bombs are so powerful that they rip the 
planet into three separate pieces of liquefied rock, all having the 
same mass. If one fragment flies off with velocity components 
Vix = 0, V4y=1.0x10* km/hr, and the second with Voy=1.0x104 
km/hr, V2y = 0, what is the magnitude of the third one’s velocity? 


> We work the problem in the center of mass frame, in which 
the planet initially had zero momentum. After the explosion, the 
vector sum of the momenta must still be zero. Vector addition can 
be done by adding components, so 


MV1x + MVoy + MV3y = 0 
and 
MVyy + MV2y + MVzy = 0 ; 


where we have used the same symbol m for all the terms, be- 
cause the fragments all have the same mass. The masses can 


'6The zero here is really a zero vector, i.e., a vector whose components are 
all zero, so we should really represent it with a boldface 0. There’s usually not 
much danger of confusion, however, so most books, including this one, don’t use 
boldface for the zero vector. 


h/ Example 54. 
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y component 
(negative) 
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j/The geometric  interpreta- 
tion of a vectors components. 


be eliminated by dividing each equation by m, and we find 


V3x = —1.0x10* km/hr 
V3y =—1.0x10* km/hr, 


which gives a magnitude of 


V3] = 4/ V3 + V3y 


= 1.4x10* km/hr 


A toppling box example 55 
If you place a box on a frictionless surface, it will fall over with a 
very complicated motion that is hard to predict in detail. We know, 
however, that its center of mass’s motion is related to its momen- 
tum, and the rate at which momentum is transferred is the force. 
Moreover, we know that these relationships apply separately to 
each component. Let x and y be horizontal, and z vertical. There 
are two forces on the box, an upward force from the table and a 
downward gravitational force. Since both of these are along the 
Z axis, Pz is the only component of the box’s momentum that can 
change. We conclude that the center of mass travels vertically. 
This is true even if the box bounces and tumbles. [Based on an 
example by Kleppner and Kolenkow.] 


Geometric representation of vectors 


A vector in two dimensions can be easily visualized by drawing 
an arrow whose length represents its magnitude and whose direction 
represents its direction. The « component of a vector can then be 
visualized, j, as the length of the shadow it would cast in a beam of 
light projected onto the x axis, and similarly for the y component. 
Shadows with arrowheads pointing back against the direction of the 
positive axis correspond to negative components. 


In this type of diagram, the negative of a vector is the vector 
with the same magnitude but in the opposite direction. Multiplying 
a vector by a scalar is represented by lengthening the arrow by that 
factor, and similarly for division. 

self-check K 


Given vector Q represented by an arrow below, draw arrows represent- 
ing the vectors 1.5Q and —Q. 


ie ce: 


> Answer, p. 868 
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A useless vector operation example 56 
The way I’ve defined the various vector operations above aren’t 
as arbitrary as they seem. There are many different vector oper- 
ations that we could define, but only some of the possible defini- 
tions are mathematically useful. Consider the operation of mul- 
tiplying two vectors component by component to produce a third 
vector: 


Ry = PyQy 
Ry = PyQy 
Rz = PzQ; 


As a simple example, we choose vectors P and Q to have length 
1, and make them perpendicular to each other, as shown in figure 
k/1. If we compute the result of our new vector operation using 
the coordinate system shown in k/2, we find: 


R, =0 
Ry =0 
R,=0 


The x component is zero because P, = 0, the y component is 
zero because Qy = 0, and the z component is of course zero be- 
cause both vectors are in the x-y plane. However, if we carry out 
the same operations in coordinate system k/3, rotated 45 degrees 
with respect to the previous one, we find 


Ry, = —1/2 
R,=0 


The operation’s result depends on what coordinate system we 
use, and since the two versions of R have different lengths (one 
being zero and the other nonzero), they don’t just represent the 
same answer expressed in two different coordinate systems. Such 
an operation will never be useful in physics, because experiments 
show physics works the same regardless of which way we orient 
the laboratory building! The useful vector operations, such as 
addition and scalar multiplication, are rotationally invariant, i.e., 
come out the same regardless of the orientation of the coordi- 
nate system. 


All the vector techniques can be applied to any kind of vector, 
but the graphical representation of vectors as arrows is particularly 
natural for vectors that represent lengths and distances. We define a 
vector called r whose components are the coordinates of a particular 
point in space, x, y, and z. The Ar vector, whose components are 
Ax, Ay, and Az, can then be used to represent motion that starts at 


Q 
_ 
(1) 
y 
(2) 
X 
y Xx 
(3) 


k/ Two. vectors, 


1, 


to which 


we apply the same operation in 
two different frames of reference, 


2 and 3. 
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one point and ends at another. Adding two Ar vectors is interpreted 
as a trip with two legs: by computing the Ar vector going from point 
A to point B plus the vector from B to C, we find the vector that 
would have taken us directly from A to C. 


Calculations with magnitude and direction 


If you ask someone where Las Vegas is compared to Los Angeles, 
she is unlikely to say that the Az is 290 km and the Ay is 230 km, 
in a coordinate system where the positive x axis is east and the y 
axis points north. She will probably say instead that it’s 370 km 
to the northeast. If she was being precise, she might specify the 
direction as 38° counterclockwise from east. In two dimensions, we 
can always specify a vector’s direction like this, using a single angle. 
A magnitude plus an angle suffice to specify everything about the 
vector. The following two examples show how we use trigonometry 
and the Pythagorean theorem to go back and forth between the x-y 
and magnitude-angle descriptions of vectors. 


Finding magnitude and angle from components example 57 
> Given that the Ar vector from LA to Las Vegas has Ax=290 km 
and Ay=230 km, how would we find the magnitude and direction 
of Ar? 


> We find the magnitude of Ar from the Pythagorean theorem: 


|Ar| = ,/Ax2 + Ay? 


= 370 km 


We know all three sides of the triangle, so the angle 6 can be 
found using any of the inverse trig functions. For example, we 
know the opposite and adjacent sides, so 


Ay 
= 

_ ocd 
8 = tan : 


= 38° 


Finding the components from the magnitude and angle example 
58 

> Given that the straight-line distance from Los Angeles to Las 
Vegas is 370 km, and that the angle 6 in the figure is 38°, how 
can the x and y components of the Ar vector be found? 


> The sine and cosine of 9 relate the given information to the 
information we wish to find: 


AX 

cos 8 = —— 
|Ar| 

é Ay 
sin 8 = —— 
|Ar| 
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Solving for the unknowns gives 


Ax = |Ar| cos 0 
= 290 km 

Ay = |Ar| sin 6 
= 230 km 


The following example shows the correct handling of the plus 
and minus signs, which is usually the main cause of mistakes by 
students. 


Negative components example 59 
> San Diego is 120 km east and 150 km south of Los Angeles. An 
airplane pilot is setting course from San Diego to Los Angeles. At 
what angle should she set her course, measured counterclock- 
wise from east, as shown in the figure? 


> If we make the traditional choice of coordinate axes, with x 
pointing to the right and y pointing up on the map, then her Ax is 
negative, because her final x value is less than her initial x value. 
Her Ay is positive, so we have 

Ax = —120 km 

Ay = 150 km 


If we work by analogy with the example 57, we get 


_, Ay 
Tar 
8 =tan 7 
= tan! (—1.25) 
= —51° 


According to the usual way of defining angles in trigonometry, 
a negative result means an angle that lies clockwise from the x 
axis, which would have her heading for the Baja California. What 
went wrong? The answer is that when you ask your calculator to 
take the arctangent of a number, there are always two valid pos- 
sibilities differing by 180°. That is, there are two possible angles 
whose tangents equal -1.25: 


tan 129° = —1.25 
tan (—51°) = —1.25 
You calculator doesn’t Know which is the correct one, so it just 
picks one. In this case, the one it picked was the wrong one, and 
it was up to you to add 180° to it to find the right answer. 
Addition of vectors given their components 


The easiest type of vector addition is when you are in possession 
of the components, and want to find the components of their sum. 


Los 


Ax 


m/ Example 59. 
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Las V — - ——— - = 
ae ‘San Diego to Las Vegas example 60 
> Given the Ax and Ay values from the previous examples, find 
the Ax and Ay from San Diego to Las Vegas. 


> 
AXtotal = AX, + AXo 
= —120 km+ 290 km 
Los = 170 km 
Angeles SEs AYtotal = AY1 + Ay2 
= 150 km + 230 km 
n/ Example 60. = 380 


Addition of vectors given their magnitudes and directions 


In this case, you must first translate the magnitudes and direc- 
tions into components, and the add the components. 


Graphical addition of vectors 


Often the easiest way to add vectors is by making a scale drawing 
on a piece of paper. This is known as graphical addition, as opposed 
to the analytic techniques discussed previously. 


o / Example 61. a Las Vegas a 


Los 
Angeles 
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From San Diego to Las Vegas, graphically example 61 
> Given the magnitudes and angles of the Ar vectors from San 
Diego to Los Angeles and from Los Angeles to Las Vegas, find 
the magnitude and angle of the Ar vector from San Diego to Las 
Vegas. 


> Using a protractor and a ruler, we make a careful scale drawing, 
as shown in figure 0 on page 198. A scale of 1 cm<10 km was 
chosen for this solution. With a ruler, we measure the distance 
from San Diego to Las Vegas to be 3.8 cm, which corresponds to 
380 km. With a protractor, we measure the angle 0 to be 71°. 


Even when we don’t intend to do an actual graphical calculation 
with a ruler and protractor, it can be convenient to diagram the 
addition of vectors in this way, as shown in figure p. With Ar 
vectors, it intuitively makes sense to lay the vectors tip-to-tail and 
draw the sum vector from the tail of the first vector to the tip of 
the second vector. We can do the same when adding other vectors 
such as force vectors. 


Unit vector notation 


When we want to specify a vector by its components, it can be 
cumbersome to have to write the algebra symbol for each compo- 
nent: 

Ax = 290 km, Ay = 230 km 


A more compact notation is to write 
Ar = (290 km)x + (230 km)y, 


where the vectors X, y, and z, called the unit vectors, are defined 
as the vectors that have magnitude equal to 1 and directions lying 
along the x, y, and z axes. In speech, they are referred to as “x-hat,” 
“y-hat,” and “z-hat.” 


A slightly different, and harder to remember, version of this 
notation is unfortunately more prevalent. In this version, the unit 
vectors are called i, j, and k: 


Ar = (290 km)i+ (230 km)j 


Applications to relative motion, momentum, and force 


Vector addition is the correct way to generalize the one-dimensional 


concept of adding velocities in relative motion, as shown in the fol- 
lowing example: 


Velocity vectors in relative motion example 62 
> You wish to cross a river and arrive at a dock that is directly 
across from you, but the river's current will tend to carry you 
downstream. To compensate, you must steer the boat at an an- 
gle. Find the angle 0, given the magnitude, |vy |, of the water’s 


p / Adding 


vectors 


A+B 


Ve 


graphically 


by placing them tip-to-tail, like a 


train. 
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velocity relative to the land, and the maximum speed, |Vgy, of 
which the boat is capable relative to the water. 


> The boat’s velocity relative to the land equals the vector sum of 
its velocity with respect to the water and the water’s velocity with 
respect to the land, 

Ve. = Vaw + Vw 


If the boat is to travel straight across the river, i.e., along the y 
axis, then we need to have vg,_, = 0. This x component equals 
the sum of the x components of the other two vectors, 


VeL, xX =VBW,x+VwL, x , 


or 
0= —|Vew| sin@ + \Vw_ 


Solving for 0, we find 


sin 8 = |Vwz|/|Vaw| 


so 
__4|V 
8 =sin 1 IVwel 
VBw 
‘How to generalize one-dimensional equations example 63 


> How can the one-dimensional relationships 
Ptotal = total Vem 
and 


ae dj MX 
cm — 
yy 


be generalized to three dimensions? 


> Momentum and velocity are vectors, since they have directions 
in space. Mass is a scalar. If we rewrite the first equation to show 
the appropriate quantities notated as vectors, 


Protal = MtotalVcm , 


we get a valid mathematical operation, the multiplication of a vec- 
tor by a scalar. Similarly, the second equation becomes 


which is also valid. Each term in the sum on top contains a vector 
multiplied by a scalar, which gives a vector. Adding up all these 
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vectors gives a vector, and dividing by the scalar sum on the bot- 
tom gives another vector. 


This kind of wave-the-magic-wand-and-write-it-all-in-bold-face tech- 
nique will always give the right generalization from one dimension 
to three, provided that the result makes sense mathematically — 
if you find yourself doing something nonsensical, such as adding 

a scalar to a vector, then you haven't found the generalization 
correctly. 


Colliding coins example 64 
> Take two identical coins, put one down on a piece of paper, and 
slide the other across the paper, shooting it fairly rapidly so that 
it hits the target coin off-center. If you trace the initial and final 
positions of the coins, you can determine the directions of their 
momentum vectors after the collision. The angle between these 
vectors is always fairly close to, but a little less than, 90 degrees. 
Why is this? 


> Let the velocity vector of the incoming coin be a, and let the two 

outgoing velocity vectors be b and c. Since the masses are the 

same, conservation of momentum amounts to a = b +, which 

means that it has to be possible to assemble the three vectors 

into a triangle. If we assume that no energy is converted into 

heat and sound, then conservation of energy gives (discarding F 
the common factor of m/2) a* = b* + c? for the magnitudes of the (1) 
three vectors. This is the Pythagorean theorem, which will hold 

only if the three vectors form a right triangle. 


The fact that we observe the angle to be somewhat less than 
90 degrees shows that the assumption used in the proof is only 
approximately valid: a little energy is converted into heat and 
sound. The opposite case would be a collision between two blobs 
of putty, where the maximum possible amount of energy is con- 
verted into heat and sound, the two blobs fly off together, giving 
an angle of zero between their momentum vectors. The real-life 
experiment interpolates between the ideal extremes of 0 and 90 
degrees, but comes much closer to 90. 


Force is a vector, and we add force vectors when more than one 
force acts on the same object. 


Pushing a block up a ramp example 65 (3) 
> Figure r/1 shows a block being pushed up a frictionless ramp F 
at constant speed by an applied force Fz. How much force is Fh 
required, in terms of the block’s mass, m, and the angle of the 

ramp, 0? 


et neh r/ Example 65. 
> We analyzed this simple machine in example 36 on page 167 


using the concept of work. Here we'll do it using vector addition 
of forces. Figure r/2 shows the other two forces acting on the 
block: a normal force, Fy, created by the ramp, and the gravita- 
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s/ Archimedes’ principle works 
regardless of whether the object 
is a cube. The fluid makes a force 
on every square millimeter of the 
object’s surface. 


tional force, Fg. Because the block is being pushed up at constant 
speed, it has zero acceleration, and the total force on it must be 
zero. In figure r/3, we position all the force vectors tip-to-tail for 
addition. Since they have to add up to zero, they must join up 
without leaving a gap, so they form a triangle. Using trigonometry 
we find 


= mg sin 0 


Buoyancy, again example 66 
In example 10 on page 85, we found that the energy required to 
raise a cube immersed in a fluid is as if the cube’s mass had been 
reduced by an amount equal to the mass of the fluid that other- 
wise would have been in the volume it occupies (Archimedes’ 
principle). From the energy perspective, this effect occurs be- 
cause raising the cube allows a certain amount of fluid to move 
downward, and the decreased gravitational energy of the fluid 
tends to offset the increased gravitational energy of the cube. The 
proof given there, however, could not easily be extended to other 
shapes. 


Thinking in terms of force rather than energy, it becomes easier 
to give a proof that works for any shape. A certain upward force is 
needed to support the object in figure s. If this force was applied, 
then the object would be in equilibrium: the vector sum of all the 
forces acting on it would be zero. These forces are Fa, the upward 
force just mentioned, Fg, the downward force of gravity, and Fr, 
the total force from the fluid: 


F,+F +F;=0 


Since the fluid is under more pressure at a greater depth, the part 
of the fluid underneath the object tends to make more force than 
the part above, so the fluid tends to help support the object. 


Now suppose the object was removed, and instantly replaced 
with an equal volume of fluid. The new fluid would be in equi- 
librium without any force applied to hold it up, so 


Fo¢ + Fy = 0 ; 


where Fo;, the weight of the fluid, is not the same as Fg, the 
weight of the object, but F; is the same as before, since the pres- 
sure of the surrounding fluid is the same as before at any partic- 
ular depth. We therefore have 


Fa = — (Fg — For) ’ 


which is Archimedes’ principle in terms of force: the force re- 
quired to support the object is lessened by an amount equal to 
the weight of the fluid that would have occupied its volume. 
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By the way, the word “pressure” that I threw around casually 
in the preceding example has a precise technical definition: force 
per unit area. The SI units of pressure are N/m?, which can be 
abbreviated as pascals, 1 Pa = 1 N/m?. Atmospheric pressure is 
about 100 kPa. By applying the equation F, + Fy = 0 to the top 
and bottom surfaces of a cubical volume of fluid, one can easily 
prove that the difference in pressure between two different depths 
is AP = pgAy. (In physics, “fluid” can refer to either a gas or a 
liquid.) Pressure is discussed in more detail in chapter 5. 


A solar sail example 67 
A solar sail, figure t/1, allows a spacecraft to get its thrust without 
using internal stores of energy or having to carry along mass that 
it can shove out the back like a rocket. Sunlight strikes the sail 
and bounces off, transferring momentum to the sail. A working 
30-meter-diameter solar sail, Cosmos 1, was built by an American 
company, and was supposed to be launched into orbit aboard a 
Russian booster launched from a submarine, but launch attempts 
in 2001 and 2005 both failed. 


In this example, we will calculate the optimal orientation of the 
sail, assuming that “optimal” means changing the vehicle’s en- 
ergy as rapidly as possible. For simplicity, we model the com- 
plicated shape of the sail’s surface as a disk, seen edge-on in 
figure t/2, and we assume that the craft is in a nearly circular orbit 
around the sun, hence the 90-degree angle between the direction 
of motion and the incoming sunlight. We assume that the sail is 
100% reflective. The orientation of the sail is specified using the 
angle 8 between the incoming rays of sunlight and the perpen- 
dicular to the sail. In other words, 6=0 if the sail is catching the 
sunlight full-on, while 6=90° means that the sail is edge-on to the 
sun. 


Conservation of momentum gives 
Plight,i = Plight,t + AP sail ; 


where Apsaj; is the change in momentum picked up by the sail. 
Breaking this down into components, we have 


O = Phignt,t.x + APsail,x and 
Plight,i.y = Plight,ty + APsail,y 


As in example 51 on page 188, the component of the force that 
is directly away from the sun (up in figure t/2) doesn’t change the 
energy of the craft, so we only care about Apsaj,x, which equals 
—Plight,t,x- The outgoing light ray forms an angle of 20 with the 
negative y axis, or 270° —28 measured counterclockwise from the 
X axis, So the useful thrust depends on —cos(270° — 20) = sin20. 


However, this is all assuming a given amount of light strikes the 
sail. During a certain time period, the amount of sunlight striking 


(2) thrust 


reflected ” 
sunlight 


incoming 
sunlight 
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the sail depends on the cross-sectional area the sail presents to 
the sun, which is proportional to cos 8. For 8=90°, cos 8 equals 
zero, since the sail is edge-on to the sun. 


Putting together these two factors, the useful thrust is proportional 
to sin 28 cos®, and this quantity is maximized for 8 ~ 35°. A coun- 
terintuitive fact about this maneuver is that as the spacecraft spi- 
rals outward, its total energy (kinetic plus gravitational) increases, 
but its kinetic energy actually decreases! 


Discussion Questions 


A Anobject goes from one point in space to another. After it arrives at 
its destination, how does the magnitude of its Ar vector compare with the 
distance it traveled? 


B In several examples, I’ve dealt with vectors having negative com- 
ponents. Does it make sense as well to talk about negative and positive 
vectors? 

Cc If you’re doing graphical addition of vectors, does it matter which 
vector you start with and which vector you start from the other vector’s 
tip? 

D If you add a vector with magnitude 1 to a vector of magnitude 2, 


what magnitudes are possible for the vector sum? 


A+B 
oS E Which of these examples of vector addition are correct, and which 
are incorrect? 


F Is it possible for an airplane to maintain a constant velocity vector 
but not a constant |v|? How about the opposite — a constant |v| but not a 
constant velocity vector? Explain. 


v/ Discussion question 


G New York and Rome are at about the same latitude, so the earth’s 
rotation carries them both around nearly the same circle. Do the two cities 
have the same velocity vector (relative to the center of the earth)? If not, 
is there any way for two cities to have the same velocity vector? 


a 


H_ The figure shows a roller coaster car rolling down and then up under 
the influence of gravity. Sketch the car’s velocity vectors and acceleration 
vectors. Pick an interesting point in the motion and sketch a set of force 
vectors acting on the car whose vector sum could have resulted in the 
right acceleration vector. 


| The following is a question commonly asked by students: 


w/ Discussion question =H. “Why does the force vector always have to point in the same direction 
as the acceleration vector? What if you suddenly decide to change your 
force on an object, so that your force is no longer pointing in the same 
direction that the object is accelerating?” 


What misunderstanding is demonstrated by this question? Suppose, for 
example, a spacecraft is blasting its rear main engines while moving for- 
ward, then suddenly begins firing its sideways maneuvering rocket as 
well. What does the student think Newton’s laws are predicting? 


J Debug the following incorrect solutions to this vector addition prob- 
lem. 


Problem: Freddi Fish™ swims 5.0 km northeast, and then 12.0 km in the 
direction 55 degrees west of south. How far does she end up from her 
starting point, and in what direction is she from her starting point? 
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Incorrect solution #1: 
5.0 km+12.0 km=17.0 km 


Incorrect solution #2: 
/ (5.0 km)? + (12.0 km)2=13.0 km 


Incorrect solution #3: 
Let A and B be her two Ar vectors, and let C = A+B. Then 


Ax = (5.0 km) cos 45° = 3.5 km 
B, = (12.0 km) cos 55° = 6.9 km 
Ay = (5.0 km) sin 45° = 3.5 km 
By = (12.0 km) sin55° = 9.8 km 
Cy = Ay + By 

= 10.4 km 
Cy =A, +B, 

= 13.3 km 


|C| = \/C2 + CF 
= 16.9 km 
direction = tan—'(13.3/10.4) 
= 52° north of east 


Incorrect solution #4: 
(same notation as above) 


Ax = (5.0 km) cos 45° = 3.5 km 
B, = —(12.0 km) cos 55° = —6.9 km 
Ay = (5.0 km) sin 45° = 3.5 km 
By = —(12.0 km) sin 55° = —9.8 km 
Cy, = Ay + By 

= —3.4km 
Cy =A, +B, 

= —6.3 km 


IC| = ,/C2 + C2 


=7.2km 
direction = tan~'(—6.3/ — 3.4) 
= 62° north of east 
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the 


accelera- 


Incorrect solution #5: 
(same notation as above) 


Ay = (5.0 km) cos 45° = 3.5 km 
B, = —(12.0 km) sin 55° = —9.8 km 
Ay = (5.0 km) sin 45° = 3.5 km 
By = —(12.0 km) cos 55° = —6.9 km 
Cy = Ay + By 

= —6.3 km 
Cy =A, +B, 

= —3.4 km 


\C| = ,/C2 + C2 


= 7.2 km 
direction = tan~'(—3.4/ — 6.3) 
= 28° north of east 


3.4.4 Calculus with vectors 


Differentiation 


In one dimension, we define the velocity as the derivative of the 
position with respect to time, and we can think of the derivative as 
what we get when we calculate Ax/At for very short time intervals. 
The quantity Ax = xy — 2; is calculated by subtraction. In three 
dimensions, x becomes r, and the Ar vector is calculated by vector 
subtraction, Ar = r¢ —r;. Vector subtraction is defined component 
by component, so when we take the derivative of a vector, this means 
we end up taking the derivative component by component, 


or 


All of this reasoning applies equally well to any derivative of a vector, 
so for instance we can take the second derivative, 


pO: cp, og 
eat? ae? ae 


or 
dv. <“dyg5- dg: . “AU, « 
= x + 


de Ae dee a 


A counterintuitive consequence of this is that the acceleration 
vector does not need to be in the same direction as the motion. The 
velocity vector points in the direction of motion, but by Newton’s 
second law, a = F/m, the acceleration vector points in the same di- 
rection as the force, not the motion. This is easiest to understand if 
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we take velocity vectors from two different moments in the motion, 
and visualize subtracting them graphically to make a Av vector. 
The direction of the Av vector tells us the direction of the accelera- 
tion vector as well, since the derivative dv/dt can be approximated 
as Av/At. As shown in figure x/1, a change in the magnitude of 
the velocity vector implies an acceleration that is in the direction of 
motion. A change in the direction of the velocity vector produces 
an acceleration perpendicular to the motion, x/2. 


Circular motion example 68 
> An object moving in a circle of radius r in the x-y plane has 


xX=rcoswt and 
y=rsinwt ; 
where w is the number of radians traveled per second, and the 


positive or negative sign indicates whether the motion is clock- 
wise or counterclockwise. What is its acceleration? 


> The components of the velocity are 


Vy = —wr sin wt and 
Ww= wrcoswt ; 


and for the acceleration we have 


a =—w*rcoswt and 


ay = —w*r sin wt 


The acceleration vector has cosines and sines in the same places 
as the r vector, but with minus signs in front, so it points in the op- 
posite direction, i.e., toward the center of the circle. By Newton’s 
second law, a=F/m, this shows that the force must be inward as 
well; without this force, the object would fly off straight. 


The magnitude of the acceleration is 


jal = \/ ae + ab 


= wr 


It makes sense that w is squared, since reversing the sign of w 

corresponds to reversing the direction of motion, but the acceler- 

ation is toward the center of the circle, regardless of whether the 

motion is clockwise or counterclockwise. This result can also be 
rewritten in the form 

p 

ja) = WE 


Although I’ve relegated the results a = w?r = |v|?/r to an ex- 


ample because they are a straightforward corollary of more general 


a 

\ 

(1) a 
oe 
Nai 
2) my 4 


y/ This figure shows an_ intu- 
itive justification for the fact 
proved mathematically in the 
example, that the direction of the 
force and acceleration in circular 
motion is inward. The heptagon, 
2, is a better approximation to 
a circle than the triangle, 1. To 
make an infinitely good approx- 
imation to circular motion, we 
would need to use an infinitely 
large number of infinitesimal taps, 


which would amount to a steady 
inward force. 
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z/ The total force in the forward- 
backward direction is zero in both 
cases. 


aa/ There is no outward force 
on the bowling ball, but in the 
noninertial frame it seems like 
one exists. 


principles already developed, they are important and useful enough 
to record for later use. These results are counterintuitive as well. 
Until Newton, physicists and laypeople alike had assumed that the 
planets would need a force to push them forward in their orbits. 
Figure y may help to make it more plausible that only an inward 
force is required. A forward force might be needed in order to cancel 
out a backward force such as friction, z, but the total force in the 
forward-backward direction needs to be exactly zero for constant- 
speed motion. When you are in a car undergoing circular motion, 
there is also a strong illusion of an outward force. But what object 
could be making such a force? The car’s seat makes an inward force 
on you, not an outward one. There is no object that could be ex- 
erting an outward force on your body. In reality, this force is an 
illusion that comes from our brain’s intuitive efforts to interpret the 
situation within a noninertial frame of reference. As shown in figure 
aa, we can describe everything perfectly well in an inertial frame 
of reference, such as the frame attached to the sidewalk. In such a 
frame, the bowling ball goes straight because there is no force on it. 
The wall of the truck’s bed hits the ball, not the other way around. 


Integration 


An integral is really just a sum of many infinitesimally small 
terms. Since vector addition is defined in terms of addition of the 
components, an integral of a vector quantity is found by doing inte- 
grals component by component. 


Projectile motion example 69 
> Find the motion of an object whose acceleration vector is con- 
stant, for instance a projectile moving under the influence of grav- 
ity. 

> We integrate the acceleration to get the velocity, and then inte- 


grate the velocity to get the position as a function of time. Doing 
this to the x component of the acceleration, we find 


ro | (feat) o 


_ [tacts Vxo) dt 5 
where Vy, is a constant of integration, and 


1 2 
X= pat + Vyol + Xo 


Similarly, y=(1/2) ay t? + Vyot+Yo and Z = (1/2)azt? + Vzot+Zo. Once 
one has gained a little confidence, it becomes natural to do the 
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whole thing as a single vector integral, 


-|(Joa)s 


= [ (at vo)at 


1 
= gal? + Vol + lo ; 


where now the constants of integration are vectors. 
Discussion Questions 


A Inthe game of crack the whip, a line of people stand holding hands, 
and then they start sweeping out a circle. One person is at the center, and 
rotates without changing location. At the opposite end is the person who 
is running the fastest, in a wide circle. In this game, someone always ends 
up losing their grip and flying off. Suppose the person on the end loses 
her grip. What path does she follow as she goes flying off? (Assume she 
is going so fast that she is really just trying to put one foot in front of the 
other fast enough to keep from falling; she is not able to get any significant 
horizontal force between her feet and the ground.) 


B Suppose the person on the outside is still holding on, but feels that 
she may loose her grip at any moment. What force or forces are acting 
on her, and in what directions are they? (We are not interested in the 
vertical forces, which are the earth’s gravitational force pulling down, and 
the ground’s normal force pushing up.) 


ab/ Discussion question A. 


C Suppose the person on the outside is still holding on, but feels that 
she may loose her grip at any moment. What is wrong with the following 
analysis of the situation? “The person whose hand she’s holding exerts 
an inward force on her, and because of Newton's third law, there’s an 
equal and opposite force acting outward. That outward force is the one 
she feels throwing her outward, and the outward force is what might make 
her go flying off, if it’s strong enough.” 


D If the only force felt by the person on the outside is an inward force, 
why doesn’t she go straight in? 


E In the amusement park ride shown in the figure, the cylinder spins 
faster and faster until the customer can pick her feet up off the floor with- 
out falling. In the old Coney Island version of the ride, the floor actually 
dropped out like a trap door, showing the ocean below. (There is also a ; ; : 
version in which the whole thing tilts up diagonally, but we’re discussing ac/ Discussion question E. 
the version that stays flat.) If there is no outward force acting on her, why 

does she stick to the wall? Analyze all the forces on her. 


F What is an example of circular motion where the inward force is a 
normal force? What is an example of circular motion where the inward 
force is friction? What is an example of circular motion where the inward 
force is the sum of more than one force? 


G___Does the acceleration vector always change continuously in circular 
motion? The velocity vector? 


H A certain amount of force is needed to provide the acceleration of 
circular motion. What if were are exerting a force perpendicular to the 
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direction of motion in an attempt to make an object trace a circle of radius 
r, but the force isn’t as big as m|v|?/r? 


| Suppose a rotating space station is built that gives its occupants the 
illusion of ordinary gravity. What happens when a person in the station 
lets go of a ball? What happens when she throws a ball straight “up” in 
the air (i.e., towards the center)? 


3.4.5 The dot product 


How would we generalize the mechanical work equation dE = 
Fdx to three dimensions? Energy is a scalar, but force and distance 
are vectors, so it might seem at first that the kind of “magic-wand” 
generalization discussed on page 201 failed here, since we don’t know 
of any way to multiply two vectors together to get a scalar. Actually, 
this is Nature giving us a hint that there is such a multiplication 
operation waiting for us to invent it, and since Nature is simple, we 
can be assured that this operation will work just fine in any situation 
where a similar generalization is required. 


How should this operation be defined? Let’s consider what we 
would get by performing this operation on various combinations of 
the unit vectors x, y, and z. The conventional notation for the 
operation is to put a dot, -, between the two vectors, and the op- 
eration is therefore called the dot product. Rotational invariance 
requires that we handle the three coordinate axes in the same way, 
without giving special treatment to any of them, so we must have 
XX =y-y=2-aZandx-y=y-z4=2-x. This is supposed 
to be a way of generalizing ordinary multiplication, so for consis- 
tency with the property 1 x 1 = 1 of ordinary numbers, the result 
of multiplying a magnitude-one vector by itself had better be the 
scalar 1,sox-X = y-y =zZ-zZ=1. Furthermore, there is no way 
to satisfy rotational invariance unless we define the mixed products 
to be zero, X-y = y-%=zZ-xX = 0; for example, a 90-degree ro- 
tation of our frame of reference about the z axis reverses the sign 
of x-y, but rotational invariance requires that x - y produce the 
same result either way, and zero is the only number that stays the 
same when we reverse its sign. Establishing these six products of 
unit vectors suffices to define the operation in general, since any 
two vectors that we want to multiply can be broken down into com- 
ponents, e.g., (2k + 3Z)-Z = 2x-44+ 32-2 =0+3 = 3. Thus 
by requiring rotational invariance and consistency with multiplica- 
tion of ordinary numbers, we find that there is only one possible 
way to define a multiplication operation on two vectors that gives a 
scalar as the result.!7 The dot product has all of the properties we 
normally associate with multiplication, except that there is no “dot 
division.” 


Dot product in terms of components example 70 


17There is, however, a different operation, discussed in the next chapter, which 
multiplies two vectors to give a vector. 
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If we know the components of any two vectors b and c, we can 
find their dot product: 


b-c = (byX + byy + B22) « (CX + Cy + C22) 
= by Cy + byCy + bzCz 


Magnitude expressed with a dot product example 71 
If we take the dot product of any vector b with itself, we find 


b- b= (bX + by¥ + bz2) - (byX + by + bz2) 


=be+be+be 
so its magnitude can be expressed as 
|b}=Vb-b 


We will often write b* to mean b - b, when the context makes 
it clear what is intended. For example, we could express kinetic 
energy as (1/2)m|v|*, (1/2)mv-v, or (1/2)mv?. In the third version, 
nothing but context tells us that v really stands for the magnitude 
of some vector v. 


Geometric interpretation example 72 
In figure ad, vectors a, b, and c represent the sides of a triangle, 
and a=b+c. The law of cosines gives 


|c|? = |a|* + |b|* — 2|a||b| cos @ 
Using the result of example 71, we can also write this as 


lc? =c-c 
= (a—b)-(a—b) 
=a-a+b-b-2a-b 


Matching up terms in these two expressions, we find 
a-b = |a||b| cos 0 ; 


which is a geometric interpretation for the dot product. 


The result of example 72 is very useful. It gives us a way to 
find the angle between two vectors if we know their components. It 
can be used to show that the dot product of any two perpendicular 
vectors is zero. It also leads to a nifty proof that the dot product 
is rotationally invariant — up until now I’ve only proved that if a 
rotationally invariant product exists, the dot product is it — because 
angles and lengths aren’t affected by a rotation, so the right side of 
the equation is rotationally invariant, and therefore so is the left 
side. 


ad/The geometric 
tation of the dot product. 


interpre- 


ae/ Breaking trail, by Walter 
E. Bohl. The pack horse is not 
doing any work on the pack, 
because the pack is moving in a 
horizontal line at constant speed, 
and therefore there is no kinetic 
or gravitational energy being 
transferred into or out of it. 
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I introduced the whole discussion of the dot product by way of 
generalizing the equation dE = Fdz to three dimensions. In terms 
of a dot product, we have 


d#E =F .-dr 
If F is a constant, integrating both sides gives 
AE=F-Ar 


(If that step seemed like black magic, try writing it out in terms 
of components.) If the force is perpendicular to the motion, as in 
figure ae, then the work done is zero. The pack horse is doing work 
within its own body, but is not doing work on the pack. 


Pushing a lawnmower example 73 
> | push a lawnmower with a force F=(110 N)X — (40 N)y, and the 
total distance | travel is (100 m)X. How much work do | do? 


> The dot product is 11000 N-m = 11000 J. 


A good application of the dot product is to allow us to write a 
simple, streamlined proof of separate conservation of the momentum 
components. (You can skip the proof without losing the continuity 
of the text.) The argument is a generalization of the one-dimensional 
proof on page 130, and makes the same assumption about the type 
of system of particles we’re dealing with. The kinetic energy of 
one of the particles is (1/2)mv-v, and when we transform into a 
different frame of reference moving with velocity u relative to the 
original frame, the one-dimensional rule v > v+u turns into vector 
addition, v ~ v+u. In the new frame of reference, the kinetic 
energy is (1/2)m(v + u)-(v+u). For a system of n particles, we 
have 


K =o jnj(vy +a) (vj +0) 


As in the proof on page 130, the first sum is simply the total kinetic 
energy in the original frame of reference, and the last sum is a con- 
stant, which has no effect on the validity of the conservation law. 
The middle sum can be rewritten as 


n n 
25 MjVj-U=2uU ) MiV5 
n 
=2u- ) P; ’ 
j=l 


so the only way energy can be conserved for all values of u is if the 
vector sum of the momenta is conserved as well. 
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3.4.6 Gradients and line integrals (optional) 


This subsection introduces a little bit of vector calculus. It can 
be omitted without loss of continuity, but the techniques will be 
needed in our study of electricity and magnetism, and it may be 
helpful to be exposed to them in easy-to-visualize mechanical con- 
texts before applying them to invisible electrical and magnetic phe- 
nomena. 


In physics we often deal with fields of force, meaning situations 
where the force on an object depends on its position. For instance, 
figure af could represent a map of the trade winds affecting a sailing 
ship, or a chart of the gravitational forces experienced by a space 
probe entering a double-star system. An object moving under the 
influence of this force will not necessarily be moving in the same 
direction as the force at every moment. The sailing ship can tack 
against the wind, due to the force from the water on the keel. The 
space probe, if it entered from the top of the diagram at high speed, 
would start to curve around to the right, but its inertia would carry 
it forward, and it wouldn’t instantly swerve to match the direction 
of the gravitational force. For convenience, we’ve defined the gravi- 
tational field, g, as the force per unit mass, but that trick only leads 
to a simplification because the gravitational force on an object is 
proportional to its mass. Since this subsection is meant to apply 
to any kind of force, we’ll discuss everything in terms of the actual 
force vector, F, in units of newtons. 


If an object moves through the field of force along some curved 
path from point r; to point ro, the force will do a certain amount 
of work on it. To calculate this work, we can break the path up 
into infinitesimally short segments, find the work done along each 
segment, and add them all up. For an object traveling along a nice 
straight x2 axis, we use the symbol dz to indicate the length of any 
infinitesimally short segment. In three dimensions, moving along a 
curve, each segment is a tiny vector dr = xdxz + ydy + zdz. The 
work theorem can be expressed as a dot product, so the work done 
along a segment is F- dr. We want to integrate this, but we don’t 
know how to integrate with respect to a variable that’s a vector, 
so let’s define a variable s that indicates the distance traveled so 
far along the curve, and integrate with respect to it instead. The 
expression F - dr can be rewritten as |F| |dr| cos@, where @ is the 
angle between F and dr. But |dr| is simply ds, so the amount of 
work done becomes 


r2 
ag= | |F| cos 6 ds 


af 


Both F and @ are functions of s. As a matter of notation, it’s 
cumbersome to have to write the integral like this. Vector notation 
was designed to eliminate this kind of drudgery. We therefore define 
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af/An object moves. through 
a field of force. 
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the line integral 


[ea 
C 


as a way of notating this type of integral. The ‘C’ refers to the curve 
along which the object travels. If we don’t know this curve then we 
typically can’t evaluate the line integral just by knowing the initial 
and final positions r; and ro. 


The basic idea of calculus is that integration undoes differen- 
tiation, and vice-versa. In one dimension, we could describe an 
interaction either in terms of a force or in terms of an interaction 
energy. We could integrate force with respect to position to find 
minus the energy, or we could find the force by taking minus the 
derivative of the energy. In the line integral, position is represented 
by a vector. What would it mean to take a derivative with respect 
to a vector? The correct way to generalize the derivative dU/dzx to 
three dimensions is to replace it with the following vector, 


du. du. du 
x y 
dx 


called the gradient of U, and written with an upside-down delta!® 
like this, VU. Each of these three derivatives is really what’s known 
as a partial derivative. What that means is that when you're differ- 
entiating U with respect to x, you’re supposed to treat y and z and 
constants, and similarly when you do the other two derivatives. To 
emphasize that a derivative is a partial derivative, it’s customary to 
write it using the symbol 0 in place of the differential d’s. Putting 
all this notation together, we have 


VU = x4 yt+—z [definition of the gradient] 
Ox Zz 

The gradient looks scary, but it has a very simple physical inter- 
pretation. It’s a vector that points in the direction in which U is 
increasing most rapidly, and it tells you how rapidly U is increasing 
in that direction. For instance, sperm cells in plants and animals 
find the egg cells by traveling in the direction of the gradient of the 
concentration of certain hormones. When they reach the location 
of the strongest hormone concentration, they find their destiny. In 
terms of the gradient, the force corresponding to a given interaction 
energy is F = —VU. 


Force exerted by a spring example 74 
In one dimension, Hooke’s law is U = (1/2)kx*. Suppose we 
tether one end of a spring to a post, but it’s free to stretch and 
swing around in a plane. Let’s say its equilibrium length is zero, 
and let’s choose the origin of our coordinate system to be at the 


'8The symbol V is called a “nabla.” Cool word! 
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post. Rotational invariance requires that its energy only depend 
on the magnitude of the r vector, not its direction, so in two di- 
mensions we have U = (1/2)k|r|? = (1/2)k (x2 + y?). The force 
exerted by the spring is then 


F=-VU 
_ Wy Wy 
OX Oy 
= —kxX—kyy 


The magnitude of this force vector is k|r|, and its direction is to- 
ward the origin. 


This chapter is summarized on page 892. Notation and terminology 
are tabulated on pages 885-886. 
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Problems 
The symbols V, ©, etc. are explained on page 235. 


1 Derive a formula expressing the kinetic energy of an object in 
terms of its momentum and mass. Vv 


2 Two people in a rowboat wish to move around without causing 
the boat to move. What should be true about their total momen- 
tum? Explain. & 


3 A bullet leaves the barrel of a gun with a kinetic energy of 90 
J. The gun barrel is 50 cm long. The gun has a mass of 4 kg, the 
bullet 10 g. 

(a) Find the bullet’s final velocity. Vv 
(b) Find the bullet’s final momentum. Vv 
(c) Find the momentum of the recoiling gun. 

(d) Find the kinetic energy of the recoiling gun, and explain why 
the recoiling gun does not kill the shooter. Vv Oo 


4 The big difference between the equations for momentum and 
kinetic energy is that one is proportional to v and one to v7. Both, 
however, are proportional to m. Suppose someone tells you that 
there’s a third quantity, funkosity, defined as f = m?v, and that 
funkosity is conserved. How do you know your leg is being pulled? 
> Solution, p. 877 & 


5 A ball of mass 2m collides head-on with an initially stationary 
ball of mass m. No kinetic energy is transformed into heat or sound. 
In what direction is the mass-2m ball moving after the collision, and 
how fast is it going compared to its original velocity? 

> Answer, p. 874 & 


6 A very massive object with velocity v collides head-on with 
an object at rest whose mass is very small. No kinetic energy is 
converted into other forms. Prove that the low-mass object recoils 
with velocity 2v. [Hint: Use the center-of-mass frame of reference.] 


~ 


7  Amassm moving at velocity v collides with a stationary target 
having the same mass m. Find the maximum amount of energy that 
can be released as heat and sound. Vv Oo 


8 A rocket ejects exhaust with an exhaust velocity u. The rate 
at which the exhaust mass is used (mass per unit time) is b. We 
assume that the rocket accelerates in a straight line starting from 
rest, and that no external forces act on it. Let the rocket’s initial 
mass (fuel plus the body and payload) be mj;, and my be its final 
mass, after all the fuel is used up. (a) Find the rocket’s final velocity, 
v, in terms of u, m;, and my. Neglect the effects of special relativity. 
(b) A typical exhaust velocity for chemical rocket engines is 4000 
m/s. Estimate the initial mass of a rocket that could accelerate a 
one-ton payload to 10% of the speed of light, and show that this 


Chapter 3 Conservation of Momentum 


design won’t work. (For the sake of the estimate, ignore the mass of 
the fuel tanks. The speed is fairly small compared to c, so it’s not 
an unreasonable approximation to ignore relativity.) Vv 


9 An object is observed to be moving at constant speed along 
a line. Can you conclude that no forces are acting on it? Explain. 
[Based on a problem by Serway and Faughn.| g 


10 At low speeds, every car’s acceleration is limited by traction, 
not by the engine’s power. Suppose that at low speeds, a certain 
car is normally capable of an acceleration of 3 m/s?. If it is towing 
a trailer with half as much mass as the car itself, what acceleration 
can it achieve? [Based on a problem from PSSC Physics. ] 


11 = (a) Let T be the maximum tension that an elevator’s cable can 
withstand without breaking, i.e., the maximum force it can exert. 
If the motor is programmed to give the car an acceleration a, what 
is the maximum mass that the car can have, including passengers, 
if the cable is not to break? 

(b) Interpret the equation you derived in the special cases of a = 0 
and of a downward acceleration of magnitude g. 


= 


12 When the contents of a refrigerator cool down, the changed 
molecular speeds imply changes in both momentum and energy. 
Why, then, does a fridge transfer power through its radiator coils, 
but not force? > Solution, p. 877 


13 A helicopter of mass m is taking off vertically. The only 
forces acting on it are the earth’s gravitational force and the force, 
Foy, of the air pushing up on the propeller blades. 

(a) If the helicopter lifts off at t = 0, what is its vertical speed at 
time t? 

(b) Check that the units of your answer to part a make sense. 

(c) Discuss how your answer to part a depends on all three variables, 
and show that it makes sense. That is, for each variable, discuss 
what would happen to the result if you changed it while keeping the 
other two variables constant. Would a bigger value give a smaller 
result, or a bigger result? Once you’ve figured out this mathematical 
relationship, show that it makes sense physically. 

(d) Plug numbers into your equation from part a, using m = 2300 
kg, Fyir = 27000 N, and t = 4.0 s. vo 


14 A blimp is initially at rest, hovering, when at ¢ = 0 the pilot 
turns on the motor of the propeller. The motor cannot instantly 
get the propeller going, but the propeller speeds up steadily. The 
steadily increasing force between the air and the propeller is given 
by the equation F' = kt, where k is a constant. If the mass of the 
blimp is m, find its position as a function of time. (Assume that 
during the period of time you’re dealing with, the blimp is not yet 
moving fast enough to cause a significant backward force due to air 
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resistance. ) vo 


15 Acar is accelerating forward along a straight road. If the force 
of the road on the car’s wheels, pushing it forward, is a constant 3.0 
kN, and the car’s mass is 1000 kg, then how long will the car take 
to go from 20 m/s to 50 m/s? > Solution, p. 878 & 


16 A little old lady and a pro football player collide head-on. 
Compare their forces on each other, and compare their accelerations. 
Explain. g 


Problem 16. 
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17 The earth is attracted to an object with a force equal and 
opposite to the force of the earth on the object. If this is true, 
why is it that when you drop an object, the earth does not have an 
acceleration equal and opposite to that of the object? g 


18 When you stand still, there are two forces acting on you, 
the force of gravity (your weight) and the normal force of the floor 
pushing up on your feet. Are these forces equal and opposite? Does 
Newton’s third law relate them to each other? Explain. BE 


19 Today’s tallest buildings are really not that much taller than 
the tallest buildings of the 1940’s. One big problem with making an 
even taller skyscraper is that every elevator needs its own shaft run- 
ning the whole height of the building. So many elevators are needed 
to serve the building’s thousands of occupants that the elevator 
shafts start taking up too much of the space within the building. 
An alternative is to have elevators that can move both horizontally 
and vertically: with such a design, many elevator cars can share a 
few shafts, and they don’t get in each other’s way too much because 
they can detour around each other. In this design, it becomes im- 
possible to hang the cars from cables, so they would instead have to 
ride on rails which they grab onto with wheels. Friction would keep 
them from slipping. The figure shows such a frictional elevator in 
its vertical travel mode. (The wheels on the bottom are for when it 
needs to switch to horizontal motion.) 

(a) If the coefficient of static friction between rubber and steel is 
Ls, and the maximum mass of the car plus its passengers is M, 
how much force must there be pressing each wheel against the rail 
in order to keep the car from slipping? (Assume the car is not 
accelerating. ) Vv 
(b) Show that your result has physically reasonable behavior with 
respect to yss. In other words, if there was less friction, would the 
wheels need to be pressed more firmly or less firmly? Does your 
equation behave that way? g 


20 A tugboat of mass m pulls a ship of mass M, accelerating 
it. Ignore fluid friction acting on their hulls, although there will of 
course need to be fluid friction acting on the tug’s propellers. 

(a) If the force acting on the tug’s propeller is F', what is the tension, 
T, in the cable connecting the two ships? > Hint, p. 864 Vv 
(b) Interpret your answer in the special cases of M = 0 and M = co. 


~ 


21 Someone tells you she knows of a certain type of Central 
American earthworm whose skin, when rubbed on polished dia- 
mond, has uw, > ps. Why is this not just empirically unlikely but 
logically suspect? le 


rubber wheel 


steel rail 


Problem 19. 
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Problem 23, part c. 


22 A uranium atom deep in the earth spits out an alpha particle. 
An alpha particle is a fragment of an atom. This alpha particle has 
initial speed v, and travels a distance d before stopping in the earth. 
(a) Find the force, F', from the dirt that stopped the particle, in 
terms of v,d, and its mass, m. Don’t plug in any numbers yet. 
Assume that the force was constant. Vv 
(b) Show that your answer has the right units. 

(c) Discuss how your answer to part a depends on all three variables, 
and show that it makes sense. That is, for each variable, discuss 
what would happen to the result if you changed it while keeping the 
other two variables constant. Would a bigger value give a smaller 
result, or a bigger result? Once you’ve figured out this mathematical 
relationship, show that it makes sense physically. 

(d) Evaluate your result for m = 6.7 x 107?” kg, v = 2.0 x 104 km/s, 
and d= 0.71 mm. Vv 


~ 


23 You are given a large sealed box, and are not allowed to open 
it. Which of the following experiments measure its mass, and which 
measure its weight? [Hint: Which experiments would give different 
results on the moon?] 
(a) Put it on a frozen lake, throw a rock at it, and see how fast it 
scoots away after being hit. 
(b) Drop it from a third-floor balcony, and measure how loud the 
sound is when it hits the ground. 
(c) As shown in the figure, connect it with a spring to the wall, and 
watch it vibrate. 

> Solution, p. 878 L 


24 While escaping from the palace of the evil Martian em- 
peror, Sally Spacehound jumps from a tower of height h down to 
the ground. Ordinarily the fall would be fatal, but she fires her 
blaster rifle straight down, producing an upward force of magnitude 
Fg. This force is insufficient to levitate her, but it does cancel out 
some of the force of gravity. During the time t that she is falling, 
Sally is unfortunately exposed to fire from the emperor’s minions, 
and can’t dodge their shots. Let m be her mass, and g the strength 
of gravity on Mars. 

(a) Find the time ¢ in terms of the other variables. 

(b) Check the units of your answer to part a. 

(c) For sufficiently large values of F’g, your answer to part a becomes 
nonsense — explain what’s going on. Vv 


_ 


25 When I cook rice, some of the dry grains always stick to the 
measuring cup. To get them out, I turn the measuring cup upside- 
down and hit the “roof” with my hand so that the grains come off of 
the “ceiling.” (a) Explain why static friction is irrelevant here. (b) 
Explain why gravity is negligible. (c) Explain why hitting the cup 
works, and why its success depends on hitting the cup hard enough. 
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26 A flexible rope of mass m and length LF slides without friction 
over the edge of a table. Let x be the length of the rope that is 
hanging over the edge at a given moment in time. 

(a) Show that 2 satisfies the equation of motion d?a/dt? = ga/L. 
[Hint: Use F = dp/dt, which allows you to handle the two parts of 
the rope separately even though mass is moving out of one part and 
into the other.] 

(b) Give a physical explanation for the fact that a larger value of 
x on the right-hand side of the equation leads to a greater value of 
the acceleration on the left side. 

(c) When we take the second derivative of the function x(t) we are 
supposed to get essentially the same function back again, except 
for a constant out in front. The function e” has the property that 
it is unchanged by differentiation, so it is reasonable to look for 
solutions to this problem that are of the form x = be“, where b and 
c are constants. Show that this does indeed provide a solution for 
two specific values of c (and for any value of b). 

(d) Show that the sum of any two solutions to the equation of motion 
is also a solution. 

(e) Find the solution for the case where the rope starts at rest at 
t = 0 with some nonzero value of x. = 


In problems 27-81, analyze the forces using a table in the format 
shown in section 8.2.6. Analyze the forces in which the italicized 
object participates. 


27 Some people put a spare car key in a little magnetic box 
that they stick under the chassis of their car. Let’s say that the 
box is stuck directly underneath a horizontal surface, and the car is 
parked. (See instructions above.) Ez 


28 Analyze two examples of objects at rest relative to the earth 
that are being kept from falling by forces other than the normal 
force. Do not use objects in outer space, and do not duplicate 
problem 27 or 31. (See instructions above.) is 


29 A person is rowing a boat, with her feet braced. She is doing 
the part of the stroke that propels the boat, with the ends of the 
oars in the water (not the part where the oars are out of the water). 
(See instructions above.) z 


30 A farmer is in a stall with a cow when the cow decides 
to press him against the wall, pinning him with his feet off the 
ground. Analyze the forces in which the farmer participates. (See 


instructions above.) g 
31 A propeller plane is cruising east at constant speed and 
altitude. (See instructions above.) B 


Problem 29. 


Problem 31. 
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Problem 32 


/ \ 
ce / \ 
Problem 33. 
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32 =‘ The figure shows a stack of two blocks, sitting on top of a table 
that is bolted to the floor. All three objects are made from iden- 
tical wood, with their surfaces finished identically using the same 
sandpaper. We tap the middle block, giving it an initial velocity vu 
to the right. The tap is executed so rapidly that almost no initial 
velocity is imparted to the top block. 

(a) Find the time that will elapse until the slipping between the top 
and middle blocks stops. Express your answer in terms of v, m, M, 
g, and the relevant coefficient of friction. Vv 
(b) Show that your answer makes sense in terms of units. 

(c) Check that your result has the correct behavior when you make 
m bigger or smaller. Explain. This means that you should discuss 
the mathematical behavior of the result, and then explain how this 
corresponds to what would really happen physically. 

(d) Similarly, discuss what happens when you make M bigger or 
smaller. 

(e) Similarly, discuss what happens when you make g bigger or 
smaller. a 


33 Ginny has a plan. She is going to ride her sled while her dog 
Foo pulls her, and she holds on to his leash. However, Ginny hasn’t 
taken physics, so there may be a problem: she may slide right off 
the sled when Foo starts pulling. 

(a) Analyze all the forces in which Ginny participates, making a 
table as in subsection 3.2.6. 

(b) Analyze all the forces in which the sled participates. 

(c) The sled has mass m, and Ginny has mass M. The coefficient 
of static friction between the sled and the snow is pj, and lg is 
the corresponding quantity for static friction between the sled and 
her snow pants. Ginny must have a certain minimum mass so that 
she will not slip off the sled. Find this in terms of the other three 
variables. Vv 
(d) Interpreting your equation from part c, under what conditions 
will there be no physically realistic solution for M? Discuss what 
this means physically. a 


34 In each case, identify the force that causes the acceleration, 
and give its Newton’s-third-law partner. Describe the effect of the 
partner force. (a) A swimmer speeds up. (b) A golfer hits the ball 
off of the tee. (c) An archer fires an arrow. (d) A locomotive slows 
down. > Solution, p. 878 & 


35 A cop investigating the scene of an accident measures the 
length FL of a car’s skid marks in order to find out its speed v at 
the beginning of the skid. Express v in terms of L and any other 
relevant variables. vo 


36 An ice skater builds up some speed, and then coasts across 
the ice passively in a straight line. (a) Analyze the forces, using a 
table the format shown in section ??. 
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(b) If his initial speed is v, and the coefficient of kinetic friction is pz, 
find the maximum theoretical distance he can glide before coming 
to a stop. Ignore air resistance. Vv 
(c) Show that your answer to part b has the right units. 
(d) Show that your answer to part b depends on the variables in a 
way that makes sense physically. 
(e) Evaluate your answer numerically for wz, = 0.0046, and a world- 
record speed of 14.58 m/s. (The coefficient of friction was measured 
by De Koning et al., using special skates worn by real speed skaters.) 
Vv 
(f) Comment on whether your answer in part e seems realistic. If it 
doesn’t, suggest possible reasons why. 


= 


37 (a) Using the solution of problem 37 on page 124, predict 

how the spring constant of a fiber will depend on its length and 

cross-sectional area. 

(b) The constant of proportionality is called the Young’s modulus, 

E, and typical values of the Young’s modulus are about 10!° to 

10!'. What units would the Young’s modulus have in the SI system? 
> Solution, p. 878 


38 This problem depends on the results of problems problem 37 
on page 124 and problem 37 from this chapter. When atoms form 
chemical bonds, it makes sense to talk about the spring constant of 
the bond as a measure of how “stiff” it is. Of course, there aren’t 
really little springs — this is just a mechanical model. The purpose 
of this problem is to estimate the spring constant, k, for a single 
bond in a typical piece of solid matter. Suppose we have a fiber, 
like a hair or a piece of fishing line, and imagine for simplicity that 
it is made of atoms of a single element stacked in a cubical manner, 
as shown in the figure, with a center-to-center spacing b. A typical 
value for b would be about 107!° m 

(a) Find an equation for k in terms of b, and in terms of the Young’s 
modulus, E, defined in problem 37 and its solution. 

(b) Estimate & using the numerical data given in problem 37. 

(c) Suppose you could grab one of the atoms in a diatomic molecule 
like Hy or Og, and let the other atom hang vertically below it. Does 
the bond stretch by any appreciable fraction due to gravity? a 


39 This problem has been deleted. g 


40 Many fish have an organ known as a swim bladder, an air- 
filled cavity whose main purpose is to control the fish’s buoyancy 
an allow it to keep from rising or sinking without having to use its 
muscles. In some fish, however, the swim bladder (or a small exten- 
sion of it) is linked to the ear and serves the additional purpose of 
amplifying sound waves. For a typical fish having such an anatomy, 
the bladder has a resonant frequency of 300 Hz, the bladder’s Q is 3, 
and the maximum amplification is about a factor of 100 in energy. 


Problem 38 
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Over what range of frequencies would the amplification be at least 
a factor of 50? 
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Al An oscillator with sufficiently strong damping has its maxi- 
mum response at w = 0. Using the result derived on page 857 , find 
the value of Q at which this behavior sets in. 

> Hint, p. 864 pb Answer, p. 874 L 


42 An oscillator has Q=6.00, and, for convenience, let’s assume 
Fm = 1.00, w = 1.00, and m = 1.00. The usual approximations 
would give 


Wres = Wo ) 
Ares = 6.00 : and 
Aw = 1/6.00 


Determine these three quantities numerically using the result de- 
rived on page 857 , and compare with the approximations. g 


43 The apparatus in figure d on page 61 had a natural period 
of oscillation of 5 hours and 20 minutes. The authors estimated, 
based on calculations of internal friction in the tungsten wire, that 
its Q was on the order of 10°, but they were unable to measure it 
empirically because it would have taken years for the amplitude to 
die down by any measurable amount. Although each aluminum or 
platinum mass was really moving along an arc of a circle, any actual 
oscillations caused by a violation of the equivalence of gravitational 
and inertial mass would have been measured in millions of a degree, 
so it’s a good approximation to say that each mass’s motion was 
along a (very short!) straight line segment. We can also treat each 
mass as if it was oscillating separately from the others. If the prin- 
ciple of equivalence had been violated at the 10~!? level, the limit of 
their experiment’s sensitivity, the sun’s gravitational force on one of 
the 0.4-gram masses would have been about 3 x 10~!9 N, oscillating 
with a period of 24 hours due to the rotation of the earth. (We ig- 
nore the inertia of the arms, whose total mass was only about 25% 
of the total mass of the rotating assembly.) 

(a) Find the amplitude of the resulting oscillations, and determine 
the angle to which they would have corresponded, given that the 


radius of the balance arms was 10 cm. > Answer, p. 874 
(b) Show that even if their estimate of Q was wildly wrong, it 
wouldn’t have affected this result. SI 
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44 A firework shoots up into the air, and just before it explodes 
it has a certain momentum and kinetic energy. What can you say 
about the momenta and kinetic energies of the pieces immediately 
after the explosion? [Based on a problem from PSSC Physics.} 

> Solution, p. 878 © 


45 The figure shows a view from above of a collision about 
V to happen between two air hockey pucks sliding without friction. 
They have the same speed, v;, before the collision, but the big puck 
is 2.3 times more massive than the small one. Their sides have 
sticky stuff on them, so when they collide, they will stick together. 
@ At what angle will they emerge from the collision? In addition to 
giving a numerical answer, please indicate by drawing on the figure 
how your angle is defined. > Solution, p. 878 - 


Problem 45 


46 A learjet traveling due east at 300 mi/hr collides with a 
jumbo jet which was heading southwest at 150 mi/hr. The jumbo 
jet’s mass is five times greater than that of the learjet. When they 
collide, the learjet sticks into the fuselage of the jumbo jet, and they 
fall to earth together. Their engines stop functioning immediately 
after the collision. On a map, what will be the direction from the 
location of the collision to the place where the wreckage hits the 
ground? (Give an angle.) Vv of 


AT (a) A ball is thrown straight up with velocity v. Find an 
equation for the height to which it rises. Vv 
(b) Generalize your equation for a ball thrown at an angle 0 above 
horizontal, in which case its initial velocity components are vz; = 
ucos@ and vy = vsiné. Vv 


— 


48 At the 2010 Salinas Lettuce Festival Parade, the Lettuce 
Queen drops her bouquet while riding on a float moving toward the 
right. Sketch the shape of its trajectory in her frame of reference, 
and compare with the shape seen by one of her admirers standing 
on the sidewalk. gE 


49 Two daredevils, Wendy and Bill, go over Niagara Falls. 
Wendy sits in an inner tube, and lets the 30 km/hr velocity of the 
river throw her out horizontally over the falls. Bill paddles a kayak, 
adding an extra 10 km/hr to his velocity. They go over the edge 
of the falls at the same moment, side by side. Ignore air friction. 
Explain your reasoning. 

(a) Who hits the bottom first? 

(b) What is the horizontal component of Wendy’s velocity on im- 


pact? 
(c) What is the horizontal component of Bill’s velocity on impact? 
(d) Who is going faster on impact? & 
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50 A baseball pitcher throws a pitch clocked at v,=73.3 mi/h. 
He throws horizontally. By what amount, d, does the ball drop by 
the time it reaches home plate, L=60.0 ft away? 

(a) First find a symbolic answer in terms of L, vz, and g. v 
(b) Plug in and find a numerical answer. Express your answer in 
units of ft. (Note: 1 ft=12 in, 1 mi=5280 ft, and 1 in=2.54 cm) 


V,=73.3 mi/hr 


L=60.0 ft 


51 A batter hits a baseball at speed v, at an angle 6 above 
horizontal. 
(a) Find an equation for the range (horizontal distance to where 
the ball falls), R, in terms of the relevant variables. Neglect air 
friction and the height of the ball above the ground when it is hit. 

> Answer, p. 874 
(b) Interpret your equation in the cases of 6=0 and @ = 90°. 
(c) Find the angle that gives the maximum range. 

> Answer, p. 874 L 


52 In this problem you'll extend the analysis in problem 51 


to include air friction by writing a computer program. For a 
game played at sea level, the force due to air friction is approxi- 
mately (7 x 10-4 N-s?/m?)v?, in the direction opposite to the motion 
of the ball.!9 The mass of a baseball is 0.146 kg. 

(a) For a ball hit at a speed of 45.0 m/s from a height of 1.0 m, find 
the optimal angle and the resulting range. > Answer, p. 874 
(b) How much farther would the ball fly at the Colorado Rockies’ 
stadium, where the thinner air gives 18 percent less air friction? 

> Answer, p. 874 i 


53 If you walk 35 km at an angle 25°counterclockwise from 
east, and then 22 km at 230°counterclockwise from east, find the 
distance and direction from your starting point to your destination. 


Vv o 


19 standard baseball is supposed to have a circumference of 95 inches. A 
standard way of paramerizing the force of fluid friction is F = (1/2)pACpv”, 
where p is the density of the fluid, A is the object’s cross-sectional area, and 
Cp is a unitless constant. E. Meyer and J. Bohn, in a 2008 paper published at 
arxiv.org, survey existing data on C’p for baseballs and estimate it to be in the 
range from about 0.13 to 0.5. This leads to a figure something like the one given. 
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54 The figure shows vectors A and B. As in figure p on p. 199, 
graphically calculate the following: 


A+B,A-—B,B-A, —2B, A—2B 


No numbers are involved. 


55 Phnom Penh is 470 km east and 250 km south of Bangkok. 
Hanoi is 60 km east and 1030 km north of Phnom Penh. 

(a) Choose a coordinate system, and translate these data into Ax 
and Ay values with the proper plus and minus signs. 

(b) Find the components of the Ar vector pointing from Bangkok 
to Hanoi. Vv 


56 Is it possible for a helicopter to have an acceleration due east 
and a velocity due west? If so, what would be going on? If not, why 


not? a 
north A 
‘ 
K direction of motion 
= of glacier relative 
Se ee % to continent, 1.1x1077 m/s 
Boe Peas ‘ 
= \ 
direction of motion 1G 
of fossil relative to 2 
glacier, 2.3x10-7 m/s SY 
Problem 57. 


57 A dinosaur fossil is slowly moving down the slope of a glacier 
under the influence of wind, rain and gravity. At the same time, 
the glacier is moving relative to the continent underneath. The 
dashed lines represent the directions but not the magnitudes of the 
velocities. Pick a scale, and use graphical addition of vectors to find 
the magnitude and the direction of the fossil’s velocity relative to 
the continent. You will need a ruler and protractor. vo 


58 A bird is initially flying horizontally east at 21.1 m/s, but one 
second later it has changed direction so that it is flying horizontally 
and 7° north of east, at the same speed. What are the magnitude 
and direction of its acceleration vector during that one second time 
interval? (Assume its acceleration was roughly constant.) Vo 
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59 Your hand presses a block of mass m against a wall with 
a force Fy acting at an angle 0, as shown in the figure. Find the 
minimum and maximum possible values of |Fy| that can keep the 
block stationary, in terms of m, g, 0, and ps, the coefficient of static 
friction between the block and the wall. Check both your answers 
in the case of 9 = 90°, and interpret the case where the maximum 
force is infinite. Vv 


60 A skier of mass m is coasting down a slope inclined at 
an angle @ compared to horizontal. Assume for simplicity that the 
treatment of kinetic friction given in chapter 5 is appropriate here, 
although a soft and wet surface actually behaves a little differently. 
The coefficient of kinetic friction acting between the skis and the 
snow is tz, and in addition the skier experiences an air friction 
force of magnitude bv”, where b is a constant. 

(a) Find the maximum speed that the skier will attain, in terms of 
the variables m, g, 0, uz, and 6. Vv 
(b) For angles below a certain minimum angle Onin, the equation 
gives a result that is not mathematically meaningful. Find an equa- 
tion for @min, and give a physical explanation of what is happening 
for 6 < Omin- Vv 


~ 


61 A gun is aimed horizontally to the west, and fired at t = 0. 
The bullet’s position vector as a function of time is r = bk + cty + 
dt?z, where b, c, and d are positive constants. 

(a) What units would b, c, and d need to have for the equation to 
make sense? 

(b) Find the bullet’s velocity and acceleration as functions of time. 
(c) Give physical interpretations of b, c, d, X, y, and @. z 


62 Annie Oakley, riding north on horseback at 30 mi/hr, shoots 
her rifle, aiming horizontally and to the northeast. The muzzle speed 
of the rifle is 140 mi/hr. When the bullet hits a defenseless fuzzy 
animal, what is its speed of impact? Neglect air resistance, and 
ignore the vertical motion of the bullet. > Solution, p. 878 


63 A cargo plane has taken off from a tiny airstrip in the Andes, 
and is climbing at constant speed, at an angle of 0 = 17° with respect 
to horizontal. Its engines supply a thrust of Fip~rys¢ = 200 kN, and 
the lift from its wings is Fj; f; = 654 kN. Assume that air resistance 
(drag) is negligible, so the only forces acting are thrust, lift, and 
weight. What is its mass, in kg? > Solution, p. 879 L 


64 A wagon is being pulled at constant speed up a slope 0 
by a rope that makes an angle ¢ with the vertical. (a) Assuming 
negligible friction, show that the tension in the rope is given by the 
equation 

sin 0 


~ sin@+¢) 


(b) Interpret this equation in the special cases of 6 = 0 and ¢ = 


Problem 59 


Fthrust 


Fiitt 


Problem 64 


Ps. eee 
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180° — 0. > Solution, p. 879 © 


65 The angle of repose is the maximum slope on which an object 

will not slide. On airless, geologically inert bodies like the moon or 

an asteroid, the only thing that determines whether dust or rubble 

will stay on a slope is whether the slope is less steep than the angle 

of repose. 

(a) Find an equation for the angle of repose, deciding for yourself 

what are the relevant variables. 

(b) On an asteroid, where g can be thousands of times lower than 

on Earth, would rubble be able to lie at a steeper angle of repose? 
> Solution, p. 880 L 


66 When yow’re done using an electric mixer, you can get most 
of the batter off of the beaters by lifting them out of the batter with 
the motor running at a high enough speed. Let’s imagine, to make 
things easier to visualize, that we instead have a piece of tape stuck 
to one of the beaters. 

(a) Explain why static friction has no effect on whether or not the 
tape flies off. 

(b) Analyze the forces in which the tape participates, using a table 
the format shown in section ??. 

(c) Suppose you find that the tape doesn’t fly off when the motor is 
on a low speed, but at a greater speed, the tape won’t stay on. Why 
would the greater speed change things? [Hint: If you don’t invoke 
any law of physics, you haven’t explained it.] x 


67 — Show that the expression |v|?/r has the units of acceleration. 


68 A plane is flown in a loop-the-loop of radius 1.00 km. The 
plane starts out flying upside-down, straight and level, then begins 
curving up along the circular loop, and is right-side up when it 
reaches the top. (The plane may slow down somewhat on the way 
up.) How fast must the plane be going at the top if the pilot is to 
experience no force from the seat or the seatbelt while at the top of 


the loop? Vv oo 
69 Find the angle between the following two vectors: 

X+ 2y¥ + 32 

4x + 5y + 62 


> Hint, p. 864 V 


70 The two blocks shown in the figure have equal mass, m, and 
the surface is frictionless. (a) What is the tension in the massless 
rope? > Hint, p. 864 V 


(b) Show that the units of your answer make sense. 
(c) Check the physical behavior of your answer in the special cases 
of 6< 6 and éd=0, 6= 90°. E 
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71 (a) We observe that the amplitude of a certain free oscillation 
decreases from A, to A,/Z after n oscillations. Find its Q. Vv 
(b) The figure is from Shape memory in Spider draglines, Emile, 
Le Floch, and Vollrath, Nature 440:621 (2006). Panel 1 shows an 
electron microscope’s image of a thread of spider silk. In 2, a spi- 
der is hanging from such a thread. From an evolutionary point of 
view, it’s probably a bad thing for the spider if it twists back and 
forth while hanging like this. (We’re referring to a back-and-forth 
rotation about the axis of the thread, not a swinging motion like a 
pendulum.) The authors speculate that such a vibration could make 
the spider easier for predators to see, and it also seems to me that 
it would be a bad thing just because the spider wouldn’t be able 
to control its orientation and do what it was trying to do. Panel 3 
shows a graph of such an oscillation, which the authors measured 
using a video camera and a computer, with a 0.1 g mass hung from it 
in place of a spider. Compared to human-made fibers such as kevlar 
or copper wire, the spider thread has an unusual set of properties: 


1. It has a low Q, so the vibrations damp out quickly. 


2. It doesn’t become brittle with repeated twisting as a copper 
wire would. 


3. When twisted, it tends to settle in to a new equilibrium angle, 
rather than insisting on returning to its original angle. You 
can see this in panel 2, because although the experimenters 
initially twisted the wire by 35 degrees, the thread only per- 
formed oscillations with an amplitude much smaller than +35 
degrees, settling down to a new equilibrium at 27 degrees. 


4. Over much longer time scales (hours), the thread eventually 
resets itself to its original equilbrium angle (shown as zero 
degrees on the graph). (The graph reproduced here only shows 
the motion over a much shorter time scale.) Some human- 
made materials have this “memory” property as well, but they 
typically need to be heated in order to make them go back to 
their original shapes. 


Focusing on property number 1, estimate the Q of spider silk from 
the graph. Vv eo 
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72 A cross-country skier is gliding on a level trail, with negligi- 
ble friction. Then, when he is at position x = 0, the tip of his skis 
enters a patch of dirt. As he rides onto the dirt, more and more of 
his weight is being supported by the dirt. The skis have length @, 
so if he reached x = @ without stopping, his weight would be com- 
pletely on the dirt. This problem deals with the motion for x < @. 
(a) Find the acceleration in terms of x, as well as any other relevant 
constants. 

(b) This is a second-order differential equation. You should be able 
to find the solution simply by thinking about some commonly oc- 
curing functions that you know about, and finding two that have 
the right properties. If these functions are x = f(t) and x = g(t), 
then the most general solution to the equations of motion will be of 
the form « = af + bg, where a and 0 are constants to be determined 
from the initial conditions. 

(c) Suppose that the initial velocity vo at x = 0 is such that he stops 
at x < @. Find the time until he stops, and show that, counterintu- 
itively, this time is independent of vo. Explain physically why this 
is true. v 
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73 A microwave oven works by twisting molecules one way and 
then the other, counterclockwise and then clockwise about their own 
centers, millions of times a second. If you put an ice cube or a stick 
of butter in a microwave, you'll observe that the oven doesn’t heat 
the solid very quickly, although eventually melting begins in one 
small spot. Once a melted spot forms, it grows rapidly, while the 
rest of the solid remains solid. In other words, it appears based on 
this experiment that a microwave oven heats a liquid much more 
rapidly than a solid. Explain why this should happen, based on the 
atomic-level description of heat, solids, and liquids. (See, e.g., figure 
b on page 110.) 


Please don’t repeat the following common mistakes in your ex- 
planation: 


In a solid, the atoms are packed more tightly and have less 
space between them. Not true. Ice floats because it’s less 
dense than water. 


In a liquid, the atoms are moving much faster. No, the differ- 
ence in average speed between ice at —1°C and water at 1°C 
is only 0.4%. 


~ 


74 Problem 2-?? on page ?? was intended to be solved using 
conservation of energy. Solve the same problem using Newton’s 
laws. g 


75 A bead slides down along a piece of wire that is in the shape 
of a helix. The helix lies on the surface of a vertical cylinder of 
radius r, and the vertical distance between turns is d. 

(a) Ordinarily when an object slides downhill under the influence of 
kinetic friction, the velocity-independence of kinetic friction implies 
that the acceleration is constant, and therefore there is no limit to 
the object’s velocity. Explain the physical reason why this argument 
fails here, so that the bead will in fact have some limiting velocity. 

(b) Find the limiting velocity. 

(c) Show that your result has the correct behavior in the limit of 
r — oo. [Problem by B. Korsunsky.] Vv 


76 A person on a bicycle is to coast down a ramp of height h and 
then pass through a circular loop of radius r. What is the small- 
est value of A for which the cyclist will complete the loop without 
falling? (Ignore the kinetic energy of the spinning wheels.) v 


— 
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77 A car accelerates from rest. At low speeds, its acceleration 
is limited by static friction, so that if we press too hard on the 
gas, we will “burn rubber” (or, for many newer cars, a computer- 
ized traction-control system will override the gas pedal). At higher 
speeds, the limit on acceleration comes from the power of the engine, 
which puts a limit on how fast kinetic energy can be developed. 

(a) Show that if a force F' is applied to an object moving at speed 
v, the power required is given by P = vF. 

(b) Find the speed v at which we cross over from the first regime de- 
scribed above to the second. At speeds higher than this, the engine 
does not have enough power to burn rubber. Express your result 
in terms of the car’s power P, its mass m, the coefficient of static 
friction ps, and g. Vv 
(c) Show that your answer to part b has units that make sense. 

(d) Show that the dependence of your answer on each of the four 
variables makes sense physically. 

(e) The 2010 Maserati Gran Turismo Convertible has a maximum 
power of 3.23 x 10° W (433 horsepower) and a mass (including a 50- 
kg driver) of 2.03 x 10° kg. (This power is the maximum the engine 
can supply at its optimum frequency of 7600 r.p.m. Presumably the 
automatic transmission is designed so a gear is available in which 
the engine will be running at very nearly this frequency when the 
car is at moving at v.) Rubber on asphalt has y, + 0.9. Find v for 
this car. Answer: 18 m/s, or about 40 miles per hour. 

(f) Our analysis has neglected air friction, which can probably be 
approximated as a force proportional to v2. The existence of this 
force is the reason that the car has a maximum speed, which is 176 
miles per hour. To get a feeling for how good an approximation 
it is to ignore air friction, find what fraction of the engine’s maxi- 
mum power is being used to overcome air resistance when the car is 
moving at the speed v found in part e. Answer: 1% a 


78 Two wheels of radius r rotate in the same vertical plane with 
angular velocities +9 and —Q about axes that are parallel and at 
the same height. The wheels touch one another at a point on their 
circumferences, so that their rotations mesh like gears in a gear train. 
A board is laid on top of the wheels, so that two friction forces act 
upon it, one from each wheel. Characterize the three qualitatively 
different types of motion that the board can exhibit, depending on 
the initial conditions. = 
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79 For safety, mountain climbers often wear a climbing harness 
and tie in to other climbers on a rope team or to anchors such as 
pitons or snow anchors. When using anchors, the climber usually 
wants to tie in to more than one, both for extra strength and for 
redundancy in case one fails. The figure shows such an arrangement, 
with the climber hanging from a pair of anchors forming a “Y” at 
an angle 6. The usual advice is to make # < 90°; for large values of 
0, the stress placed on the anchors can be many times greater than 
the actual load L, so that two anchors are actually less safe than 
one. 

(a) Find the force S at each anchor in terms of L and 0. v 
(b) Verify that your answer makes sense in the case of 0 = 0. 

(c) Interpret your answer in the case of 6 = 180°. 

(d) What is the smallest value of 0 for which S equals or exceeds 
L, so that for larger angles a failure of at least one anchor is more 
likely than it would have been with a single anchor? v 


~ 


80 Mountain climbers with masses m and M are roped together 
while crossing a horizontal glacier when a vertical crevasse opens up 
under the climber with mass M. The climber with mass m drops 
down on the snow and tries to stop by digging into the snow with 
the pick of an ice ax. Alas, this story does not have a happy ending, 
because this doesn’t provide enough friction to stop. Both m and M 
continue accelerating, with M dropping down into the crevasse and 
m being dragged across the snow, slowed only by the kinetic friction 
with coefficient uz, acting between the ax and the snow. There is no 
significant friction between the rope and the lip of the crevasse. 

(a) Find the acceleration a. Vv 
(b) Check the units of your result. 

(c) Check the dependence of your equation on the variables. That 
means that for each variable, you should determine what its effect 
on a should be physically, and then what your answer from part a 
says its effect would be mathematically. g 


Key to symbols: 
easy . typical challenging Mdifficult M& very difficult 
Vv An answer check is available at www.lightandmatter.com. 


to climber 


Problem 79. 


Problem 80. 


Problems 235 


236 


Exercises 
Exercise 3A: Force and Motion 
Equipment: 
2-meter pieces of butcher paper 
wood blocks with hooks 
string 
masses to put on top of the blocks to increase friction 
spring scales (preferably calibrated in Newtons) 


Suppose a person pushes a crate, sliding it across the floor at a certain speed, and then repeats 
the same thing but at a higher speed. This is essentially the situation you will act out in this 
exercise. What do you think is different about her force on the crate in the two situations? 
Discuss this with your group and write down your hypothesis: 


1. First you will measure the amount of friction between the wood block and the butcher paper 
when the wood and paper surfaces are slipping over each other. The idea is to attach a spring 
scale to the block and then slide the butcher paper under the block while using the scale to 
keep the block from moving with it. Depending on the amount of force your spring scale was 
designed to measure, you may need to put an extra mass on top of the block in order to increase 
the amount of friction. It is a good idea to use long piece of string to attach the block to the 
spring scale, since otherwise one tends to pull at an angle instead of directly horizontally. 


First measure the amount of friction force when sliding the butcher paper as slowly as possi- 


Now measure the amount of friction force at a significantly higher speed, say 1 meter per second. 
(If you try to go too fast, the motion is jerky, and it is impossible to get an accurate reading.) 


Discuss your results. Why are we justified in assuming that the string’s force on the block (i.e., 
the scale reading) is the same amount as the paper’s frictional force on the block? 


2. Now try the same thing but with the block moving and the paper standing still. Try two 
different speeds. 


Do your results agree with your original hypothesis? If not, discuss what’s going on. How does 
the block “know” how fast to go? 
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Exercise 3B: Vibrations 

Equipment: 
e air track and carts of two different masses 
e springs 
e spring scales 


spring 


CH)  Ccgnrvenrcennrnncenney 


Place the cart on the air track and attach springs so that it can vibrate. 


1. Test whether the period of vibration depends on amplitude. Try at least two moderate 
amplitudes, for which the springs do not go slack, and at least one amplitude that is large 
enough so that they do go slack. 


2. Try a cart with a different mass. Does the period change by the expected factor, based on 
the equation T = 21\/m/k? 


3. Use a spring scale to pull the cart away from equilibrium, and make a graph of force versus 
position. Is it linear? If so, what is its slope? 


4. Test the equation T = 27,/m/k numerically. 
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Exercise 3C: Worksheet on Resonance 


1. Compare the oscillator’s energies at A, B, C, and D. 
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3. Match the x-t graphs in #2 with the amplitude-frequency graphs below. 


response response 


oe 


frequency frequency 
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Exercise D is on the following two pages. 
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Exercise 3D: Vectors and Motion 


Each diagram on page 241 shows the motion of an object in an x — y plane. Each dot is one 
location of the object at one moment in time. The time interval from one dot to the next is 
always the same, so you can think of the vector that connects one dot to the next as a v vector, 
and subtract to find Av vectors. 


1. Suppose the object in diagram 1 is moving from the top left to the bottom right. Deduce 
whatever you can about the force acting on it. Does the force always have the same magnitude? 
The same direction? 


Invent a physical situation that this diagram could represent. 

What if you reinterpret the diagram, and reverse the object’s direction of motion? 
2. What can you deduce about the force that is acting in diagram 2? 

Invent a physical situation that diagram 2 could represent. 

3. What can you deduce about the force that is acting in diagram 3? 


Invent a physical situation. 
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Chapter 4 


Conservation of Angular Mo- 
mentum 


4.1 Angular Momentum In Two Dimensions 


4.1.1 Angular momentum 


“Sure, and maybe the sun won’t come up tomorrow.” Of course, 
the sun only appears to go up and down because the earth spins, 
so the cliche should really refer to the unlikelihood of the earth’s 
stopping its rotation abruptly during the night. Why can’t it stop? 
It wouldn’t violate conservation of momentum, because the earth’s 
rotation doesn’t add anything to its momentum. While California 
spins in one direction, some equally massive part of India goes the 
opposite way, canceling its momentum. A halt to Earth’s rotation 
would entail a drop in kinetic energy, but that energy could simply 
be converted into some other form, such as heat. 


a/The jumper can’t move his 
legs counterclockwise without 
moving his arms _ clockwise. 
(Thomas Eakins.) 


Other examples along these lines are not hard to find. An atom 
spins at the same rate for billions of years. A high-diver who is 
rotating when he comes off the board does not need to make any 
physical effort to continue rotating, and indeed would be unable to 
stop rotating before he hit the water. 


These observations have the hallmarks of a conservation law: 


A closed system is involved. Nothing is making an effort to twist 
the earth, the hydrogen atom, or the high-diver. They are isolated 
from rotation-changing influences, i.e., they are closed systems. 


Something remains unchanged. There appears to be a numerical 
quantity for measuring rotational motion such that the total amount 
of that quantity remains constant in a closed system. 
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b/An overhead view of a 
piece of putty being thrown at 
a door. Even though the putty 
is neither spinning nor traveling 
along a curve, we must define it 
has having some kind of “rota- 
tion” because it is able to make 
the door rotate. 


Something can be transferred back and forth without changing 
the total amount. In the photo of the old-fashioned high jump, a, 
the jumper wants to get his feet out in front of him so he can keep 
from doing a “face plant” when he lands. Bringing his feet forward 
would involve a certain quantity of counterclockwise rotation, but he 
didn’t start out with any rotation when he left the ground. Suppose 
we consider counterclockwise as positive and clockwise as negative. 
The only way his legs can acquire some positive rotation is if some 
other part of his body picks up an equal amount of negative rotation. 
This is why he swings his arms up behind him, clockwise. 


What numerical measure of rotational motion is conserved? Car 
engines and old-fashioned LP records have speeds of rotation mea- 
sured in rotations per minute (r.p.m.), but the number of rota- 
tions per minute (or per second) is not a conserved quantity. A 
twirling figure skater, for instance, can pull her arms in to increase 
her r.p.m.’s. The first section of this chapter deals with the nu- 
merical definition of the quantity of rotation that results in a valid 
conservation law. 


When most people think of rotation, they think of a solid object 
like a wheel rotating in a circle around a fixed point. Examples of 
this type of rotation, called rigid rotation or rigid-body rotation, in- 
clude a spinning top, a seated child’s swinging leg, and a helicopter’s 
spinning propeller. Rotation, however, is a much more general phe- 
nomenon, and includes noncircular examples such as a comet in 
an elliptical orbit around the sun, or a cyclone, in which the core 
completes a circle more quickly than the outer parts. 


If there is a numerical measure of rotational motion that is a 
conserved quantity, then it must include nonrigid cases like these, 
since nonrigid rotation can be traded back and forth with rigid ro- 
tation. For instance, there is a trick for finding out if an egg is 
raw or hardboiled. If you spin a hardboiled egg and then stop it 
briefly with your finger, it stops dead. But if you do the same with 
a raw egg, it springs back into rotation because the soft interior was 
still swirling around within the momentarily motionless shell. The 
pattern of flow of the liquid part is presumably very complex and 
nonuniform due to the asymmetric shape of the egg and the differ- 
ent consistencies of the yolk and the white, but there is apparently 
some way to describe the liquid’s total amount of rotation with a 
single number, of which some percentage is given back to the shell 
when you release it. 


The best strategy is to devise a way of defining the amount of 
rotation of a single small part of a system. The amount of rotation 
of a system such as a cyclone will then be defined as the total of all 
the contributions from its many small parts. 


The quest for a conserved quantity of rotation even requires us 
to broaden the rotation concept to include cases where the motion 


244 Chapter 4 Conservation of Angular Momentum 


doesn’t repeat or even curve around. If you throw a piece of putty 
at a door, b, the door will recoil and start rotating. The putty was 
traveling straight, not in a circle, but if there is to be a general 
conservation law that can cover this situation, it appears that we 
must describe the putty as having had some “rotation,” which it 
then gave up to the door. The best way of thinking about it is to 
attribute rotation to any moving object or part of an object that 
changes its angle in relation to the axis of rotation. In the putty- 
and-door example, the hinge of the door is the natural point to think 
of as an axis, and the putty changes its angle as seen by someone 
standing at the hinge, c. For this reason, the conserved quantity 
we are investigating is called angular momentum. The symbol for 
angular momentum can’t be “a” or “m,” since those are used for 
acceleration and mass, so the letter L is arbitrarily chosen instead. 


Imagine a 1 kg blob of putty, thrown at the door at a speed of 
1 m/s, which hits the door at a distance of 1 m from the hinge. 
We define this blob to have 1 unit of angular momentum. When 
it hits the door, the door will recoil and start rotating. We can 
use the speed at which the door recoils as a measure of the angular 
momentum the blob brought in.! 


Experiments show, not surprisingly, that a 2 kg blob thrown in 
the same way makes the door rotate twice as fast, so the angular 
momentum of the putty blob must be proportional to mass, 


Lum 


Similarly, experiments show that doubling the velocity of the 
blob will have a doubling effect on the result, so its angular momen- 
tum must be proportional to its velocity as well, 


Lamu 


You have undoubtedly had the experience of approaching a closed 
door with one of those bar-shaped handles on it and pushing on the 
wrong side, the side close to the hinges. You feel like an idiot, be- 
cause you have so little leverage that you can hardly budge the door. 
The same would be true with the putty blob. Experiments would 
show that the amount of rotation the blob can give to the door is 
proportional to the distance, r, from the axis of rotation, so angular 
momentum must be proportional to r as well, 


Lxmor 


We are almost done, but there is one missing ingredient. We 
know on grounds of symmetry that a putty ball thrown directly 


'We assume that the door is much more massive than the blob. Under this 
assumption, the speed at which the door recoils is much less than the original 
speed of the blob, so the blob has lost essentially all its angular momentum, and 
given it to the door. 
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c/ As seen by someone standing 
at the axis, the putty changes 
its angular position. We there- 
fore define it as having angular 
momentum. 


| ¢ 


d/A_ putty blob thrown di- 
rectly at the axis has no angular 
motion, and therefore no angular 
momentum. It will not cause the 
door to rotate. 
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A, 


e/Only the component of 
the velocity vector perpendicular 
to the line connecting the object 
to the axis should be counted 
into the definition of angular 
momentum. 


f/A figure skater pulls in her 
arms so that she can execute a 
spin more rapidly. 


inward toward the hinge will have no angular momentum to give 
to the door. After all, there would not even be any way to de- 
cide whether the ball’s rotation was clockwise or counterclockwise 
in this situation. It is therefore only the component of the blob’s 
velocity vector perpendicular to the door that should be counted in 
its angular momentum, 


L=mvir 


More generally, v_ should be thought of as the component of the 
object’s velocity vector that is perpendicular to the line joining the 
object to the axis of rotation. 


We find that this equation agrees with the definition of the orig- 
inal putty blob as having one unit of angular momentum, and we 
can now sce that the units of angular momentum are (kg-m/s)-m, 
ie., kg-m?/s. Summarizing, we have 


L=mv_r [angular momentum of a particle in two dimensions] 


where m is the particle’s mass, v, is the component of its velocity 
vector perpendicular to the line joining it to the axis of rotation, 
and r is its distance from the axis. (Note that r is not necessarily 
the radius of a circle.) Positive and negative signs of angular mo- 
mentum are used to describe opposite directions of rotation. The 
angular momentum of a finite-sized object or a system of many ob- 
jects is found by dividing it up into many small parts, applying the 
equation to each part, and adding to find the total amount of an- 
gular momentum. (As implied by the word “particle,” matter isn’t 
the only thing that can have angular momentum. Light can also 
have angular momentum, and the above equation would not apply 
to light.) 


Conservation of angular momentum has been verified over and 
over again by experiment, and is now believed to be one of the most 
fundamental principles of physics, along with conservation of mass, 
energy, and momentum. 


A figure skater pulls her arms in. example 1 
When a figure skater is twirling, there is very little friction between 
her and the ice, so she is essentially a closed system, and her 
angular momentum is conserved. If she pulls her arms in, she is 
decreasing r for all the atoms in her arms. It would violate con- 
servation of angular momentum if she then continued rotating at 
the same speed, i.e., taking the same amount of time for each 
revolution, because her arms’ contributions to her angular mo- 
mentum would have decreased, and no other part of her would 
have increased its angular momentum. This is impossible be- 
cause it would violate conservation of angular momentum. If her 
total angular momentum is to remain constant, the decrease in r 
for her arms must be compensated for by an overall increase in 
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her rate of rotation. That is, by pulling her arms in, she substan- 
tially reduces the time for each rotation. 


Earth’s slowing rotation and the receding moon example 2 
The earth’s rotation is actually slowing down very gradually, with 
the kinetic energy being dissipated as heat by friction between the 
land and the tidal bulges raised in the seas by the earth’s gravity. 
Does this mean that angular momentum is not really perfectly 
conserved? No, it just means that the earth is not quite a closed 
system by itself. If we consider the earth and moon as a system, 
then the angular momentum lost by the earth must be gained by 
the moon somehow. In fact very precise measurements of the 
distance between the earth and the moon have been carried out 
by bouncing laser beams off of a mirror left there by astronauts, 
and these measurements show that the moon is receding from 
the earth at a rate of 4 centimeters per year! The moon’s greater 
value of r means that it has a greater angular momentum, and 
the increase turns out to be exactly the amount lost by the earth. 
In the days of the dinosaurs, the days were significantly shorter, 
and the moon was closer and appeared bigger in the sky. 


But what force is causing the moon to speed up, drawing it out 
into a larger orbit? It is the gravitational forces of the earth’s tidal 
bulges. In figure g, the earth’s rotation is counterclockwise (ar- 
row). The moon’s gravity creates a bulge on the side near it, be- 
cause its gravitational pull is stronger there, and an “anti-bulge” 
on the far side, since its gravity there is weaker. For simplicity, let’s 
focus on the tidal bulge closer to the moon. Its frictional force is 
trying to slow down the earth’s rotation, so its force on the earth's 
solid crust is toward the bottom of the figure. By Newton’s third 
law, the crust must thus make a force on the bulge which is to- 
ward the top of the figure. This causes the bulge to be pulled 
forward at a slight angle, and the bulge’s gravity therefore pulls 
the moon forward, accelerating its orbital motion about the earth 
and flinging it outward. 


The result would obviously be extremely difficult to calculate di- 
rectly, and this is one of those situations where a conservation 
law allows us to make precise quantitative statements about the 
outcome of a process when the calculation of the process itself 
would be prohibitively complex. 


Restriction to rotation in a plane 


Is angular momentum a vector, or a scalar? It does have a di- 
rection in space, but it’s a direction of rotation, not a straight-line 
direction like the directions of vectors such as velocity or force. It 
turns out that there is a way of defining angular momentum as a 
vector, but in this section the examples will be confined to a single 
plane of rotation, i.e., effectively two-dimensional situations. In this 


uc 


g/A view of the earth-moon 
system from above the north 
pole. All distances have been 
highly distorted for legibility. 
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h/The area swept out by a 
planet in its orbit. 


special case, we can choose to visualize the plane of rotation from 
one side or the other, and to define clockwise and counterclockwise 
rotation as having opposite signs of angular momentum. “Effec- 
tively” two-dimensional means that we can deal with objects that 
aren’t flat, as long as the velocity vectors of all their parts lie in a 
plane. 


Discussion Questions 


A Conservation of plain old momentum, p, can be thought of as the 
greatly expanded and modified descendant of Galileo’s original principle 
of inertia, that no force is required to keep an object in motion. The princi- 
ple of inertia is counterintuitive, and there are many situations in which it 
appears superficially that a force is needed to maintain motion, as main- 
tained by Aristotle. Think of a situation in which conservation of angular 
momentum, L, also seems to be violated, making it seem incorrectly that 
something external must act on a closed system to keep its angular mo- 
mentum from “running down.” 


4.1.2 Application to planetary motion 


We now discuss the application of conservation of angular mo- 
mentum to planetary motion, both because of its intrinsic impor- 
tance and because it is a good way to develop a visual intuition for 
angular momentum. 


Kepler’s law of equal areas states that the area swept out by 
a planet in a certain length of time is always the same. Angular 
momentum had not been invented in Kepler’s time, and he did not 
even know the most basic physical facts about the forces at work. He 
thought of this law as an entirely empirical and unexpectedly simple 
way of summarizing his data, a rule that succeeded in describing 
and predicting how the planets sped up and slowed down in their 
elliptical paths. It is now fairly simple, however, to show that the 
equal area law amounts to a statement that the planet’s angular 
momentum stays constant. 


There is no simple geometrical rule for the area of a pie wedge 
cut out of an ellipse, but if we consider a very short time interval, 
as shown in figure h, the shaded shape swept out by the planet is 
very nearly a triangle. We do know how to compute the area of a 
triangle. It is one half the product of the base and the height: 


1 
= =bh 
area = 5 


We wish to relate this to angular momentum, which contains the 
variables r and v,. If we consider the sun to be the axis of rotation, 
then the variable r is identical to the base of the triangle, r = b. 
Referring to the magnified portion of the figure, v, can be related 
to h, because the two right triangles are similar: 


h UL 


distance traveled |v| 
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The area can thus be rewritten as 


1 ae (distance traveled) 
2 Iv| 


area = 
The distance traveled equals |v|At, so this simplifies to 
1 
area = ri At 


We have found the following relationship between angular momen- 
tum and the rate at which area is swept out: 

area 

L = 2m—— 

At 
The factor of 2 in front is simply a matter of convention, since any 
conserved quantity would be an equally valid conserved quantity if 
you multiplied it by a constant. The factor of m was not relevant 
to Kepler, who did not know the planets’ masses, and who was only 
describing the motion of one planet at a time. 


We thus find that Kepler’s equal-area law is equivalent to a state- 
ment that the planet’s angular momentum remains constant. But 
wait, why should it remain constant? — the planet is not a closed 
system, since it is being acted on by the sun’s gravitational force. 
There are two valid answers. The first is that it is actually the to- 
tal angular momentum of the sun plus the planet that is conserved. 
The sun, however, is millions of times more massive than the typical 
planet, so it accelerates very little in response to the planet’s gravi- 
tational force. It is thus a good approximation to say that the sun 
doesn’t move at all, so that no angular momentum is transferred 
between it and the planet. 


The second answer is that to change the planet’s angular mo- 
mentum requires not just a force but a force applied in a certain 
way. Later in this section (starting on page 252) we discuss the 
transfer of angular momentum by a force, but the basic idea here is 
that a force directly in toward the axis does not change the angular 
momentum. 


Discussion Questions 


A Suppose an object is simply traveling in a straight line at constant 
speed. If we pick some point not on the line and call it the axis of rotation, 
is area swept out by the object at a constant rate? 


B__ The figure is a strobe photo of a pendulum bob, taken from under- 
neath the pendulum looking straight up. The black string can’t be seen in 
the photograph. The bob was given a slight sideways push when it was 
released, so it did not swing in a plane. The bright spot marks the center, 
i.e., the position the bob would have if it hung straight down at us. Does 
the bob’s angular momentum appear to remain constant if we consider 
the center to be the axis of rotation? 


i / Discussion question B. 
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j / Two asteroids collide. 


k/Everyone has a 
tendency to think of the diver as 
rotating about his own center of 
mass. However, he is flying in an 
arc, and he also has angular mo- 
mentum because of this motion. 


strong 


1/ This rigid object has angu- 
lar momentum both because it is 
spinning about its center of mass 
and because it is moving through 
space. 


4.1.3 Two theorems about angular momentum 


With plain old momentum, p, we had the freedom to work in 
any inertial frame of reference we liked. The same object could 
have different values of momentum in two different frames, if the 
frames were not at rest with respect to each other. Conservation 
of momentum, however, would be true in either frame. As long as 
we employed a single frame consistently throughout a calculation, 
everything would work. 


The same is true for angular momentum, and in addition there 
is an ambiguity that arises from the definition of an axis of rotation. 
For a wheel, the natural choice of an axis of rotation is obviously the 
axle, but what about an egg rotating on its side? The egg has an 
asymmetric shape, and thus no clearly defined geometric center. A 
similar issue arises for a cyclone, which does not even have a sharply 
defined shape, or for a complicated machine with many gears. The 
following theorem, the first of two presented in this section, explains 
how to deal with this issue. Although I have put descriptive titles 
above both theorems, they have no generally accepted names. The 
proofs, given on page 858, use the vector cross-product technique 
introduced in section 4.3, which greatly simplifies them. 


The choice of axis theorem: It is entirely arbitrary what point 
one defines as the axis for purposes of calculating angular momen- 
tum. If a closed system’s angular momentum is conserved when 
calculated with one choice of axis, then it will be conserved for any 
other choice of axis. Likewise, any inertial frame of reference may 
be used. The theorem also holds in the case where the system is not 
closed, but the total external force is zero. 


Colliding asteroids described with different axes example 3 
Observers on planets A and B both see the two asteroids collid- 
ing. The asteroids are of equal mass and their impact speeds are 
the same. Astronomers on each planet decide to define their own 
planet as the axis of rotation. Planet A is twice as far from the col- 
lision as planet B. The asteroids collide and stick. For simplicity, 
assume planets A and B are both at rest. 


With planet A as the axis, the two asteroids have the same amount 
of angular momentum, but one has positive angular momentum 
and the other has negative. Before the collision, the total angular 
momentum is therefore zero. After the collision, the two asteroids 
will have stopped moving, and again the total angular momen- 
tum is zero. The total angular momentum both before and after 
the collision is zero, so angular momentum is conserved if you 
choose planet A as the axis. 


The only difference with planet B as axis is that r is smaller by a 
factor of two, so all the angular momenta are halved. Even though 
the angular momenta are different than the ones calculated by 
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planet A, angular momentum is still conserved. 


The earth spins on its own axis once a day, but simultaneously 
travels in its circular one-year orbit around the sun, so any given 
part of it traces out a complicated loopy path. It would seem difficult 
to calculate the earth’s angular momentum, but it turns out that 
there is an intuitively appealing shortcut: we can simply add up the 
angular momentum due to its spin plus that arising from its center 
of mass’s circular motion around the sun. This is a special case of 
the following general theorem: 


The spin theorem: An object’s angular momentum with respect 
to some outside axis A can be found by adding up two parts: 

(1) The first part is the object’s angular momentum found by using 
its own center of mass as the axis, i.e., the angular momentum the 
object has because it is spinning. 

(2) The other part equals the angular momentum that the object 
would have with respect to the axis A if it had all its mass concen- 
trated at and moving with its center of mass. 


A system with its center of mass at rest example 4 
In the special case of an object whose center of mass is at rest, 
the spin theorem implies that the object's angular momentum is 
the same regardless of what axis we choose. (This is an even 
stronger statement than the choice of axis theorem, which only 
guarantees that angular momentum is conserved for any given 
choice of axis, without specifying that it is the same for all such 
choices.) 


Angular momentum of a rigid object example 5 
> A motorcycle wheel has almost all its mass concentrated at 
the outside. If the wheel has mass m and radius r, and the time 
required for one revolution is T, what is the spin part of its angular 
momentum? 


> This is an example of the commonly encountered special case 
of rigid motion, as opposed to the rotation of a system like a hur- 
ricane in which the different parts take different amounts of time 
to go around. We don’t really have to go through a laborious 
process of adding up contributions from all the many parts of a 
wheel, because they are all at about the same distance from the 
axis, and are all moving around the axis at about the same speed. 
The velocity is all perpendicular to the spokes, 


Vv, = (circumference) /T 
= 2ar/T 
and the angular momentum of the wheel about its center is 
L=mvy,r 
= mM2nr/T)r 
= 2nmr?/T 


Section 4.1. Angular Momentum In Two Dimensions 


251 


eo 


LJ VI 


ee 


m/The plane’s four engines 
produce zero total torque but not 
zero total force. 


axis F 
x oe << 
x e 

F 


n/The simple physical  situa- 
tion we use to derive an equation 
for torque. A force that points 
directly in at or out away from the 
axis produces neither clockwise 
nor counterclockwise angular 
momentum. A force in the per- 
pendicular direction does transfer 
angular momentum. 


Note that although the factors of 27 in this expression is peculiar 
to a wheel with its mass concentrated on the rim, the proportional- 
ity to m/T would have been the same for any other rigidly rotating 
object. Although an object with a noncircular shape does not have 
a radius, it is also true in general that angular momentum is pro- 
portional to the square of the object’s size for fixed values of m and 
T. For instance doubling an object’s size doubles both the v, and 
r factors in the contribution of each of its parts to the total angular 
momentum, resulting in an overall factor of four increase. 


4.1.4 Torque 


Force is the rate of transfer of momentum. The corresponding 
quantity in the case of angular momentum is called torque (rhymes 
with “fork”). Where force tells us how hard we are pushing or 
pulling on something, torque indicates how hard we are twisting on 
it. Torque is represented by the Greek letter tau, 7, and the rate 
of change of an object’s angular momentum equals the total torque 
acting on it, 

Ttotal = dL /dt 


As with force and momentum, it often happens that angular 
momentum recedes into the background and we focus our interest on 
the torques. The torque-focused point of view is exemplified by the 
fact that many scientifically untrained but mechanically apt people 
know all about torque, but none of them have heard of angular 
momentum. Car enthusiasts eagerly compare engines’ torques, and 
there is a tool called a torque wrench which allows one to apply a 
desired amount of torque to a screw and avoid overtightening it. 


Torque distinguished from force 


Of course a force is necessary in order to create a torque — you 
can’t twist a screw without pushing on the wrench — but force and 
torque are two different things. One distinction between them is 
direction. We use positive and negative signs to represent forces in 
the two possible directions along a line. The direction of a torque, 
however, is clockwise or counterclockwise, not a linear direction. 


The other difference between torque and force is a matter of 
leverage. A given force applied at a door’s knob will change the 
door’s angular momentum twice as rapidly as the same force applied 
halfway between the knob and the hinge. The same amount of force 
produces different amounts of torque in these two cases. 


It’s possible to have a zero total torque with a nonzero total 
force. An airplane with four jet engines would be designed so that 
their forces are balanced on the left and right. Their forces are all in 
the same direction, but the clockwise torques of two of the engines 
are canceled by the counterclockwise torques of the other two, giving 
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zero total torque. 


Conversely we can have zero total force and nonzero total torque. 
A merry-go-round’s engine needs to supply a nonzero torque on it 
to bring it up to speed, but there is zero total force on it. If there 
was not zero total force on it, its center of mass would accelerate! 


Relationship between force and torque 


How do we calculate the amount of torque produced by a given 
force? Since it depends on leverage, we should expect it to depend 
on the distance between the axis and the point of application of 
the force. [ll work out an equation relating torque to force for a 
particular very simple situation, and give a more rigorous derivation 
on page 282, after developing some mathematical techniques that 
dramatically shorten and simplify the proof. 


Consider a pointlike object which is initially at rest at a distance 
r from the axis we have chosen for defining angular momentum. 
We first observe that a force directly inward or outward, along the 
line connecting the axis to the object, does not impart any angular 
momentum to the object. 


A force perpendicular to the line connecting the axis and the 
object does, however, make the object pick up angular momentum. 
Newton’s second law gives 


=f] : 


and using a = dv/dt we find the velocity the object acquires after a 
time dt, 
dv = Fdt/m 


We’re trying to relate force to a change in angular momentum, so 
we multiply both sides of the equation by mr to give 


mdur=Fdtr 
dL=Fdtr 


Dividing by dt gives the torque: 


dL 

pcan 

di 
7=Fr 


If a force acts at an angle other than 0 or 90° with respect to the 
line joining the object and the axis, it would be only the component 
of the force perpendicular to the line that would produce a torque, 


T=F ir 


Although this result was proved under a simplified set of circum- 
stances, it is more generally valid:? 


24 proof is given in example 28 on page 282 


o/The geometric relationships 
referred to in the relationship 


between force and torque. 


Section 4.1. Angular Momentum In Two Dimensions 


253 


254 


p / Self-check. 


Relationship between force and torque: The rate at which 
a force transfers angular momentum to an object, i.e., the torque 
produced by the force, is given by 


Pele) xy 


where r is the distance from the axis to the point of application of 
the force, and F', is the component of the force that is perpendicular 
to the line joining the axis to the point of application. 


The equation is stated with absolute value signs because the 
positive and negative signs of force and torque indicate different 
things, so there is no useful relationship between them. The sign 


of the torque must be found by physical inspection of the case at 
hand. 


From the equation, we see that the units of torque can be writ- 
ten as newtons multiplied by meters. Metric torque wrenches are 
calibrated in N-m, but American ones use foot-pounds, which is also 
a unit of distance multiplied by a unit of force. We know from our 
study of mechanical work that newtons multiplied by meters equal 
joules, but torque is a completely different quantity from work, and 
nobody writes torques with units of joules, even though it would be 
technically correct. 


7 ‘x 
(1) (2) (3) 


self-check A 
Compare the magnitudes and signs of the four torques shown in figure 
p. > Answer, p. 868 


How torque depends on the direction of the force example 6 
> How can the torque applied to the wrench in the figure be ex- 
pressed in terms of r, |F|, and the angle 0? 


> The force vector and its F, component form the hypotenuse 
and one leg of a right triangle, 


and the interior angle opposite to F, equals 8. The absolute value 
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of F, can thus be expressed as 
F, =|F| sin, 
leading to 


|t| = r|F| sin@ 


Sometimes torque can be more neatly visualized in terms of the 
quantity r,; shown in the figure on the left, which gives us a third 
way of expressing the relationship between torque and force: 


q/ Visualizing torque in terms of 
I7]=rilFl fe 


Of course you wouldn’t want to go and memorize all three equa- 
tions for torque. Starting from any one of them you could easily 
derive the other two using trigonometry. Familiarizing yourself with 
them can however clue you in to easier avenues of attack on certain 
problems. r 


The torque due to gravity 


Up until now we’ve been thinking in terms of a force that acts | 
at a single point on an object, such as the force of your hand on the 

wrench. This is of course an approximation, and for an extremely 
realistic calculation of your hand’s torque on the wrench you might 
need to add up the torques exerted by each square millimeter where 
your skin touches the wrench. This is seldom necessary. But in 
the case of a gravitational force, there is never any single point at 
which the force is applied. Our planet is exerting a separate tug on 
every brick in the Leaning Tower of Pisa, and the total gravitational 
torque on the tower is the sum of the torques contributed by all the 
little forces. Luckily there is a trick that allows us to avoid such 
a massive calculation. It turns out that for purposes of computing 
the total gravitational torque on an object, you can get the right 
answer by just pretending that the whole gravitational force acts at 
the object’s center of mass. 


r/ Example 7. 


Gravitational torque on an outstretched arm example 7 
> Your arm has a mass of 3.0 kg, and its center of mass is 30 
cm from your shoulder. What is the gravitational torque on your 
arm when it is stretched out horizontally to one side, taking the 
shoulder to be the axis? 


> The total gravitational force acting on your arm is 
|F| = (3.0 kg)(9.8 m/s?) = 29N 
For the purpose of calculating the gravitational torque, we can 


treat the force as if it acted at the arm’s center of mass. The force 
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s / Discussion question D. 


t / Discussion question E. 


is straight down, which is perpendicular to the line connecting the 
shoulder to the center of mass, so 


F, = |F|=29N 


Continuing to pretend that the force acts at the center of the arm, 
r equals 30 cm = 0.30 m, so the torque is 


t=rF, =9N-m 


Discussion Questions 


A This series of discussion questions deals with past students’ incorrect 
reasoning about the following problem. 


Suppose a comet is at the point in its orbit shown in the figure. The 
only force on the comet is the sun’s gravitational force. Throughout the 
question, define all torques and angular momenta using the sun as the 
axis. 


(1) Is the sun producing a nonzero torque on the comet? Explain. 


(2) Is the comet’s angular momentum increasing, decreasing, or staying 
the same? Explain. 


Explain what is wrong with the following answers. In some cases, the 
answer is correct, but the reasoning leading up to it is wrong. 


(a) Incorrect answer to part (1): “Yes, because the sun is exerting a force 
on the comet, and the comet is a certain distance from the sun.” 


(b) Incorrect answer to part (1): “No, because the torques cancel out.” 


(c) Incorrect answer to part (2): “Increasing, because the comet is speed- 
ing up.” 


comet 


u/ Discussion question A. 


B You whirl a rock over your head on the end of a string, and gradually 
pull in the string, eventually cutting the radius in half. What happens to 
the rock’s angular momentum? What changes occur in its speed, the time 
required for one revolution, and its acceleration? Why might the string 
break? 


CA helicopter has, in addition to the huge fan blades on top, a smaller 
propeller mounted on the tail that rotates in a vertical plane. Why? 


D ~~ Which claw hammer would make it easier to get the nail out of the 
wood if the same force was applied in the same direction? 


E The photo shows an amusement park ride whose two cars rotate in 
opposite directions. Why is this a good design? 


256 Chapter 4 Conservation of Angular Momentum 


4.1.5 Applications to statics 


In chapter 2 I defined equilibrium as a situation where the inter- 
action energy is minimized. This is the same as a condition of zero 
total force, or constant momentum. Thus a car is in equilibrium not 
just when it is parked but also when it is cruising down a straight 
road with constant momentum. 


Likewise there are many cases where a system is not closed but 
maintains constant angular momentum. When a merry-go-round 
is running at constant angular momentum, the engine’s torque is 
being canceled by the torque due to friction. 


It’s not enough for a boat not to sink — we’d also like to avoid 
having it capsize. For this reason, we now redefine equilibrium as 
follows. 


When an object has constant momentum and constant angular 
momentum, we say that it is in equilibrium. Again, this is a scientific 
redefinition of the common English word, since in ordinary speech 
nobody would describe a car spinning out on an icy road as being 
in equilibrium. 


Very commonly, however, we are interested in cases where an 
object is not only in equilibrium but also at rest, and this corre- 
sponds more closely to the usual meaning of the word. Statics is the 
branch of physics concerned with problems such as these. 


Solving statics problems is now simply a matter of applying and 
combining some things you already know: 


- You know the behaviors of the various types of forces, for exam- 
ple that a frictional force is always parallel to the surface of contact. 


- You know about vector addition of forces. It is the vector sum 
of the forces that must equal zero to produce equilibrium. 


- You know about torque. The total torque acting on an object 
must be zero if it is to be in equilibrium. 


- You know that the choice of axis is arbitrary, so you can make 
a choice of axis that makes the problem easy to solve. 


In general, this type of problem could involve four equations in 
four unknowns: three equations that say the force components add 
up to zero, and one equation that says the total torque is zero. Most 
cases you'll encounter will not be this complicated. In the example 
below, only the equation for zero total torque is required in order 
to get an answer. 


‘A flagpole example 8 
> A 10-kg flagpole is being held up by a lightweight horizontal 
cable, and is propped against the foot of a wall as shown in the 
figure. If the cable is only capable of supporting a tension of 70 
N, how great can the angle « be without breaking the cable? 


v/The windmills are not closed 
systems, but angular momentum 
is being transferred out of them 
at the same rate it is transferred 
in, resulting in constant angular 
momentum. To get an idea of 
the huge scale of the modern 
windmill farm, note the sizes of 
the trucks and trailers. 


w / Example 8. 
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> All three objects in the figure are supposed to be in equilibrium: 
the pole, the cable, and the wall. Whichever of the three objects 
we pick to investigate, all the forces and torques on it have to 
cancel out. It is not particularly helpful to analyze the forces and 
torques on the wall, since it has forces on it from the ground that 
are not given and that we don’t want to find. We could study the 
forces and torques on the cable, but that doesn’t let us use the 
given information about the pole. The object we need to analyze 
is the pole. 


The pole has three forces on it, each of which may also result in 
a torque: (1) the gravitational force, (2) the cable’s force, and (3) 
the wall’s force. 


We are free to define an axis of rotation at any point we wish, and 
it is helpful to define it to lie at the bottom end of the pole, since 
by that definition the wall’s force on the pole is applied at r = 0 
and thus makes no torque on the pole. This is good, because we 
don’t know what the wall’s force on the pole is, and we are not 
trying to find it. 


With this choice of axis, there are two nonzero torques on the 
pole, a counterclockwise torque from the cable and a clockwise 
torque from gravity. Choosing to represent counterclockwise torques 
as positive numbers, and using the equation |t| = r|F|sin®, we 
have 


lcable| Fcable| SIN 8cable — lgrav|Fgrav| SIN Ograv = 0 
A little geometry gives Ocapie = 90° — x and Ograv = x, SO 
lcable|Fcable| SiN(Q0° — x) — Fgrav|Fgrav| sin « = 0 


The gravitational force can be considered as acting at the pole’s 
center of mass, i.e., at its geometrical center, SO Feapnie is twice 
fgrav, and we can simplify the equation to read 


2|Fcable| Sin(90° — x) — |Fgrav| sin x = 0 


These are all quantities we were given, except for «, which is the 
angle we want to find. To solve for « we need to use the trig 
identity sin(90° — x) = cos x, 

2|Fcable| COS x — |Fgrav| Sin x = 0 : 
which allows us to find 
= 5 |Fcabie| 


tan ax = 
Forav| 


« = tan—! (orem 
|Forav| 


=tan”! (2 x ie x) 


98 N 
= 55° 
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‘Art! example 9 
> The abstract sculpture shown in figure x contains a cube of 
mass mand sides of length b. The cube rests on top of a cylinder, 
which is off-center by a distance a. Find the tension in the cable. 


> There are four forces on the cube: a gravitational force mg, the 
force F7 from the cable, the upward normal force from the cylin- 
der, Fy, and the horizontal static frictional force from the cylinder, 
Fg: 


The total force on the cube in the vertical direction is zero: 


Fy — mg =0 


As our axis for defining torques, it’s convenient to choose the point 
of contact between the cube and the cylinder, because then nei- 
ther Fs nor Fy) makes any torque. The cable’s torque is coun- 
terclockwise, and the torque due to gravity is clockwise. and the 
cylinder’s torque is clockwise. Letting counterclockwise torques 
be positive, and using the convenient equation t = r_F, we find 
the equation for the total torque: 


bFr — Fya=0 


We could also write down the equation saying that the total hori- 
zontal force is zero, but that would bring in the cylinder’s frictional 
force on the cube, which we don’t know and don’t need to find. We 
already have two equations in the two unknowns Fy and Fy, so 
there’s no need to make it into three equations in three unknowns. 
Solving the first equation for F,y = mg, we then substitute into the 
second equation to eliminate Fy, and solve for Fr = (a/b)mg. 


Why is one equilibrium stable and another unstable? Try push- 
ing your own nose to the left or the right. If you push it a millimeter 
to the left, it responds with a gentle force to the right. If you push 
it a centimeter to the left, its force on your finger becomes much 
stronger. The defining characteristic of a stable equilibrium is that 
the farther the object is moved away from equilibrium, the stronger 
the force is that tries to bring it back. 


The opposite is true for an unstable equilibrium. In the top 
figure, the ball resting on the round hill theoretically has zero total 
force on it when it is exactly at the top. But in reality the total 
force will not be exactly zero, and the ball will begin to move off to 
one side. Once it has moved, the net force on the ball is greater than 
it was, and it accelerates more rapidly. In an unstable equilibrium, 
the farther the object gets from equilibrium, the stronger the force 
that pushes it farther from equilibrium. 


This idea can be rephrased in terms of energy. The difference 
between the stable and unstable equilibria shown in figure y is that 
in the stable equilibrium, the energy is at a minimum, and moving 


x / Example 9. 


unstable 


, 6h 
> 


y/Stable and unstable equi- 
libria. 


z/The dancers — equilibrium 
is unstable. If she didn’t con- 
stantly make tiny adjustments, 
she would tip over. 
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aa / Example 10. 
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to either side of equilibrium will increase it, whereas the unstable 
equilibrium represents a maximum. 


Note that we are using the term “stable” in a weaker sense than 
in ordinary speech. A domino standing upright is stable in the sense 
we are using, since it will not spontaneously fall over in response to 
a sneeze from across the room or the vibration from a passing truck. 
We would only call it unstable in the technical sense if it could be 
toppled by any force, no matter how small. In everyday usage, of 
course, it would be considered unstable, since the force required to 
topple it is so small. 


An application of calculus example 10 
> Nancy Neutron is living in a uranium nucleus that is undergoing 
fission. Nancy’s nuclear energy as a function of position can be 
approximated by U = x* — x?, where all the units and numeri- 
cal constants have been suppressed for simplicity. Use calculus 
to locate the equilibrium points, and determine whether they are 
stable or unstable. 


> The equilibrium points occur where the U is at a minimum or 
maximum, and minima and maxima occur where the derivative 
(which equals minus the force on Nancy) is zero. This derivative 
is dU/dx = 4x? — 2x, and setting it equal to zero, we have x = 
0,+1/,/2. Minima occur where the second derivative is positive, 
and maxima where it is negative. The second derivative is 12x* — 
2, which is negative at x = 0 (unstable) and positive at x = +1//2 
(stable). Interpretation: the graph of U is shaped like a rounded 
letter ‘W, with the two troughs representing the two halves of the 
splitting nucleus. Nancy is going to have to decide which half she 
wants to go with. 


4.1.6 Proof of Kepler’s elliptical orbit law 


Kepler determined purely empirically that the planets’ orbits 
were ellipses, without understanding the underlying reason in terms 
of physical law. Newton’s proof of this fact based on his laws of 
motion and law of gravity was considered his crowning achievement 
both by him and by his contemporaries, because it showed that the 
same physical laws could be used to analyze both the heavens and 
the earth. Newton’s proof was very lengthy, but by applying the 
more recent concepts of conservation of energy and angular momen- 
tum we can carry out the proof quite simply and succinctly. This 
subsection can be skipped without losing the continuity of the text. 


The basic idea of the proof is that we want to describe the shape 
of the planet’s orbit with an equation, and then show that this equa- 
tion is exactly the one that represents an ellipse. Newton’s original 
proof had to be very complicated because it was based directly on 
his laws of motion, which include time as a variable. To make any 
statement about the shape of the orbit, he had to eliminate time 
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from his equations, leaving only space variables. But conservation 
laws tell us that certain things don’t change over time, so they have 
already had time eliminated from them. 


There are many ways of representing a curve by an equation, of 
which the most familiar is y = ax+6 for a line in two dimensions. It 
would be perfectly possible to describe a planet’s orbit using an x-y 
equation like this, but remember that we are applying conservation 
of angular momentum, and the space variables that occur in the 
equation for angular momentum are the distance from the axis, r, 
and the angle between the velocity vector and the r vector, which 
we will call ¢. The planet will have ¢ = 90° when it is moving 
perpendicular to the r vector, i.e., at the moments when it is at its 
smallest or greatest distances from the sun. When ¢ is less than 90° 
the planet is approaching the sun, and when it is greater than 90° it 
is receding from it. Describing a curve with an r-¢ equation is like 
telling a driver in a parking lot a certain rule for what direction to 
steer based on the distance from a certain streetlight in the middle 
of the lot. 


The proof is broken into the three parts for easier digestion. 
The first part is a simple and intuitively reasonable geometrical fact 
about ellipses, whose proof we relegate to the caption of figure ac; 
you will not be missing much if you merely absorb the result without 
reading the proof. 


(1) If we use one of the two foci of an ellipse as an axis for 
defining the variables r and ¢, then the angle between the tangent 
line and the line drawn to the other focus is the same as @, i.e., the 
two angles labeled ¢ in the figure are in fact equal. 


The other two parts form the meat of our proof. We state the 
results first and then prove them. 


(2) A planet, moving under the influence of the sun’s gravity 
with less then the energy required to escape, obeys an equation of 


the form 
1 


4/—pr2 + qr 
where p and q are positive constants that depend on the planet’s 
energy and angular momentum and p is greater than zero. 


sin @= 


(3) A curve is an ellipse if and only if its r-¢ equation is of the 


form 
1 


4/—pr2 + qr 
where p and q are positive constants that depend on the size and 
shape of the ellipse. 


sin @= 


Proof of part (2) 


The component of the planet’s velocity vector that is perpen- 
dicular to the r vector is v, = using, so conservation of angular 
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ab/Describing a curve’ by 
giving } as a function of r. 


ac/Proof that the two an- 
gles labeled ¢ are in fact equal: 
The definition of an ellipse is that 
the sum of the distances from 
the two foci stays constant. If we 
move a small distance @ along the 
ellipse, then one distance shrinks 
by an amount £ cos 1, while the 
other grows by cos de. These 
are equal, so 1 = do. 
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ad/ Quantities referred to in 
the proof of part (3). 


momentum tells us that LZ = mrvsing is a constant. Since the 
planet’s mass is a constant, this is the same as the condition 


rv sin @ = constant 


Conservation of energy gives 


1 M 
xe _ (gree = constant 
r 


We solve the first equation for v and plug into the second equa- 
tion to eliminate v. Straightforward algebra then leads to the equa- 
tion claimed above, with the constant p being positive because of 
our assumption that the planet’s energy is insufficient to escape from 
the sun, i.e., its total energy is negative. 


Proof of part (3) 


We define the quantities a, d, and s as shown in figure ad. The 
law of cosines gives 


d? =r? 4 57 — 2rs cos a 
Using a = 180°—2¢ and the trigonometric identities cos(180° —) = 
—cosz and cos2x = 1— 2 sin? x, we can rewrite this as 
d? =7? 4s? —2rs (2sin? 6 — 1) 
Straightforward algebra transforms this into 


(r +s)? —d? 


aie drs 


Since r +s is constant, the top of the fraction is constant, and the 
denominator can be rewritten as 4rs = 4r(constant — r), which is 
equivalent to the desired form. 
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4.2 Rigid-Body Rotation 


4.2.1 Kinematics 


When a rigid object rotates, every part of it (every atom) moves 
in a circle, covering the same angle in the same amount of time, 
a. Every atom has a different velocity vector, b. Since all the 
velocities are different, we can’t measure the speed of rotation of 
the top by giving a single velocity. We can, however, specify its 
speed of rotation consistently in terms of angle per unit time. Let 
the position of some reference point on the top be denoted by its 
angle 0, measured in a circle around the axis. For reasons that will 
become more apparent shortly, we measure all our angles in radians. 
Then the change in the angular position of any point on the top can 
be written as dé, and all parts of the top have the same value of d6 
over a certain time interval dt. We define the angular velocity, w 
(Greek omega), 


dé 
Q=> — 
dt 
(definition of angular velocity; 6 in units of radians] 


which is similar to, but not the same as, the quantity w we defined 
earlier to describe vibrations. The relationship between w and t 
is exactly analogous to that between x and t for the motion of a 
particle through space. 


self-check B 

If two different people chose two different reference points on the top 
in order to define 8=0, how would their 6-t graphs differ? What effect 
would this have on the angular velocities? > Answer, p. 868 


The angular velocity has units of radians per second, rad/s. 
However, radians are not really units at all. The radian measure 
of an angle is defined, as the length of the circular arc it makes, 
divided by the radius of the circle. Dividing one length by another 
gives a unitless quantity, so anything with units of radians is really 
unitless. We can therefore simplify the units of angular velocity, and 
call them inverse seconds, s~!. 


A 78-rpm record example 11 
> In the early 20th century, the standard format for music record- 
ings was a plastic disk that held a single song and rotated at 78 
rpm (revolutions per minute). What was the angular velocity of 
such a disk? 


> If we measure angles in units of revolutions and time in units 
of minutes, then 78 rpm is the angular velocity. Using standard 
physics units of radians/second, however, we have 


78 revolutions 27 radians 1 minute 


=8.2s7! 
1 minute * 1 revolution * 60 seconds 


a/The two atoms cover the 


same angle 
interval. 


in a given time 


b/ Their velocity vectors, how- 
ever, differ in both magnitude and 


direction. 
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a<+—- @ 


c/Analogies between _ rota- 
tional and _ linear quantities. 


Vr 


d/We_ construct a_ coordi- 
nate system that coincides with 
the location and motion of the 
moving point of interest at a 
certain moment. 


In the absence of any torque, a rigid body will rotate indefinitely 
with the same angular velocity. If the angular velocity is changing 
because of a torque, we define an angular acceleration, 


dw 


eer: ; [definition of angular acceleration] 


a 


The symbol is the Greek letter alpha. The units of this quantity are 
2 


rad/s”, or simply s~?. 

The mathematical relationship between w and @ is the same as 
the one between v and «x, and similarly for @ and a. We can thus 
make a system of analogies, c, and recycle all the familiar kinematic 
equations for constant-acceleration motion. 


The synodic period example 12 
Mars takes nearly twice as long as the Earth to complete an orbit. 
If the two planets are alongside one another on a certain day, 
then one year later, Earth will be back at the same place, but 
Mars will have moved on, and it will take more time for Earth to 
finish catching up. Angular velocities add and subtract, just as 
velocity vectors do. If the two planets’ angular velocities are w 
and we, then the angular velocity of one relative to the other is 
W 1 — We. The corresponding period, 1/(1/7; — 1/T2) is known as 
the synodic period. 


A neutron star example 13 
> A neutron star is initially observed to be rotating with an angular 
velocity of 2.0 s~', determined via the radio pulses it emits. If its 
angular acceleration is a constant —1.0 x 10-® s~?, how many 
rotations will it complete before it stops? (In reality, the angular 
acceleration is not always constant; sudden changes often occur, 
and are referred to as “starquakes!”) 


> The equation v?—v?=2aAx can be translated into w?—w?=2aA0, 
giving 
AO = (w? — w?)/2x 
= 2.0 x 10° radians 
= 3.2 x 10’ rotations 


4.2.2 Relations between angular quantities and motion of a 
point 
It is often necessary to be able to relate the angular quantities 
to the motion of a particular point on the rotating object. As we 
develop these, we will encounter the first example where the advan- 
tages of radians over degrees become apparent. 


The speed at which a point on the object moves depends on 
both the object’s angular velocity w and the point’s distance r from 
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the axis. We adopt a coordinate system, d, with an inward (radial) 
axis and a tangential axis. The length of the infinitesimal circular 
arc ds traveled by the point in a time interval dt is related to d6 
by the definition of radian measure, d@ = ds/r, where positive and 
negative values of ds represent the two possible directions of motion 
along the tangential axis. We then have vu, = ds/dt = rd@/dt = wr, 
or 


UL = WT : [tangential velocity of a point at a 


distance r from the axis of rotation] 


The radial component is zero, since the point is not moving inward 
or outward, 


vr = 0 ; [radial velocity of a point at a 


distance r from the axis of rotation] 


Note that we had to use the definition of radian measure in this 
derivation. Suppose instead we had used units of degrees for our an- 
gles and degrees per second for angular velocities. The relationship 
between dOdegrees and ds is dO degrees = (360/27) s/r, where the extra 
conversion factor of (360/27) comes from that fact that there are 360 
degrees in a full circle, which is equivalent to 27 radians. The equa- 
tion for v; would then have been vu; = (27/360) (Wdegrees per second) (1), 
which would have been much messier. Simplicity, then, is the rea- 
son for using radians rather than degrees; by using radians we avoid 
infecting all our equations with annoying conversion factors. 


Since the velocity of a point on the object is directly proportional 
to the angular velocity, you might expect that its acceleration would 
be directly proportional to the angular acceleration. This is not true, 
however. Even if the angular acceleration is zero, i.e., if the object 
is rotating at constant angular velocity, every point on it will have 
an acceleration vector directed toward the axis, e. As derived on 
page 207, the magnitude of this acceleration is 


Ap = Wr : [radial acceleration of a point 


at a distance r from the axis] 


For the tangential component, any change in the angular velocity 
dw will lead to a change dw-r in the tangential velocity, so it is easily 
shown that 


at = ar : [tangential acceleration of a point 


at a distance r from the axis] 


self-check C 

Positive and negative signs of w represent rotation in opposite direc- 
tions. Why does it therefore make sense physically that w is raised to 
the first power in the equation for vy; and to the second power in the one 
for a,? > Answer, p. 868 


e/Even if the rotating object 
has zero angular acceleration, 
every point on it has an ac- 
celeration towards the center. 
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Radial acceleration at the surface of the Earth example 14 
> What is your radial acceleration due to the rotation of the earth 
if you are at the equator? 


> At the equator, your distance from the Earth’s rotation axis is 
the same as the radius of the spherical Earth, 6.4 x 10° m. Your 
angular velocity is 


a 27 radians 
~~ 1 day 
2753410-%s 


which gives an acceleration of 


ar = wr 


= 0.034 m/s? 


The angular velocity was a very small number, but the radius was 
avery big number. Squaring a very small number, however, gives 
a very very small number, so the w® factor “wins,” and the final 
result is small. 


If you’re standing on a bathroom scale, this small acceleration is 
provided by the imbalance between the downward force of gravity 
and the slightly weaker upward normal force of the scale on your 
foot. The scale reading is therefore a little lower than it should 
be. 


4.2.3 Dynamics 


If we want to connect all this kinematics to anything dynamical, 
we need to see how it relates to torque and angular momentum. 
Our strategy will be to tackle angular momentum first, since angu- 
lar momentum relates to motion, and to use the additive property 
of angular momentum: the angular momentum of a system of par- 
ticles equals the sum of the angular momenta of all the individual 
particles. The angular momentum of one particle within our rigidly 
rotating object, L = mv ,r, can be rewritten as L = rp sin 0, 
where r and p are the magnitudes of the particle’s r and momen- 
tum vectors, and @ is the angle between these two vectors. (The r 
vector points outward perpendicularly from the axis to the parti- 
cle’s position in space.) In rigid-body rotation the angle @ is 90°, 
so we have simply L = rp. Relating this to angular velocity, we 
have L = rp = (r)(mv) = (r)(mwr) = mr?w. The particle’s con- 
tribution to the total angular momentum is proportional to w, with 
a proportionality constant mr?. We refer to mr? as the particle’s 
contribution to the object’s total moment of inertia, I, where “mo- 
ment” is used in the sense of “important,” as in “momentous” — a 
bigger value of J tells us the particle is more important for deter- 
mining the total angular momentum. The total moment of inertia 
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is 
[= S> mir? ; [definition of the moment of inertia; 


for rigid-body rotation in a plane; r is the distance 
from the axis, measured perpendicular to the axis] 


The angular momentum of a rigidly rotating body is then 


Laty : [angular momentum of 


rigid-body rotation in a plane] 


Since torque is defined as dL /dt, and a rigid body has a constant 
moment of inertia, we have 7 = dL /dt = Idw/dt = Ia, 


T=TIa ; [relationship between torque and 


angular acceleration for rigid-body rotation in a plane] 
which is analogous to F = ma. 


The complete system of analogies between linear motion and 
rigid-body rotation is given in figure f. 


A barbell example 15 
> The barbell shown in figure g consists of two small, dense, mas- 
sive balls at the ends of a very light rod. The balls have masses 
of 2.0 kg and 1.0 kg, and the length of the rod is 3.0 m. Find 
the moment of inertia of the rod (1) for rotation about its center of 
mass, and (2) for rotation about the center of the more massive 
ball. 


> (1) The ball’s center of mass lies 1/3 of the way from the greater 
mass to the lesser mass, i.e., 1.0 m from one and 2.0 m from the 
other. Since the balls are small, we approximate them as if they 
were two pointlike particles. The moment of inertia is 
| = (2.0 kg)(1.0 m)? + (1.0 kg)(2.0 m)? 

= 2.0 kg-m? + 4.0 kg-m? 

= 6.0 kg-m? 
Perhaps counterintuitively, the less massive ball contributes far 
more to the moment of inertia. 


(2) The big ball theoretically contributes a little bit to the moment 
of inertia, since essentially none of its atoms are exactly at r=0. 
However, since the balls are said to be small and dense, we as- 
sume all the big ball’s atoms are so close to the axis that we can 
ignore their small contributions to the total moment of inertia: 


! = (1.0 kg)(3.0 m)? 
= 9.0 kg-m? 


This example shows that the moment of inertia depends on the 
choice of axis. For example, it is easier to wiggle a pen about its 
center than about one end. 


p 
F 


f / Analogies 
tional and 


<M 


between 


linear 


L 


Tv 


rota- 


quantities. 


© 


axis2 axis 1 


g/ Example 15 
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The parallel axis theorem example 16 
> Generalizing the previous example, suppose we pick any axis 
parallel to axis 1, but offset from it by a distance h. Part (2) of 
the previous example then corresponds to the special case of 
h = —1.0 m (negative being to the left). What is the moment of 
inertia about this new axis? 


> The big ball’s distance from the new axis is (1.0 m)+h, and the 
small one’s is (2.0 m)-h. The new moment of inertia is 


/ = (2.0 kg)[(1.0 m)+H]* + (1.0 kg)[(2.0 m) — A]? 
= 6.0 kg-m? + (4.0 kg-m)h — (4.0 kg-m)h + (3.0 kg)h? 


The constant term is the same as the moment of inertia about the 
center-of-mass axis, the first-order terms cancel out, and the third 
term is just the total mass multiplied by h?. The interested reader 
will have no difficulty in generalizing this to any set of particles 
(problem 38, p. 292), resulting in the parallel axis theorem: If an 
object of total mass M rotates about a line at a distance h from 
its center of mass, then its moment of inertia equals Jem + Mh’, 
where Icom is the moment of inertia for rotation about a parallel line 
through the center of mass. 


Scaling of the moment of inertia example 17 
> (1) Suppose two objects have the same mass and the same 
shape, but one is less dense, and larger by a factor k. How do 
their moments of inertia compare? 

(2) What if the densities are equal rather than the masses? 


> (1) This is like increasing all the distances between atoms by a 
factor k. All the rs become greater by this factor, so the moment 
of inertia is increased by a factor of k?. 

(2) This introduces an increase in mass by a factor of k®, so the 
moment of inertia of the bigger object is greater by a factor of 
k®. 


4 Iterated integrals 


In various places in this book, starting with subsection 4.2.5, 
we’ll come across integrals stuck inside other integrals. These are 
known as iterated integrals, or double integrals, triple integrals, etc. 
Similar concepts crop up all the time even when you’re not doing 
calculus, so let’s start by imagining such an example. Suppose you 
want to count how many squares there are on a chess board, and you 
don’t know how to multiply eight times eight. You could start from 
the upper left, count eight squares across, then continue with the 
second row, and so on, until you how counted every square, giving 
the result of 64. In slightly more formal mathematical language, 
we could write the following recipe: for each row, r, from 1 to 8, 
consider the columns, c, from 1 to 8, and add one to the count for 
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each one of them. Using the sigma notation, this becomes 


8 


a 


r=1-c=1 


If you’re familiar with computer programming, then you can think 
of this as a sum that could be calculated using a loop nested inside 
another loop. To evaluate the result (again, assuming we don’t know 
how to multiply, so we have to use brute force), we can first evaluate 
the inside sum, which equals 8, giving 


Notice how the “dummy” variable c has disappeared. Finally we do 
the outside sum, over r, and find the result of 64. 


Now imagine doing the same thing with the pixels on a TV 
screen. The electron beam sweeps across the screen, painting the 
pixels in each row, one at a time. This is really no different than the 
example of the chess board, but because the pixels are so small, you 
normally think of the image on a TV screen as continuous rather 
than discrete. This is the idea of an integral in calculus. Suppose 
we want to find the area of a rectangle of width a and height b, and 
we don’t know that we can just multiply to get the area ab. The 
brute force way to do this is to break up the rectangle into a grid of 
infinitesimally small squares, each having width dx and height dy, 
and therefore the infinitesimal area dA = dxdy. For convenience, 
we'll imagine that the rectangle’s lower left corner is at the origin. 
Then the area is given by this integral: 


b a 
area = i dA 
y=0 Jx=0 
a 


b 
l dx dy 
y=0 /xz=0 


Notice how the leftmost integral sign, over y, and the rightmost dif- 
ferential, dy, act like bookends, or the pieces of bread on a sandwich. 
Inside them, we have the integral sign that runs over x, and the dif- 
ferential dx that matches it on the right. Finally, on the innermost 
layer, we’d normally have the thing we’re integrating, but here’s it’s 
1, so ve omitted it. Writing the lower limits of the integrals with 
x = and y = helps to keep it straight which integral goes with with 
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differential. The result is 


Area of a triangle example 18 
> Find the area of a 45-45-90 right triangle having legs a. 


> Let the triangle’s hypotenuse run from the origin to the point 
(a, a), and let its legs run from the origin to (0, a), and then to 
(a, a). In other words, the triangle sits on top of its hypotenuse. 
Then the integral can be set up the same way as the one before, 
but for a particular value of y, values of x only run from 0 (on the 
y axis) to y (on the hypotenuse). We then have 


a py 
area = i dA 
y=0 /x=0 


a y 
| dx dy 
y=0 / x=0 
a y 
(Lae 
y=0 x=0 
a 
- | y dy 
y=0 


ane 


Note that in this example, because the upper end of the x values 
depends on the value of y, it makes a difference which order we 
do the integrals in. The x integral has to be on the inside, and we 
have to do it first. 


Volume of a cube example 19 
> Find the volume of a cube with sides of length a. 


> This is a three-dimensional example, so we'll have integrals 
nested three deep, and the thing we're integrating is the volume 
dV = dxdydz. 
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a a a 
volume - | / dx dy dz 
z=0 J y=0 / x=0 
a a 
i, ‘ ady dz 
z=0 / y=0 
a a 
=4 i | dy dz 
z=0 / y=0 


Area of a circle example 20 
> Find the area of a circle. 


> To make it easy, let’s find the area of a semicircle and then 
double it. Let the circle’s radius be r, and let it be centered on the 
origin and bounded below by the x axis. Then the curved edge 
is given by the equation r2 = x* + y?, or y = Vr2— x2. Since 
the y integral’s limit depends on x, the x integral has to be on the 
outside. The area is 


r V re —xe 
area = | i, dy dx 
x y 


=—/ =0 


=f Vr2 — x2dx 
arf anteater 


Substituting u = x/r, 


, 
area=r? / V1 —u2du 
u=—1 


The definite integal equals 7, as you can find using a trig substi- 
tution or simply by looking it up in a table, and the result is, as 
expected, mr? /2 for the area of the semicircle. 


4.2.5 Finding moments of inertia by integration 


When calculating the moment of inertia of an ordinary-sized ob- 
ject with perhaps 107° atoms, it would be impossible to do an actual 
sum over atoms, even with the world’s fastest supercomputer. Cal- 
culus, however, offers a tool, the integral, for breaking a sum down 
to infinitely many small parts. If we don’t worry about the exis- 
tence of atoms, then we can use an integral to compute a moment 
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of inertia as if the object was smooth and continuous throughout, 
rather than granular at the atomic level. Of course this granular- 
ity typically has a negligible effect on the result unless the object 
is itself an individual molecule. This subsection consists of three 
examples of how to do such a computation, at three distinct levels 
of mathematical complication. 


Moment of inertia of a thin rod 


What is the moment of inertia of a thin rod of mass M and 
length LE about a line perpendicular to the rod and passing through 
its center? We generalize the discrete sum 


I= So mer? 


to a continuous one, 


r= fam 


L/2 M 
=) a” — da [r = |a|, so r? = 2] 
a ee 


1 
=— ML? 
12 


In this example the object was one-dimensional, which made 
the math simple. The next example shows a strategy that can be 
used to simplify the math for objects that are three-dimensional, 
but possess some kind of symmetry. 


Moment of inertia of a disk 
What is the moment of inertia of a disk of radius b, thickness t, 
and mass M, for rotation about its central axis? 


We break the disk down into concentric circular rings of thick- 
ness dr. Since all the mass in a given circular slice has essentially 
the same value of r (ranging only from r to r + dr), the slice’s con- 
tribution to the total moment of inertia is simply r2?dm. We then 


have 
I= / r?dm 


= f Ppav ; 


where V = 7b?t is the total volume, p = M/V = M/nb?t is the 
density, and the volume of one slice can be calculated as the volume 
enclosed by its outer surface minus the volume enclosed by its inner 
surface, dV = r(r + dr)?t — rr*t = 2rtrdr. 


b 
M 
pf r?—— 2ntr dr 
0 abt 


1 
==Mb’ 
2 
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In the most general case where there is no symmetry about the 
rotation axis, we must use iterated integrals, as discussed in subsec- 
tion 4.2.4. The example of the disk possessed two types of symme- 
try with respect to the rotation axis: (1) the disk is the same when 
rotated through any angle about the axis, and (2) all slices perpen- 
dicular to the axis are the same. These two symmetries reduced the 
number of layers of integrals from three to one. The following ex- 
ample possesses only one symmetry, of type (2), and we simply set 
it up as a triple integral. You may not have seen multiple integrals 
yet in a math course. If so, just skim this example. 


Moment of inertia of a cube 


What is the moment of inertia of a cube of side b, for rotation 
about an axis that passes through its center and is parallel to four 
of its faces? Let the origin be at the center of the cube, and let x 
be the rotation axis. 


r= [Pam 
=p [ Pav 


b/2 pb/2 i 
=p | 1. (y? 3 ) dx dy dz 


b/2 J—b/2 J —b/2 
b/2 ey 

= pb [ (y? dig? ) dy dz 
b/2 J—b/2 


The fact that the last step is a trivial integral results from the sym- 
metry of the problem. The integrand of the remaining double in- 
tegral breaks down into two terms, each of which depends on only 
one of the variables, so we break it into two integrals, 


b/2  pb/2 b/2  pb/2 
= pb [ i: y?dy dz + pb / / z*dy dz 
b/2 J—b/2 b/2 J—b/2 
which we know have identical results. We therefore only need to 
evaluate one of them and double the result: 


b/2  pb/2 
— T= 2pb f i: z-dydz 
b/2 J—b/2 


Figure h shows the moments of inertia of some shapes, which 
were evaluated with techniques like these. 
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h/Moments of inertia of some 
geometric shapes. 


2 1 
l=mR Se 2 
FG mb 


i 1 2 
———s = er 
: | S b 


wheel or hoop of radius R, with thin rod of length b cube with sides of length b 


its mass concentrated on the rim rotating end over end 


2 2 = 2 
lI==mR l= >mR 
B i=t.mr2 dene 


sphere of radius R with cylinder of radius R cylinder of radius R and 
uniform density throughout rotating about its axis length b rotating end over end 


The hammer throw example 21 
> In the men’s Olympic hammer throw, a steel ball of radius 6.1 cm 
is swung on the end of a wire of length 1.22 m. What fraction of 
the ball’s angular momentum comes from its rotation, as opposed 
to its motion through space? 


> It’s always important to solve problems symbolically first, and 
plug in numbers only at the end, so let the radius of the ball be b, 
and the length of the wire @. If the time the ball takes to go once 
around the circle is 7, then this is also the time it takes to revolve 
once around its own axis. Its speed is v = 27¢/T, so its angular 
momentum due to its motion through space is mvé = 2nm(?/T. 
Its angular momentum due to its rotation around its own cen- 
ter is (4/5)mb?/T. The ratio of these two angular momenta is 
(2/5)(b/é)* = 1.0x10~-%. The angular momentum due to the ball’s 
spin is extremely small. 


Toppling a rod example 22 

> A rod of length b and mass m stands upright. We want to strike 
the rod at the bottom, causing it to fall and land flat. Find the 
momentum, p, that should be delivered, in terms of m, b, and 
g. Can this really be done without having the rod scrape on the 
floor? 


> This is a nice example of a question that can very nearly be 
answered based only on units. Since the three variables, m, b, 
and g, all have different units, they can’t be added or subtracted. 
The only way to combine them mathematically is by multiplication 
or division. Multiplying one of them by itself is exponentiation, so 
in general we expect that the answer must be of the form 


i/ Example 22. p= Amibkg! 


P) 


where A, /, k, and / are unitless constants. The result has to have 
units of kg-m/s. To get kilograms to the first power, we need 


fads <5 
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meters to the first power requires 
kK+/l=1 P 

and seconds to the power —1 implies 
[=1/2 


We find j = 1, k = 1/2, and / = 1/2, so the solution must be of the 


form 
p=Amv\/bg 
Note that no physics was required! 


Consideration of units, however, won’t help us to find the unit- 
less constant A. Let t be the time the rod takes to fall, so that 
(1/2)gt® = b/2. If the rod is going to land exactly on its side, 
then the number of revolutions it completes while in the air must 
be 1/4, or 3/4, or 5/4, ..., but all the possibilities greater than 1/4 
would cause the head of the rod to collide with the floor prema- 
turely. The rod must therefore rotate at a rate that would cause 
it to complete a full rotation in a time 7 = 4t, and it has angular 
momentum L = (7/6)mb?/T. 


The momentum lost by the object striking the rod is p, and by 
conservation of momentum, this is the amount of momentum, in 
the horizontal direction, that the rod acquires. In other words, 
the rod will fly forward a little. However, this has no effect on 
the solution to the problem. More importantly, the object striking 
the rod loses angular momentum bp/2, which is also transferred 
to the rod. Equating this to the expression above for L, we find 
p = (x/12)my/bg. 

Finally, we need to know whether this can really be done without 
having the foot of the rod scrape on the floor. The figure shows 
that the answer is no for this rod of finite width, but it appears 
that the answer would be yes for a sufficiently thin rod. This is 
analyzed further in homework problem 37 on page 292. 
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4.3. Angular Momentum In Three Dimensions 


Conservation of angular momentum produces some surprising phe- 
nomena when extended to three dimensions. Try the following ex- 
periment, for example. Take off your shoe, and toss it in to the air, 
making it spin along its long (toe-to-heel) axis. You should observe 
a nice steady pattern of rotation. The same happens when you spin 
the shoe about its shortest (top-to-bottom) axis. But something 
unexpected happens when you spin it about its third (left-to-right) 
axis, which is intermediate in length between the other two. Instead 
of a steady pattern of rotation, you will observe something more 
complicated, with the shoe changing its orientation with respect to 
the rotation axis. 


4.3.1 Rigid-body kinematics in three dimensions 


How do we generalize rigid-body kinematics to three dimensions? 
When we wanted to generalize the kinematics of a moving particle 
to three dimensions, we made the numbers r, v, and a into vectors 
r, v, and a. This worked because these quantities all obeyed the 
same laws of vector addition. For instance, one of the laws of vector 
addition is that, just like addition of numbers, vector addition gives 
the same result regardless of the order of the two quantities being 
added. Thus you can step sideways 1 m to the right and then 
step forward 1 m, and the end result is the same as if you stepped 
forward first and then to the side. In order words, it didn’t matter 
whether you took Ar; + Arg or Arg + Ar. In math this is called 
the commutative property of addition. 


a / Performing the rotations in one ——, 
order gives one result, 3, while re- 
versing the order gives a different Physics 


result, 5. 
®| > 


(2) Rotated 90 (3) Rotated about 
degrees about x. x, then y. 
(1) The book in its » v a 
original orientation. Le . 7 _ 
iz > 
[ | > = 
(4) Rotated 90 
degrees about y. (5) Rotated about 
y, then x. 


Angular motion, unfortunately doesn’t have this property, as 
shown in figure a. Doing a rotation about the x axis and then 
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about y gives one result, while doing them in the opposite order 
gives a different result. These operations don’t “commute,” i.e., it 
makes a difference what order you do them in. 


This means that there is in general no possible way to construct 
a A@ vector. However, if you try doing the operations shown in 
figure a using small rotation, say about 10 degrees instead of 90, 
you'll find that the result is nearly the same regardless of what 
order you use; small rotations are very nearly commutative. Not 
only that, but the result of the two 10-degree rotations is about the 
same as a single, somewhat larger, rotation about an axis that lies 
symmetrically at between the x and y axes at 45 degree angles to 
each one. This is exactly what we would expect if the two small 
rotations did act like vectors whose directions were along the axis 
of rotation. We therefore define a d@ vector whose magnitude is 
the amount of rotation in units of radians, and whose direction is 
along the axis of rotation. Actually this definition is ambiguous, 
because there it could point in either direction along the axis. We 
therefore use a right-hand rule as shown in figure b to define the 
direction of the d@ vector, and the w vector, w = d@/dt, based on 
it. Aliens on planet Tammyfaye may decide to define it using their 
left hands rather than their right, but as long as they keep their 
scientific literature separate from ours, there is no problem. When 
entering a physics exam, always be sure to write a large warning note 
on your left hand in magic marker so that you won’t be tempted to 
use it for the right-hand rule while keeping your pen in your right. 


self-check D 
Use the right-hand rule to determine the directions of the w vectors in 
each rotation shown in figures a/1 through a/5. > Answer, p. 868 


Because the vector relationships among d@, w, and q@ are strictly 
analogous to the ones involving dr, v, and a (with the proviso that 
we avoid describing large rotations using A@ vectors), any operation 
in r-v-a vector kinematics has an exact analog in 0-w-q@ kinematics. 


Result of successive 10-degree rotations example 23 
> What is the result of two successive (positive) 10-degree rota- 
tions about the x and y axes? That is, what single rotation about a 
single axis would be equivalent to executing these in succession? 


> The result is only going to be approximate, since 10 degrees 
is not an infinitesimally small angle, and we are not told in what 
order the rotations occur. To some approximation, however, we 
can add the Aé vectors in exactly the same way we would add Ar 
vectors, SO we have 


AO = AO; + Ado 
~ (10 degrees)x + (10 degrees)y 


This is a vector with a magnitude of \/(10 deg)? + (10 deg) = 


direction of | 
A@ and w 
vectors 


direction 
of rotation 


b/The right-hand rule for 
associating a vector with a 
direction of rotation. 
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14 deg, and it points along an axis midway between the x and y 
axes. 


4.3.2 Angular momentum in three dimensions 
The vector cross product 


In order to expand our system of three-dimensional kinematics to 
include dynamics, we will have to generalize equations like vy, = wr, 
7 = rFsin@,;r, and L = rpsin6@;p, each of which involves three 
quantities that we have either already defined as vectors or that we 
want to redefine as vectors. Although the first one appears to differ 
from the others in its form, it could just as well be rewritten as 
Ut = wr sin 6,,,, since 0, = 90°, and sin @,,, = 1. 


It thus appears that we have discovered something general about 
the physically useful way to relate three vectors in a multiplicative 
way: the magnitude of the result always seems to be proportional to 
the product of the magnitudes of the two vectors being “multiplied,” 
and also to the sine of the angle between them. 


Is this pattern just an accident? Actually the sine factor has 
a very important physical property: it goes to zero when the two 
vectors are parallel. This is a Good Thing. The generalization of 
angular momentum into a three-dimensional vector, for example, is 
presumably going to describe not just the clockwise or counterclock- 
wise nature of the motion but also from which direction we would 
have to view the motion so that it was clockwise or counterclock- 
wise. (A clock’s hands go counterclockwise as seen from behind the 
clock, and don’t rotate at all as seen from above or to the side.) Now 
suppose a particle is moving directly away from the origin, so that 
its r and p vectors are parallel. It is not going around the origin 
from any point of view, so its angular momentum vector had better 
be zero. 


Thinking in a slightly more abstract way, we would expect the 
angular momentum vector to point perpendicular to the plane of 
motion, just as the angular velocity vector points perpendicular to 
the plane of motion. The plane of motion is the plane containing 
both r and p, if we place the two vectors tail-to-tail. But if r and 
p are parallel and are placed tail-to-tail, then there are infinitely 
many planes containing them both. To pick one of these planes in 
preference to the others would violate the symmetry of space, since 
they should all be equally good. Thus the zero-if-parallel property 
is a necessary consequence of the underlying symmetry of the laws 
of physics. 


The following definition of a kind of vector multiplication is con- 
sistent with everything we’ve seen so far, and later we’ll prove that 
the definition is unique, i.e., if we believe in the symmetry of space, 
it is essentially the only way of defining the multiplication of two 


Chapter 4 Conservation of Angular Momentum 


vectors to produce a third vector: 


Definition of the vector cross product: 

The cross product A x B of two vectors A and B is defined as 
follows: 

(1) Its magnitude is defined by |A x B| = |A||B] sin 042, where 042 
is the angle between A and B when they are placed tail-to-tail. 

(2) Its direction is along the line perpendicular to both A and B. 
Of the two such directions, it is the one that obeys the right-hand 
rule shown in figure c. 


The name “cross product” refers to the symbol, and distin- 
guishes it from the dot product, which acts on two vectors but 
produces a scalar. 


Although the vector cross-product has nearly all the properties 
of numerical multiplication, e.g., Ax (B+C)=AxB+A~xC, it 
lacks the usual property of commutativity. Try applying the right- 
hand rule to find the direction of the vector cross product B x A 
using the two vectors shown in the figure. This requires starting 
with a flattened hand with the four fingers pointing along B, and 
then curling the hand so that the fingers point along A. The only 
possible way to do this is to point your thumb toward the floor, in 
the opposite direction. Thus for the vector cross product we have 


AxB=-BxA ; 


a property known as anticommutativity. The vector cross product 
is also not associative, ie., A x (B x C) is usually not the same as 
(A x B) x C. 


A geometric interpretation of the cross product, d, is that if both 
A and B are vectors with units of distance, then the magnitude of 
their cross product can be interpreted as the area of the parallelo- 
gram they form when placed tail-to-tail. 


A useful expression for the components of the vector cross prod- 
uct in terms of the components of the two vectors being multiplied 
is as follows: 


(A x B), = AyBz — ByAz 
(A x B)y = A: Br — BeAr 
(A x B), = A;By — BrAy 


I'll prove later that these expressions are equivalent to the previ- 
ous definition of the cross product. Although they may appear 
formidable, they have a simple structure: the subscripts on the right 
are the other two besides the one on the left, and each equation is 
related to the preceding one by a cyclic change in the subscripts, 
e. If the subscripts were not treated in some completely symmetric 


AxB 


= 


a 
wp 


c/The — right-hand 


~~ _—__ 


rule 


or 


the direction of the vector cross 


product. 


Z 


d/The magnitude of the cross 
product is the area of the shaded 


parallelogram. 


Z 


X 


e/A cyclic change in the x, 


y, and z subscripts. 
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Ss 


V 


f/The position and momen- 
tum vectors of an atom in the 
spinning top. 


L 
— 
e 
g/The right-hand rule — for 


the atom’s contribution to the 
angular momentum. 


manner like this, then the definition would provide some way to dis- 
tinguish one axis from another, which would violate the symmetry 
of space. 


self-check E 

Show that the component equations are consistent with the rule A x B = 

—BxA. > Answer, p. 868 
Angular momentum in three dimensions 


In terms of the vector cross product, we have: 


v=wxr 
L=rxp 
T=rxF 


But wait, how do we know these equations are even correct? 
For instance, how do we know that the quantity defined by r x p 
is in fact conserved? Well, just as we saw on page 210 that the 
dot product is unique (i.e., can only be defined in one way while 
observing rotational invariance), the cross product is also unique, 
as proved on page 857. If r x p was not conserved, then there could 
not be any generally conserved quantity that would reduce to our 
old definition of angular momentum in the special case of plane 
rotation. This doesn’t prove conservation of angular momentum 
— only experiments can prove that — but it does prove that if 
angular momentum is conserved in three dimensions, there is only 
one possible way to generalize from two dimensions to three. 


Angular momentum of a spinning top example 24 
As an illustration, we consider the angular momentum of a spin- 
ning top. Figures f and g show the use of the vector cross prod- 
uct to determine the contribution of a representative atom to the 
total angular momentum. Since every other atom’s angular mo- 
mentum vector will be in the same direction, this will also be the 
direction of the total angular momentum of the top. This happens 
to be rigid-body rotation, and perhaps not surprisingly, the angu- 
lar momentum vector is along the same direction as the angular 
velocity vector. 


Three important points are illustrated by this example: (1) 
When we do the full three-dimensional treatment of angular mo- 
mentum, the “axis” from which we measure the position vectors is 
just an arbitrarily chosen point. If this had not been rigid-body 
rotation, we would not even have been able to identify a single line 
about which every atom circled. (2) Starting from figure f, we had 
to rearrange the vectors to get them tail-to-tail before applying the 
right-hand rule. If we had attempted to apply the right-hand rule 
to figure f, the direction of the result would have been exactly the 
opposite of the correct answer. (3) The equation L = r x p cannot 
be applied all at once to an entire system of particles. The total 
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momentum of the top is zero, which would give an erroneous result 
of zero angular momentum (never mind the fact that r is not well 
defined for the top as a whole). 


Doing the right-hand rule like this requires some practice. I 
urge you to make models like g out of rolled up pieces of paper and 
to practice with the model in various orientations until it becomes 
natural. 


Precession example 25 
Figure h shows a counterintuitive example of the concepts we’ve 
been discussing. One expects the torque due to gravity to cause 
the top to flop down. Instead, the top remains spinning in the hor- 
izontal plane, but its axis of rotation starts moving in the direction 
shown by the shaded arrow. This phenomenon is called preces- 
sion. Figure i shows that the torque due to gravity is out of the 
page. (Actually we should add up all the torques on all the atoms 
in the top, but the qualitative result is the same.) Since torque 
is the rate of change of angular momentum, t = dL/dt, the AL 
vector must be in the same direction as the torque (division by 
a positive scalar doesn’t change the direction of the vector). As 
shown in j, this causes the angular momentum vector to twist in 
space without changing its magnitude. 


For similar reasons, the Earth’s axis precesses once every 26,000 
years (although not through a great circle, since the angle between 
the axis and the force isn’t 90 degrees as in figure h). This pre- 
cession is due to a torque exerted by the moon. If the Earth was 
a perfect sphere, there could be no precession effect due to sym- 
metry. However, the Earth’s own rotation causes it to be slightly 
flattened (oblate) relative to a perfect sphere, giving it “love han- 
dles” on which the moon’s gravity can act. The moon’s gravity on 
the nearer side of the equatorial bulge is stronger, so the torques do 
not cancel out perfectly. Presently the earth’s axis very nearly lines 
up with the star Polaris, but in 12,000 years, the pole star will be 
Vega instead. 


The frisbee example 26 

The flow of the air over a flying frisbee generates lift, and the lift 
at the front and back of the frisbee isn’t necessarily balanced. If 
you throw a frisbee without rotating it, as if you were shooting a 
basketball with two hands, you'll find that it pitches, i.e., its nose 
goes either up or down. When | do this with my frisbee, it goes 
nose down, which apparently means that the lift at the back of 
the disc is greater than the lift at the front. The two torques are 
unbalanced, resulting in a total torque that points to the left. 


The way you actually throw a frisbee is with one hand, putting a 
lot of spin on it. If you throw backhand, which is how most peo- 
ple first learn to do it, the angular momentum vector points down 
(assuming you're right-handed). On my frisbee, the aerodynamic 


=) 


h/A top is supported at its 
tip by a pinhead. (More practical 
devices to demonstrate this 
would use a double bearing.) 


. 
| T  t=rxF 


F 


i/The torque made by grav- 
ity is in the horizontal plane. 


L 
4 2. 
Ha 


j/The AL vector is in the 
same direction as the torque, out 
of the page. 
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torque to the left would therefore tend to make the angular mo- 
mentum vector precess in the clockwise direction as seen by the 
thrower. This would cause the disc to roll to the right, and there- 
fore follow a curved trajectory. Some specialized discs, used in 
the sport of disc golf, are actually designed intentionally to show 
this behavior; they’re known as “understable” discs. However, the 
typical frisbee that most people play with is designed to be stable: 
as the disc rolls to one side, the airflow around it is altered in way 
that tends to bring the disc back into level flight. Such a disc will 
therefore tend to fly in a straight line, provided that it is thrown 
with enough angular momentum. 


r 4 5 0 Finding a cross product by components example 27 
> What is the torque produced by a force given by X + 2y + 32 (in 
units of Newtons) acting on a point whose radius vector is 4X + 2y 
k / Example 27. (in meters)? 


> It’s helpful to make a table of the components as shown in the 
figure. The results are 

Ty = lz Fy = For, =—12 N-m 

Tz =IyFy — Fyty = 3N-m 


Torque and angular momentum example 28 
In this example, we prove explicitly the consistency of the equa- 
tions involving torque and angular momentum that we proved 
above based purely on symmetry. Starting from the definition of 
torque, we have 


aa 
~ dt 


d 
= peti x Pi 
i 


= e c(h x pj) 


The derivative of a cross product can be evaluated in the same 
way as the derivative of an ordinary scalar product: 


Stl (Mi ep) 4 (rx OP 
ro [(Gex Pi) + (nx Fe 
The first term is zero for each particle, since the velocity vector is 


parallel to the momentum vector. The derivative appearing in the 
second term is the force acting on the particle, so 


t= Sor x F; ; 
i 


which is the relationship we set out to prove. 
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4.3.3 Rigid-body dynamics in three dimensions 


The student who is not madly in love with mathematics may 
wish to skip the rest of this section after absorbing the statement 
that, for a typical, asymmetric object, the angular momentum vector 
and the angular velocity vector need not be parallel. That is, only 
for a body that possesses symmetry about the rotation axis is it true 
that L = Iw (the rotational equivalent of p = mv) for some scalar 
L, 


Let’s evaluate the angular momentum of a rigidly rotating sys- 
tem of particles: 
L= y= rj X Di 
i 


= ) Miki X Vi 
a 

= ) Mir; X (w X V4) 
a 


An important mathematical skill is to know when to give up and 
back off. This is a complicated expression, and there is no reason 
to expect it to simplify and, for example, take the form of a scalar 
multiplied by w. Instead we examine its general characteristics. If 
we expanded it using the equation that gives the components of a 
vector cross product, every term would have one of the w compo- 
nents raised to the first power, multiplied by a bunch of other stuff. 
The most general possible form for the result is 


Ly = Ingwye + IpyWwy + IpzWz 
Ly = Lyawy + Lyywy + Lyzwz 
Ly = IzgWy + LayWly + LygWz ’ 


which you may recognize as a case of matrix multiplication. The 
moment of inertia is not a scalar, and not a three-component vector. 
It is a matrix specified by nine numbers, called its matrix elements. 


The elements of the moment of inertia matrix will depend on our 
choice of a coordinate system. In general, there will be some special 
coordinate system, in which the matrix has a simple diagonal form: 


The three special axes that cause this simplification are called 
the principal axes of the object, and the corresponding coordinate 
system is the principal axis system. For symmetric shapes such as 
a rectangular box or an ellipsoid, the principal axes lie along the 
intersections of the three symmetry planes, but even an asymmetric 
body has principal axes. 


Section 4.3. Angular Momentum In Three Dimensions 


283 


284 


We can also generalize the plane-rotation equation K = (1/2)Iw? 
to three dimensions as follows: 


1 
i 
1 
= 5 OL malw x rj): (w x r;) 
i 
We want an equation involving the moment of inertia, and this has 
some evident similarities to the sum we originally wrote down for 
the moment of inertia. To massage it into the right shape, we need 


the vector identity (A x B)-C = (Bx C)-A, which we state without 
proof. We then write 


K = 5 Simin x @xr))-w 


==xW: Miy xX (w x ir) 


I 
| 
m 
€ 


As a reward for all this hard work, let’s analyze the problem of 
the spinning shoe that I posed at the beginning of the chapter. The 
three rotation axes referred to there are approximately the principal 
axes of the shoe. While the shoe is in the air, no external torques are 
acting on it, so its angular momentum vector must be constant in 
magnitude and direction. Its kinetic energy is also constant. That’s 
in the room’s frame of reference, however. The principal axis frame 
is attached to the shoe, and tumbles madly along with it. In the 
principal axis frame, the kinetic energy and the magnitude of the 
angular momentum stay constant, but the actual direction of the 
angular momentum need not stay fixed (as you saw in the case 
of rotation that was initially about the intermediate-length axis). 
Constant |L| gives 


ie Le + L? = constant 


In the principal axis frame, it’s easy to solve for the components 
of w in terms of the components of L, so we eliminate w from the 
expression 2K = L- w, giving 
1 


[2 4 7 L? = constant #2. 
RZ 


The first equation is the equation of a sphere in the three di- 
mensional space occupied by the angular momentum vector, while 
the second one is the equation of an ellipsoid. The top figure cor- 
responds to the case of rotation about the shortest axis, which has 
the greatest moment of inertia element. The intersection of the two 
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surfaces consists only of the two points at the front and back of the 
sphere. The angular momentum is confined to one of these points, 
and can’t change its direction, i.e., its orientation with respect to the 
principal axis system, which is another way of saying that the shoe 
can’t change its orientation with respect to the angular momentum 
vector. In the bottom figure, the shoe is rotating about the longest 
axis. Now the angular momentum vector is trapped at one of the 
two points on the right or left. In the case of rotation about the 
axis with the intermediate moment of inertia element, however, the 
intersection of the sphere and the ellipsoid is not just a pair of iso- 
lated points but the curve shown with the dashed line. The relative 
orientation of the shoe and the angular momentum vector can and 
will change. 


One application of the moment of inertia tensor is to video games 
that simulate car racing or flying airplanes. 


One more exotic example has to do with nuclear physics. Al- 
though you have probably visualized atomic nuclei as nothing more 
than featureless points, or perhaps tiny spheres, they are often el- 
lipsoids with one long axis and two shorter, equal ones. Although 
a spinning nucleus normally gets rid of its angular momentum via 
gamma ray emission within a period of time on the order of picosec- 
onds, it may happen that a deformed nucleus gets into a state in 
which has a large angular momentum is along its long axis, which 
is a very stable mode of rotation. Such states can live for seconds 
or even years! (There is more to the story — this is the topic on 
which I wrote my Ph.D. thesis — but the basic insight applies even 
though the full treatment requires fancy quantum mechanics. ) 


Our analysis has so far assumed that the kinetic energy of ro- 
tation energy can’t be converted into other forms of energy such as 
heat, sound, or vibration. When this assumption fails, then rota- 
tion about the axis of least moment of inertia becomes unstable, 
and will eventually convert itself into rotation about the axis whose 
moment of inertia is greatest. This happened to the U.S.’s first ar- 
tificial satellite, Explorer I, launched in 1958. Note the long, floppy 
antennas, which tended to dissipate kinetic energy into vibration. It 
had been designed to spin about its minimimum-moment-of-inertia 
axis, but almost immediately, as soon as it was in space, it began 
spinning end over end. It was nevertheless able to carry out its 
science mission, which didn’t depend on being able to maintain a 
stable orientation, and it discovered the Van Allen radiation belts. 


This chapter is summarized on page 894. Notation and terminology 
are tabulated on pages 885-886. 


| / Visualizing surfaces of 
constant energy and angular 
momentum in L,-Ly-Lz space. 


satellite. 


m/ The 


Explorer | 
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Problem 6. 
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Problem 8. 
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Problems 


The symbols v, “, etc. are explained on page 293. 


1 The figure shows scale drawing of a pair of pliers being 
used to crack a nut, with an appropriately reduced centimeter grid. 
Warning: do not attempt this at home; it is bad manners. If the 
force required to crack the nut is 300 N, estimate the force required 
of the person’s hand. > Solution, p. 880 -— 


2 You are trying to loosen a stuck bolt on your RV using a big 
wrench that is 50 cm long. If you hang from the wrench, and your 
mass is 55 kg, what is the maximum torque you can exert on the 
bolt? Vv 


3 A physical therapist wants her patient to rehabilitate his in- 
jured elbow by laying his arm flat on a table, and then lifting a 2.1 
kg mass by bending his elbow. In this situation, the weight is 33 
cm from his elbow. He calls her back, complaining that it hurts him 
to grasp the weight. He asks if he can strap a bigger weight onto 
his arm, only 17 cm from his elbow. How much mass should she 
tell him to use so that he will be exerting the same torque? (He is 
raising his forearm itself, as well as the weight.) vo 


4 An object thrown straight up in the air is momentarily at rest 
when it reaches the top of its motion. Does that mean that it is in 
equilibrium at that point? Explain. gE 


5 An object is observed to have constant angular momentum. 
Can you conclude that no torques are acting on it? Explain. [Based 
on a problem by Serway and Faughn.] a 


6 A person of mass m stands on the ball of one foot. Find the 
tension in the calf muscle and the force exerted by the shinbones 
on the bones of the foot, in terms of m, g, a, and b. For simplicity, 
assume that all the forces are at 90-degree angles to the foot, i.e., 
neglect the angle between the foot and the floor. vo 


7 Two pointlike particles have the same momentum vector. Can 
you conclude that their angular momenta are the same? Explain. 
[Based on a problem by Serway and Faughn.] z 


8 The box shown in the figure is being accelerated by pulling on 
it with the rope. 
(a) Assume the floor is frictionless. What is the maximum force 
that can be applied without causing the box to tip over? 

> Hint, p. 864 V 
(b) Repeat part a, but now let the coefficient of friction be w. Vv 
(c) What happens to your answer to part b when the box is sufh- 
ciently tall? How do you interpret this? E 
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9 A uniform ladder of mass m and length @ leans against a smooth 
wall, making an angle 6 with respect to the ground. The dirt exerts 
a normal force and a frictional force on the ladder, producing a force 
vector with magnitude F) at an angle ¢ with respect to the ground. 
Since the wall is smooth, it exerts only a normal force on the ladder; 
let its magnitude be Fo. 

(a) Explain why ¢ must be greater than 6. No math is needed. 

(b) Choose any numerical values you like for m and @, and show that 
the ladder can be in equilibrium (zero torque and zero total force 
vector) for 6=45.00°and ¢=63.438°. E 


10 Continuing problem 9, find an equation for ¢ in terms of 
0, and show that m and L do not enter into the equation. Do not 
assume any numerical values for any of the variables. You will need 
the trig identity sin(a — b) = sinacosb—sinbcosa. (As a numerical 
check on your result, you may wish to check that the angles given 
in problem 9b satisfy your equation.) Vv oe 


11 (a) Find the minimum horizontal force which, applied at 
the axle, will pull a wheel over a step. Invent algebra symbols for 
whatever quantities you find to be relevant, and give your answer 
in symbolic form. 

(b) Under what circumstances does your result become infinite? 
Give a physical interpretation. What happens to your answer when 
the height of the curb is zero? Does this make sense? 

> Hint, p. 864 & 


12 A ball is connected by a string to a vertical post. The ball is 
set in horizontal motion so that it starts winding the string around 
the post. Assume that the motion is confined to a horizontal plane, 
i.e., ignore gravity. Michelle and Astrid are trying to predict the 
final velocity of the ball when it reaches the post. Michelle says 
that according to conservation of angular momentum, the ball has 
to speed up as it approaches the post. Astrid says that according to 
conservation of energy, the ball has to keep a constant speed. Who 
is right? [Hint: How is this different from the case where you whirl 
a rock in a circle on a string and gradually reel in the string?] + 


13 In the 1950’s, serious articles began appearing in magazines 
like Life predicting that world domination would be achieved by the 
nation that could put nuclear bombs in orbiting space stations, from 
which they could be dropped at will. In fact it can be quite difficult 
to get an orbiting object to come down. Let the object have energy 
E& = K+U and angular momentum L. Assume that the energy 
is negative, i.e., the object is moving at less than escape velocity. 
Show that it can never reach a radius less than 


GMm 2E LE? 
tmin = om \ IT VET Gants 


[Note that both factors are negative, giving a positive result.] © 


Problems 9 and 10. 


Problem 11. 


Problems 
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Problem 14. 


Problem 15. 


Problem 16. 


Problem 17. 


14 (a) The bar of mass m is attached at the wall with a hinge, 
and is supported on the right by a massless cable. Find the tension, 
T, in the cable in terms of the angle @. Vv 
(b) Interpreting your answer to part a, what would be the best angle 
to use if we wanted to minimize the strain on the cable? 

(c) Again interpreting your answer to part a, for what angles does 
the result misbehave mathematically? Interpet this physically. © 


15 (a) The two identical rods are attached to one another with 
a hinge, and are supported by the two massless cables. Find the 
angle q@ in terms of the angle @, and show that the result is a purely 
geometric one, independent of the other variables involved. Vv 
(b) Using your answer to part a, sketch the configurations for 3 —> 0, 
8 = 45°, and 8 = 90°. Do your results make sense intuitively? © 


16 = Two bars of length @ are connected with a hinge and placed on 
a frictionless cylinder of radius r. (a) Show that the angle 6 shown 
in the figure is related to the unitless ratio r/¢ by the equation 


(b) Discuss the physical behavior of this equation for very large and 
very small values of r/@. = 


17 You wish to determine the mass of a ship in a bottle without 
taking it out. Show that this can be done with the setup shown in 
the figure, with a scale supporting the bottle at one end, provided 
that it is possible to take readings with the ship slid to several 
different locations. Note that you can’t determine the position of 
the ship’s center of mass just by looking at it, and likewise for the 
bottle. In particular, you can’t just say, “position the ship right 
on top of the fulcrum” or “position it right on top of the balance.” 

lies) 


18 Suppose that we lived in a universe in which Newton’s law of 
gravity gave an interaction energy proportional to r~®, rather than 
r—!. Which, if any, of Kepler’s laws would still be true? Which 
would be completely false? Which would be different, but in a way 
that could be calculated with straightforward algebra? x 
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19 Use analogies to find the equivalents of the following equations 
for rotation in a plane: 


KE =p? /2m 
Ag = v,At + (1/2)aAt? 
W =FAsz 


Example: v = Az/At > w = Ad/At 


20 For a one-dimensional harmonic oscillator, the solution to 
the energy conservation equation, 


1 1 
U+K= ake + sme = constant : 


is an oscillation with frequency w = \/k/m. 


Now consider an analogous system consisting of a bar magnet hung 
from a thread, which acts like a magnetic compass. A normal com- 
pass is full of water, so its oscillations are strongly damped, but 
the magnet-on-a-thread compass has very little friction, and will os- 
cillate repeatedly around its equilibrium direction. The magnetic 
energy of the bar magnet is 


U = —Bmcos@ , 


where B is a constant that measures the strength of the earth’s 
magnetic field, m is a constant that parametrizes the strength of 
the magnet, and @ is the angle, measured in radians, between the 
bar magnet and magnetic north. The equilibrium occurs at 6 = 0, 
which is the minimum of U. 


(a) Problem 19 on p. 289 gave some examples of how to construct 
analogies between rotational and linear motion. Using the same 
technique, translate the equation defining the linear quantity k to 
one that defines an analogous angular one « (Greek letter kappa). 
Applying this to the present example, find an expression for k. (As- 
sume the thread is so thin that its stiffness does not have any sig- 
nificant effect compared to earth’s magnetic field.) 


(b) Find the frequency of the compass’s vibrations. le 
21 (a) Find the angular velocities of the earth’s rotation and of 
the earth’s motion around the sun. Vv 


(b) Which motion involves the greater acceleration? 
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Problem 22. 
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Goode 


Problem 23 


22 The sun turns on its axis once every 26.0 days. Its mass 
is 2.0 x 10° kg and its radius is 7.0 x 10° m. Assume it is a rigid 
sphere of uniform density. 

(a) What is the sun’s angular momentum? Vv 
In a few billion years, astrophysicists predict that the sun will use 
up all its sources of nuclear energy, and will collapse into a ball of 
exotic, dense matter known as a white dwarf. Assume that its radius 
becomes 5.8 x 10° m (similar to the size of the Earth.) Assume it 
does not lose any mass between now and then. (Don’t be fooled 
by the photo, which makes it look like nearly all of the star was 
thrown off by the explosion. The visually prominent gas cloud is 
actually thinner than the best laboratory vacuum ever produced on 
earth. Certainly a little bit of mass is actually lost, but it is not at 
all unreasonable to make an approximation of zero loss of mass as 
we are doing.) 

(b) What will its angular momentum be? 

(c) How long will it take to turn once on its axis? v 


~~ 


23 Give a numerical comparison of the two molecules’ moments 
of inertia for rotation in the plane of the page about their centers 
of mass. g 


24 A yo-yo of total mass m consists of two solid cylinders 
of radius R, connected by a small spindle of negligible mass and 
radius r. The top of the string is held motionless while the string 
unrolls from the spindle. Show that the acceleration of the yo-yo is 
g/(1+ R?/2r?). = 


25 Show that a sphere of radius R that is rolling without slipping 
has angular momentum and momentum in the ratio L/p = (2/5)R. 


~ 


26 Suppose a bowling ball is initially thrown so that it has no 
angular momentum at all, i.e., it is initially just sliding down the 
lane. Eventually kinetic friction will bring its angular velocity up to 
the point where it is rolling without slipping. Show that the final 
velocity of the ball equals 5/7 of its initial velocity. You’ll need the 


result of problem 25. = 
27 Find the angular momentum of a particle whose position is 
r = 3x—y-+z (in meters) and whose momentum is p = —2k+y+z 
(in kg-m/s). v 
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28 Find a vector that is perpendicular to both of the following 
two vectors: 


K+ 2H 4 34 
A& + BY + 62 


29 Prove property (3) of the vector cross product from the 
theorem on page 857. a 


30 Prove the anticommutative property of the vector cross prod- 
uct, A x B = —Bx A, using the expressions for the components of 
the cross product. 


31 Find three vectors with which you can demonstrate that the 
vector cross product need not be associative, i.e., that A x (B x C) 
need not be the same as (A x B) x C. a 


32 Which of the following expressions make sense, and which are 
nonsense? For those that make sense, indicate whether the result is 
a vector or a scalar. 

(a) (Ax B)x C 

(b) (A x B)-C 

(c) (A-B)xC 


33 (a) As suggested in the figure, find the area of the infinites- 
imal region expressed in polar coordinates as lying between r and 
r+dr and between 6 and 6+ dé. v 
(b) Generalize this to find the infinitesimal element of volume in 
cylindrical coordinates (r,0,z), where the Cartesian z axis is per- 
pendicular to the directions measured by r and 6. Vv 
(c) Find the moment of inertia for rotation about its axis of a cone 


whose mass is M, whose height is h, and whose base has a radius 
b. Vv Oo 


34 Find the moment of inertia of a solid rectangular box of mass 
M and uniform density, whose sides are of length a, b, and c, for 
rotation about an axis through its center parallel to the edges of 
length a. Vv Oo 


Problem 33 
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35 The nucleus '°°Er (erbium-168) contains 68 protons (which 
is what makes it a nucleus of the element erbium) and 100 neutrons. 
It has an ellipsoidal shape like an American football, with one long 
axis and two short axes that are of equal diameter. Because this 
is a subatomic system, consisting of only 168 particles, its behavior 
shows some clear quantum-mechanical properties. It can only have 
certain energy levels, and it makes quantum leaps between these 
levels. Also, its angular momentum can only have certain values, 
which are all multiples of 2.109 x 1074 kg. m?/s. The table shows 
some of the observed angular momenta and energies of '®8Er, in SI 
units (kg - m?/s and joules). 

ENO: EO 

0 0 

2.109 1.2786 

4.218 4.2311 

6.327 8.7919 

8.437 14.8731 

10.546 22.3798 

12.655 31.135 

14.764 41.206 

16.873 592.223 
(a) These data can be described to a good approximation as a rigid 
end-over-end rotation. Estimate a single best-fit value for the mo- 
ment of inertia from the data, and check how well the data agree 
with the assumption of rigid-body rotation. > Hint, p. 864 V 
(b) Check whether this moment of inertia is on the right order of 
magnitude. The moment of inertia depends on both the size and 
the shape of the nucleus. For the sake of this rough check, ignore 
the fact that the nucleus is not quite spherical. To estimate its size, 
use the fact that a neutron or proton has a volume of about 1 fm? 
(one cubic femtometer, where 1 fm = 107! m), and assume they 
are closely packed in the nucleus. x 


36 (a) Prove the identity a x (b x c) = b(a-c) — c(a- b) 
by expanding the product in terms of its components. Note that 
because the x, y, and z components are treated symmetrically in 
the definitions of the vector cross product, it is only necessary to 
carry out the proof for the x component of the result. 

(b) Applying this to the angular momentum of a rigidly rotating 
body, L = fr x (w x r) dm, show that the diagonal elements of the 
moment of inertia tensor can be expressed as, e.g., Inx = [ (y? + 
2*\dm. 

(c) Find the diagonal elements of the moment of inertia matrix of 
an ellipsoid with axes of lengths a, 6, and c, in the principal-axis 
frame, and with the axis at the center. Vv 


37 In example 22 on page 274, prove that if the rod is sufficiently 
thin, it can be toppled without scraping on the floor. 
> Solution, p. 880 i 
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38 Suppose an object has mass m, and moment of inertia J, 
for rotation about some axis A passing through its center of mass. 
Prove that for an axis B, parallel to A and lying at a distance h 
from it, the object’s moment of inertia is given by J, + mh?. This 
is known as the parallel axis theorem. = 


39 Let two sides of a triangle be given by the vectors A and 
B, with their tails at the origin, and let mass m be uniformly dis- 
tributed on the interior of the triangle. (a) Show that the distance 
of the triangle’s center of mass from the intersection of sides A and 
B is given by 4|A +B]. 

(b) Consider the quadrilateral with mass 2m, and vertices at the 
origin, A, B, and A+B. Show that its moment of inertia, for 
rotation about an axis perpendicular to it and passing through its 
center of mass, is (A? + B?). 

(c) Show that the moment of inertia for rotation about an axis per- 
pendicular to the plane of the original triangle, and passing through 
its center of mass, is 44(A?+B?—A-B). Hint: Combine the results 
of parts a and b with the result of problem 38. = 


40 When we talk about rigid-body rotation, the concept of 
a perfectly rigid body can only be an idealization. In reality, any 
object will compress, expand, or deform to some extent when sub- 
jected to the strain of rotation. However, if we let it settle down for 
a while, perhaps it will reach a new equilibrium. As an example, 
suppose we fill a centrifuge tube with some compressible substance 
like shaving cream or Wonder Bread. We can model the contents of 
the tube as a one-dimensional line of mass, extending from r = 0 to 
r =f. Once the rotation starts, we expect that the contents will be 
most compressed near the “floor” of the tube at r = @; this is both 
because the inward force required for circular motion increases with 
r for a fixed w, and because the part at the floor has the greatest 
amount of material pressing “down” (actually outward) on it. The 
linear density dm/dr, in units of kg/m, should therefore increase as 
a function of r. Suppose that we have dm/dr = pe", where pu is a 
constant. Find the moment of inertia. vio 


Key to symbols: 
easy . typical challenging Mdifficult M& very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 
Exercise 4A: Torque 
Equipment: 

e rulers with holes in them 


e spring scales (two per group) 


eee mm — 
° —— aoute 


a a A 


While one person holds the pencil which forms the axle for the ruler, the other members of the 
group pull on the scale and take readings. In each case, calculate the total torque on the ruler, 
and find out whether it equals zero to roughly within the accuracy of the experiment. 


e)} 
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Chapter 5 
Thermodynamics 


S=klogW 


Inscription on the tomb of Ludwig Boltzmann, 1844-1906. 
Boltzmann, who originated the microscopic theory of thermo- 
dynamics, was driven to suicide by the criticism of his peers, 
who thought that physical theories shouldn’t discuss purely hy- 
pothetical objects like atoms. 


In a developing country like China, a refrigerator is the mark of 
a family that has arrived in the middle class, and a car is the ulti- 
mate symbol of wealth. Both of these are heat engines: devices for 
converting between heat and other forms of energy. Unfortunately 
for the Chinese, neither is a very efficient device. Burning fossil fuels 
has made China’s big cities the most polluted on the planet, and 
the country’s total energy supply isn’t sufficient to support Amer- 
ican levels of energy consumption by more than a small fraction 
of China’s population. Could we somehow manipulate energy in a 
more efficient way? 


Conservation of energy is a statement that the total amount of 
energy is constant at all times, which encourages us to believe that 
any energy transformation can be undone — indeed, the laws of 
physics you’ve learned so far don’t even distinguish the past from 
the future. If you get in a car and drive around the block, the 
net effect is to consume some of the energy you paid for at the 
gas station, using it to heat the neighborhood. There would not 
seem to be any fundamental physical principle to prevent you from 
recapturing all that heat and using it again the next time you want 
to go for a drive. More modestly, why don’t engineers design a car 
engine so that it recaptures the heat energy that would otherwise 
be wasted via the radiator and the exhaust? 


Hard experience, however, has shown that designers of more and 
more efficient engines run into a brick wall at a certain point. The 
generators that the electric company uses to produce energy at an 
oil-fueled plant are indeed much more efficient than a car engine, but 
even if one is willing to accept a device that is very large, expensive, 
and complex, it turns out to be impossible to make a perfectly effi- 
cient heat engine — not just impossible with present-day technology, 
but impossible due to a set of fundamental physical principles known 
as the science of thermodynamics. And thermodynamics isn’t just a 
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pesky set of constraints on heat engines. Without thermodynamics, 
there is no way to explain the direction of time’s arrow — why we 
can remember the past but not the future, and why it’s easier to 
break Humpty Dumpty than to put him back together again. 


5.1 Pressure and Temperature 


When we heat an object, we speed up the mind-bogglingly complex 
random motion of its molecules. One method for taming complexity 
is the conservation laws, since they tell us that certain things must 
remain constant regardless of what process is going on. Indeed, 
the law of conservation of energy is also known as the first law of 
thermodynamics. 


But as alluded to in the introduction to this chapter, conserva- 
tion of energy by itself is not powerful enough to explain certain 
empirical facts about heat. A second way to sidestep the complex- 
ity of heat is to ignore heat’s atomic nature and concentrate on 
quantities like temperature and pressure that tell us about a sys- 
tem’s properties as a whole. This approach is called macroscopic in 
contrast to the microscopic method of attack. Pressure and temper- 
ature were fairly well understood in the age of Newton and Galileo, 
hundreds of years before there was any firm evidence that atoms 
and molecules even existed. 


Unlike the conserved quantities such as mass, energy, momen- 
tum, and angular momentum, neither pressure nor temperature is 
additive. Two cups of coffee have twice the heat energy of a single 
cup, but they do not have twice the temperature. Likewise, the 
painful pressure on your eardrums at the bottom of a pool is not 
affected if you insert or remove a partition between the two halves 
of the pool. 


We restrict ourselves to a discussion of pressure in fluids at rest 
and in equilibrium. In physics, the term “fluid” is used to mean 
either a gas or a liquid. The important feature of a fluid can be 
demonstrated by comparing with a cube of jello on a plate. The 
jello is a solid. If you shake the plate from side to side, the jello will 
respond by shearing, i.e., by slanting its sides, but it will tend to 
spring back into its original shape. A solid can sustain shear forces, 
but a fluid cannot. A fluid does not resist a change in shape unless 
it involves a change in volume. 


5.1.1 Pressure 


If you’re at the bottom of a pool, you can’t relieve the pain in 
your ears by turning your head. The water’s force on your eardrum 
is always the same, and is always perpendicular to the surface where 
the eardrum contacts the water. If your ear is on the east side of 
your head, the water’s force is to the west. If you keep your ear 
in the same spot while turning around so your ear is on the north, 
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the force will still be the same in magnitude, and it will change 
its direction so that it is still perpendicular to the eardrum: south. 
This shows that pressure has no direction in space, i.e., it is a scalar. 
The direction of the force is determined by the orientation of the 
surface on which the pressure acts, not by the pressure itself. A 
fluid flowing over a surface can also exert frictional forces, which 
are parallel to the surface, but the present discussion is restricted 
to fluids at rest. 


Experiments also show that a fluid’s force on a surface is pro- 
portional to the surface area. The vast force of the water behind 
a dam, for example, in proportion to the dam’s great surface area. 
(The bottom of the dam experiences a higher proportion of its force.) 


Based on these experimental results, it appears that the useful 
way to define pressure is as follows. The pressure of a fluid at a 
given point is defined as F'| /A, where A is the area of a small surface 
inserted in the fluid at that point, and F', is the component of the 
fluid’s force on the surface which is perpendicular to the surface. (In 
the case of a moving fluid, fluid friction forces can act parallel to 
the surface, but we’re only dealing with stationary fluids, so there 
is only an F’.) 


This is essentially how a pressure gauge works. The reason that 
the surface must be small is so that there will not be any significant 
different in pressure between one part of it and another part. The SI 
units of pressure are evidently N/m?, and this combination can be 
abbreviated as the pascal, 1 Pa=1 N/m?. The pascal turns out to 
be an inconveniently small unit, so car tires, for example, normally 
have pressures imprinted on them in units of kilopascals. 


Pressure in U.S. units example 1 
In U.S. units, the unit of force is the pound, and the unit of distance 
is the inch. The unit of pressure is therefore pounds per square 
inch, or p.s.i. (Note that the pound is not a unit of mass.) 


Atmospheric pressure in U.S. and metric units example 2 
> A figure that many people in the U.S. remember is that atmo- 
spheric pressure is about 15 pounds per square inch. What is 
this in metric units? Solution: 


(15 Ib) /(1 in?) = 1.0 x 10° N/m? 
= 100 kPa 


Only pressure differences are normally significant. 


If you spend enough time on an airplane, the pain in your ears 
subsides. This is because your body has gradually been able to ad- 
mit more air into the cavity behind the eardrum. Once the pressure 
inside is equalized with the pressure outside, the inward and out- 
ward forces on your eardrums cancel out, and there is no physical 
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a/A_ simple pressure gauge 
consists of a cylinder open at one 
end, with a piston and a spring 
inside. The depth to which the 
spring is depressed is a measure 
of the pressure. To determine the 
absolute pressure, the air needs 
to be pumped out of the interior of 
the gauge, so that there is no air 
pressure acting outward on the 
piston. In many practical gauges, 
the back of the piston is open to 
the atmosphere, so the pressure 
the gauge registers equals the 
pressure of the fluid minus the 
pressure of the atmosphere. 
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sensation to tell you that anything unusual is going on. For this 
reason, it is normally only pressure differences that have any phys- 
ical significance. Thus deep-sea fish are perfectly healthy in their 
habitat because their bodies have enough internal pressure to cancel 
the pressure from the water in which they live; if they are caught in 
a net and brought to the surface rapidly, they explode because their 
internal pressure is so much greater than the low pressure outside. 


Getting killed by a pool pump example 3 
> My house has a pool, which | maintain myself. A pool always 
needs to have its water circulated through a filter for several hours 
a day in order to keep it clean. The filter is a large barrel with a 
strong clamp that holds the top and bottom halves together. My 
filter has a prominent warning label that warns me not to try to 
open the clamps while the pump is on, and it shows a cartoon 
of a person being struck by the top half of the pump. The cross- 
sectional area of the filter barrel is 0.25 m?. Like most pressure 
gauges, the one on my pool pump actually reads the difference in 
pressure between the pressure inside the pump and atmospheric 
pressure. The gauge reads 90 kPa. What is the force that is 
trying to pop open the filter? 


> If the gauge told us the absolute pressure of the water inside, 
we'd have to find the force of the water pushing outward and the 
force of the air pushing inward, and subtract in order to find the 
total force. Since air surrounds us all the time, we would have to 
do such a subtraction every time we wanted to calculate anything 
useful based on the gauge’s reading. The manufacturers of the 
gauge decided to save us from all this work by making it read the 
difference in pressure between inside and outside, so all we have 
to do is multiply the gauge reading by the cross-sectional area of 
the filter: 


F=PA 
= (90 x 10° N/m?)(0.25 m?) 
= 22000 N 

That’s a lot of force! 


The word “suction” and other related words contain a hidden 
misunderstanding related to this point about pressure differences. 
When you suck water up through a straw, there is nothing in your 
mouth that is attracting the water upward. The force that lifts the 
water is from the pressure of the water in the cup. By creating a 
partial vacuum in your mouth, you decreased the air’s downward 
force on the water so that it no longer exactly canceled the upward 
force. 


Variation of pressure with depth 


The pressure within a fluid in equilibrium can only depend on 
depth, due to gravity. If the pressure could vary from side to side, 
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then a piece of the fluid in between, b, would be subject to unequal 
forces from the parts of the fluid on its two sides. Since fluids do not 
exhibit shear forces, there would be no other force that could keep 
this piece of fluid from accelerating. This contradicts the assumption 
that the fluid was in equilibrium. 


self-check A 


How does this proof fail for solids? > Answer, p. 868 


To find the variation with depth, we consider the vertical forces 
acting on a tiny, imaginary cube of the fluid having infinitesimal 
height dy and areas dA on the top and bottom. Using positive 
numbers for upward forces, we have 


Prottomd A — Propd A _ Fy = 0 


The weight of the fluid is Fy = mg = pVg = pdAdy g, where p is 
the density of the fluid, so the difference in pressure is 


dP = —pgdy [variation in pressure with depth for 


a fluid of density p in equilibrium; 


positive y is up.| 


A more elegant way of writing this is in terms of a dot product, 
dP = pg- dy, which automatically takes care of the plus or minus 
sign, depending on the relative directions of the g and dy vectors, 
and avoids any requirements about the coordinate system. 


The factor of p explains why we notice the difference in pressure 
when diving 3 m down in a pool, but not when going down 3 m of 
stairs. The equation only tells us the difference in pressure, not the 
absolute pressure. The pressure at the surface of a swimming pool 
equals the atmospheric pressure, not zero, even though the depth is 
zero at the surface. The blood in your body does not even have an 
upper surface. 


In cases where g and p are independent of depth, we can inte- 
grate both sides of the equation to get everything in terms of finite 
differences rather than differentials: AP = —pgAy. 


self-check B 

In which of the following situations is the equation AP = —pgAy valid? 
Why? (1) difference in pressure between a tabletop and the feet (i.e., 
predicting the pressure of the feet on the floor) (2) difference in air pres- 
sure between the top and bottom of a tall building (3) difference in air 
pressure between the top and bottom of Mt. Everest (4) difference in 
pressure between the top of the earth’s mantle and the center of the 
earth (5) difference in pressure between the top and bottom of an air- 
plane’s wing > Answer, p. 
868 
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b/This doesn’t happen. If 
pressure could vary horizontally 
in equilibrium, the cube of water 
would accelerate horizontally. 
This is a contradiction, since 
we assumed the fluid was in 
equilibrium. 


ee 


c/The pressure is the same 
at all the points marked with dots. 


O 
| 


d/This does happen. The 
sum of the forces from the 
surrounding parts of the fluid is 
upward, canceling the downward 
force of gravity. 
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e/We have to wait for the 
thermometer to equilibrate its 
temperature with the temperature 
of lrene’s armpit. 


' Pressure of lava underneath a volcano example 4 
> A volcano has just finished erupting, and a pool of molten lava 
is lying at rest in the crater. The lava has come up through an 
opening inside the volcano that connects to the earth’s molten 
mantle. The density of the lava is 4.1 g/cm?. What is the pressure 
in the lava underneath the base of the volcano, 3000 m below the 
surface of the pool? 


> 


AP = pgAy 
= (4.1 g/cm?)(9.8 m/s2)(3000 m) 
= (4.1 x 10° g/m*)(9.8 m/s?)(3000 m) 
= (4.1 x 10° kg/m°)(9.8 m/s*)(3000 m) 
= 1.2 x 108 N/m? 
= 1.2 x 10° Pa 


This is the difference between the pressure we want to find and 
atmospheric pressure at the surface. The latter, however, is tiny 
compared to the AP we just calculated, so what we’ve found is 
essentially the pressure, P. 


‘Atmospheric pressure example 5 
Gases, unlike liquids, are quite compressible, and at a given tem- 
perature, the density of a gas is approximately proportional to the 
pressure. The proportionality constant is discussed on page 304, 
but for now let’s just call it k, p = kP. Using this fact, we can 
find the variation of atmospheric pressure with altitude, assuming 
constant temperature: 


dP = —pgdy 
dP = —kPgdy 
dP 

Pps —kg dy 


In P = —kgy + constant [integrating both sides] 
P = (constant)e-*9 [exponentiating both sides] 


Pressure falls off exponentially with height. There is no sharp 
cutoff to the atmosphere, but the exponential gets extremely small 
by the time you're ten or a hundred miles up. 


5.1.2 Temperature 
Thermal equilibrium 


We use the term temperature casually, but what is it exactly? 
Roughly speaking, temperature is a measure of how concentrated 
the heat energy is in an object. A large, massive object with very 
little heat energy in it has a low temperature. 
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But physics deals with operational definitions, i.e., definitions of 
how to measure the thing in question. How do we measure temper- 
ature? One common feature of all temperature-measuring devices 
is that they must be left for a while in contact with the thing whose 
temperature is being measured. When you take your temperature 
with a fever thermometer, you are waiting for the mercury inside to 
come up to the same temperature as your body. The thermometer 
actually tells you the temperature of its own working fluid (in this 
case the mercury). In general, the idea of temperature depends on 
the concept of thermal equilibrium. When you mix cold eggs from 
the refrigerator with flour that has been at room temperature, they 
rapidly reach a compromise temperature. What determines this 
compromise temperature is conservation of energy, and the amount 
of energy required to heat or cool each substance by one degree. 
But without even having constructed a temperature scale, we can 
see that the important point is the phenomenon of thermal equi- 
librium itself: two objects left in contact will approach the same 
temperature. We also assume that if object A is at the same tem- 
perature as object B, and B is at the same temperature as C, then 
A is at the same temperature as C. This statement is sometimes 
known as the zeroth law of thermodynamics, so called because after 
the first, second, and third laws had been developed, it was realized 
that there was another law that was even more fundamental. 


Thermal expansion 


The familiar mercury thermometer operates on the principle that 
the mercury, its working fluid, expands when heated and contracts 
when cooled. In general, all substances expand and contract with 
changes in temperature. The zeroth law of thermodynamics guar- 
antees that we can construct a comparative scale of temperatures 
that is independent of what type of thermometer we use. If a ther- 
mometer gives a certain reading when it’s in thermal equilibrium 
with object A, and also gives the same reading for object B, then 
A and B must be the same temperature, regardless of the details of 
how the thermometers works. 


What about constructing a temperature scale in which every 
degree represents an equal step in temperature? The Celsius scale 
has 0 as the freezing point of water and 100 as its boiling point. The 
hidden assumption behind all this is that since two points define a 
line, any two thermometers that agree at two points must agree at 
all other points. In reality if we calibrate a mercury thermometer 
and an alcohol thermometer in this way, we will find that a graph 
of one thermometer’s reading versus the other is not a perfectly 
straight y = x line. The subtle inconsistency becomes a drastic one 
when we try to extend the temperature scale through the points 
where mercury and alcohol boil or freeze. Gases, however, are much 
more consistent among themselves in their thermal expansion than 
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f/ Thermal equilibrium = can 
be prevented. Otters have a coat 
of fur that traps air bubbles for in- 
sulation. If a swimming otter was 
in thermal equilibrium with cold 
water, it would be dead. Heat is 
still conducted from the otter’s 
body to the water, but much 
more slowly than it would be in a 
warm-blooded animal that didn’t 
have this special adaptation. 


g/A hot air balloon is inflated. 
Because of thermal expansion, 
the hot air is less dense than 
the surrounding cold air, and 
therefore floats as the cold air 
drops underneath it and pushes it 
up out of the way. 
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open to 
standard 


pressure 
noble gas 


h/A_ simplified version of an 
ideal gas thermometer. The 
whole instrument is allowed to 
come into thermal equilibrium 
with the substance whose tem- 
perature is to be measured, and 
the mouth of the cylinder is left 
open to standard pressure. The 
volume of the noble gas gives an 
indication of temperature. 


-273 -200 -100 
temperature (C) 


i/The volume of 1 kg of neon 
gas as a function of temperature 
(at standard pressure). Although 
neon would actually condense 
into a liquid at some _ point, 
extrapolating the graph gives 
to zero volume gives the same 
temperature as for any other gas: 
absolute zero. 


solids or liquids, and the noble gases like helium and neon are more 
consistent with each other than gases in general. Continuing to 
search for consistency, we find that noble gases are more consistent 
with each other when their pressure is very low. 


As an idealization, we imagine a gas in which the atoms interact 
only with the sides of the container, not with each other. Such a 
gas is perfectly nonreactive (as the noble gases very nearly are), and 
never condenses to a liquid (as the noble gases do only at extremely 
low temperatures). Its atoms take up a negligible fraction of the 
available volume. Any gas can be made to behave very much like 
this if the pressure is extremely low, so that the atoms hardly ever 
encounter each other. Such a gas is called an ideal gas, and we define 
the Celsius scale in terms of the volume of the gas in a thermometer 
whose working substance is an ideal gas maintained at a fixed (very 
low) pressure, and which is calibrated at 0 and 100 degrees according 
to the melting and boiling points of water. The Celsius scale is not 
just a comparative scale but an additive one as well: every step in 
temperature is equal, and it makes sense to say that the difference 
in temperature between 18 and 28°C is the same as the difference 
between 48 and 58. 


Absolute zero and the kelvin scale 


We find that if we extrapolate a graph of volume versus tem- 
perature, the volume becomes zero at nearly the same temperature 
for all gases: -273°C. Real gases will all condense into liquids at 
some temperature above this, but an ideal gas would achieve zero 
volume at this temperature, known as absolute zero. The most use- 
ful temperature scale in scientific work is one whose zero is defined 
by absolute zero, rather than by some arbitrary standard like the 
melting point of water. The temperature scale used universally in 
scientific work, called the Kelvin scale, is the same as the Celsius 
scale, but shifted by 273 degrees to make its zero coincide with ab- 
solute zero. Scientists use the Celsius scale only for comparisons or 
when a change in temperature is all that is required for a calcula- 
tion. Only on the Kelvin scale does it make sense to discuss ratios 
of temperatures, e.g., to say that one temperature is twice as hot as 
another. 


Which temperature scale to use example 6 
> You open an astronomy book and encounter the equation 


(light emitted) = (constant) x T* 


for the light emitted by a star as a function of its surface tempera- 
ture. What temperature scale is implied? 


> The equation tells us that doubling the temperature results in 
the emission of 16 times as much light. Such a ratio only makes 
sense if the Kelvin scale is used. 
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Although we can achieve as good an approximation to an ideal 
gas as we wish by making the pressure very low, it seems never- 
theless that there should be some more fundamental way to define 
temperature. We will construct a more fundamental scale of tem- 
perature in section 5.4. 


5.2 Microscopic Description of An Ideal Gas 


5.2.1 Evidence for the kinetic theory 


Why does matter have the thermal properties it does? The basic 
answer must come from the fact that matter is made of atoms. How, 
then, do the atoms give rise to the bulk properties we observe? 
Gases, whose thermal properties are so simple, offer the best chance 
for us to construct a simple connection between the microscopic and 
macroscopic worlds. 


A crucial observation is that although solids and liquids are 
nearly incompressible, gases can be compressed, as when we in- 
crease the amount of air in a car’s tire while hardly increasing its 
volume at all. This makes us suspect that the atoms in a solid are 
packed shoulder to shoulder, while a gas is mostly vacuum, with 
large spaces between molecules. Most liquids and solids have den- 
sities about 1000 times greater than most gases, so evidently each 
molecule in a gas is separated from its nearest neighbors by a space 
something like 10 times the size of the molecules themselves. 


If gas molecules have nothing but empty space between them, 
why don’t the molecules in the room around you just fall to the 
floor? The only possible answer is that they are in rapid motion, 
continually rebounding from the walls, floor and ceiling. In section 
2.4 I have already given some of the evidence for the kinetic theory 
of heat, which states that heat is the kinetic energy of randomly 
moving molecules. This theory was proposed by Daniel Bernoulli 
in 1738, and met with considerable opposition because it seemed 
as though the molecules in a gas would eventually calm down and 
settle into a thin film on the floor. There was no precedent for this 
kind of perpetual motion. No rubber ball, however elastic, rebounds 
from a wall with exactly as much energy as it originally had, nor 
do we ever observe a collision between balls in which none of the 
kinetic energy at all is converted to heat and sound. The analogy is 
a false one, however. A rubber ball consists of atoms, and when it is 
heated in a collision, the heat is a form of motion of those atoms. An 
individual molecule, however, cannot possess heat. Likewise sound 
is a form of bulk motion of molecules, so colliding molecules in a gas 
cannot convert their kinetic energy to sound. Molecules can indeed 
induce vibrations such as sound waves when they strike the walls of a 
container, but the vibrations of the walls are just as likely to impart 
energy to a gas molecule as to take energy from it. Indeed, this kind 
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of exchange of energy is the mechanism by which the temperatures 
of the gas and its container become equilibrated. 


5.2.2 Pressure, volume, and temperature 


A gas exerts pressure on the walls of its container, and in the 
kinetic theory we interpret this apparently constant pressure as the 
averaged-out result of vast numbers of collisions occurring every 
second between the gas molecules and the walls. The empirical 
facts about gases can be summarized by the relation 


PV cond, [ideal gas] 


which really only holds exactly for an ideal gas. Here n is the number 
of molecules in the sample of gas. 


Volume related to temperature example 7 
The proportionality of volume to temperature at fixed pressure 
was the basis for our definition of temperature. 


Pressure related to temperature example 8 
Pressure is proportional to temperature when volume is held con- 
stant. An example is the increase in pressure in a car's tires when 
the car has been driven on the freeway for a while and the tires 
and air have become hot. 


We now connect these empirical facts to the kinetic theory of 
a classical ideal gas. For simplicity, we assume that the gas is 
monoatomic (i.e., each molecule has only one atom), and that it 
is confined to a cubical box of volume V, with DL being the length 
of each edge and A the area of any wall. An atom whose velocity 
has an x component v, will collide regularly with the left-hand wall, 
traveling a distance 2D parallel to the x axis between collisions with 
that wall. The time between collisions is At = 2L/v,, and in each 
collision the x component of the atom’s momentum is reversed from 
—mMv,z to mvz,. The total force on the wall is 


F= a [monoatomic ideal gas] ; 


where the index i refers to the individual atoms. Substituting 
Apz,i = 2mvz,, and At; = 2L/vz;, we have 


mei Bea 
f= Ss . [monoatomic ideal gas] 


The quantity mv? is twice the contribution to the kinetic energy 


from the part of the atoms’ center of mass motion that is parallel to 
the x axis. Since we’re assuming a monoatomic gas, center of mass 
motion is the only type of motion that gives rise to kinetic energy. 
(A more complex molecule could rotate and vibrate as well.) If the 
quantity inside the sum included the y and z components, it would 
be twice the total kinetic energy of all the molecules. Since we 
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expect the energy to be equally shared among z, y, and z motion,! 
the quantity inside the sum must therefore equal 2/3 of the total 
kinetic energy, so 


2K, 
ES ae [monoatomic ideal gas] 
Dividing by A and using AL = V, we have 
2K, 
P= a [monoatomic ideal gas] 


This can be connected to the empirical relation PV « nT if we 
multiply by V on both sides and rewrite Kiotg, as nK, where K is 
the average kinetic energy per molecule: 


QD? os 
PY = gk [monoatomic ideal gas] 


For the first time we have an interpretation for the temperature 
based on a microscopic description of matter: in a monoatomic ideal 
gas, the temperature is a measure of the average kinetic energy per 
molecule. The proportionality between the two is K = (3/2)kT, 
where the constant of proportionality k, known as Boltzmann’s con- 
stant, has a numerical value of 1.38 x 10773 J/K. In terms of Boltz- 
mann’s constant, the relationship among the bulk quantities for an 
ideal gas becomes 


PV =nEF : [ideal gas] 


which is known as the ideal gas law. Although I won’t prove it here, 
this equation applies to all ideal gases, even though the derivation 
assumed a monoatomic ideal gas in a cubical box. (You may have 
seen it written elsewhere as PV = NRT, where N = n/N, is the 
number of moles of atoms, R = kN,4, and Ny = 6.0 x 107%, called 
Avogadro’s number, is essentially the number of hydrogen atoms in 
1 g of hydrogen.) 


Pressure in a car tire example 9 
> After driving on the freeway for a while, the air in your car's 
tires heats up from 10°C to 35°C. How much does the pressure 
increase? 


> The tires may expand a little, but we assume this effect is small, 
so the volume is nearly constant. From the ideal gas law, the 
ratio of the pressures is the same as the ratio of the absolute 
temperatures, 


P2/Py 
= 12/7; 
= (308 K) /(283 K) 
29° 


or a 9% increase. 


'This equal sharing will be justified more rigorously on page 319. 
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Discussion Questions 


A Compare the amount of energy needed to heat 1 liter of helium by 
1 degree with the energy needed to heat 1 liter of xenon. In both cases, 
the heating is carried out in a sealed vessel that doesn’t allow the gas to 
expand. (The vessel is also well insulated.) 


B Repeat discussion question A if the comparison is 1 kg of helium 
versus 1 kg of xenon (equal masses, rather than equal volumes). 


Cc Repeat discussion question A, but now compare 1 liter of helium in 
a vessel of constant volume with the same amount of helium in a vessel 
that allows expansion beyond the initial volume of 1 liter. (This could be a 
piston, or a balloon.) 


5.3 Entropy As a Macroscopic Quantity 
5.3.1 Efficiency and grades of energy 


Some forms of energy are more convenient than others in certain 
situations. You can’t run a spring-powered mechanical clock on a 
battery, and you can’t run a battery-powered clock with mechanical 
energy. However, there is no fundamental physical principle that 
prevents you from converting 100% of the electrical energy in a 
battery into mechanical energy or vice-versa. More efficient motors 
and generators are being designed every year. In general, the laws 
of physics permit perfectly efficient conversion within a broad class 
of forms of energy. 


Heat is different. Friction tends to convert other forms of energy 
into heat even in the best lubricated machines. When we slide a 
book on a table, friction brings it to a stop and converts all its kinetic 
energy into heat, but we never observe the opposite process, in which 
a book spontaneously converts heat energy into mechanical energy 
and starts moving! Roughly speaking, heat is different because it is 
disorganized. Scrambling an egg is easy. Unscrambling it is harder. 


We summarize these observations by saying that heat is a lower 
grade of energy than other forms such as mechanical energy. 


Of course it is possible to convert heat into other forms of energy 
such as mechanical energy, and that is what a car engine does with 
the heat created by exploding the air-gasoline mixture. But a car 
engine is a tremendously inefficient device, and a great deal of the 
heat is simply wasted through the radiator and the exhaust. Engi- 
neers have never succeeded in creating a perfectly efficient device for 
converting heat energy into mechanical energy, and we now know 
that this is because of a deeper physical principle that is far more 
basic than the design of an engine. 
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5.3.2 Heat engines 


Heat may be more useful in some forms than in other, i.e., there 
are different grades of heat energy. In figure a/1, the difference 
in temperature can be used to extract mechanical work with a fan 
blade. This principle is used in power plants, where steam is heated 
by burning oil or by nuclear reactions, and then allowed to expand 
through a turbine which has cooler steam on the other side. On a 
smaller scale, there is a Christmas toy, b, that consists of a small 
propeller spun by the hot air rising from a set of candles, very much 
like the setup shown in figure a. 


In figure a/2, however, no mechanical work can be extracted 
because there is no difference in temperature. Although the air in 
a/2 has the same total amount of energy as the air in a/1, the heat 
in a/2 is a lower grade of energy, since none of it is accessible for 
doing mechanical work. 


In general, we define a heat engine as any device that takes heat 
from a reservoir of hot matter, extracts some of the heat energy to do 
mechanical work, and expels a lesser amount of heat into a reservoir 
of cold matter. The efficiency of a heat engine equals the amount of 
useful work extracted, W, divided by the amount of energy we had 
to pay for in order to heat the hot reservoir. This latter amount 
of heat is the same as the amount of heat the engine extracts from 
the high-temperature reservoir, Qy. (The letter Q is the standard 
notation for a transfer of heat.) By conservation of energy, we have 
Quy =W+Qz, where Qry is the amount of heat expelled into the 
low-temperature reservoir, so the efficiency of a heat engine, W/Qz, 
can be rewritten as 


efficiency = 1 — —— [efficiency of any heat engine] 


It turns out that there is a particular type of heat engine, the 
Carnot engine, which, although not 100% efficient, is more efficient 
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a/1. The temperature difference 
between the hot and cold parts of 
the air can be used to extract me- 
chanical energy, for example with 
a fan blade that spins because 
of the rising hot air currents. 2. 
If the temperature of the air is 
first allowed to become uniform, 
then no mechanical energy can 
be extracted. The same amount 
of heat energy is present, but it 
is no longer accessible for doing 
mechanical work. 


b/A heat engine. Hot air 
from the candles rises through 
the fan blades, and makes the 
angels spin. 


Carnot 


c / Sadi (1796-1832) 
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d/The beginning of the first 
expansion stroke, in which the 
working gas is kept in thermal 
equilibrium with the hot reservoir. 


e/The beginning of the sec- 
ond expansion stroke, in which 
the working gas is thermally 
insulated. The working gas cools 


because it is doing work on the 
piston and thus losing energy. 


el 


f/The beginning of the first 
compression stroke. The working 
gas begins the stroke at the same 
temperature as the cold reservoir, 
and remains in thermal contact 
with it the whole time. The engine 
does negative work. 


Lr 
} ol 


g/The beginning of the sec- 
ond compression stroke, in which 
mechanical work is absorbed, 
heating the working gas back up 
to Ty. 


than any other. The grade of heat energy in a system can thus be 
unambiguously defined in terms of the amount of heat energy in it 
that cannot be extracted even by a Carnot engine. 


How can we build the most efficient possible engine? Let’s start 
with an unnecessarily inefficient engine like a car engine and see 
how it could be improved. The radiator and exhaust expel hot 
gases, which is a waste of heat energy. These gases are cooler than 
the exploded air-gas mixture inside the cylinder, but hotter than 
the air that surrounds the car. We could thus improve the engine’s 
efficiency by adding an auxiliary heat engine to it, which would 
operate with the first engine’s exhaust as its hot reservoir and the 
air as its cold reservoir. In general, any heat engine that expels 
heat at an intermediate temperature can be made more efficient by 
changing it so that it expels heat only at the temperature of the 
cold reservoir. 


Similarly, any heat engine that absorbs some energy at an in- 
termediate temperature can be made more efficient by adding an 
auxiliary heat engine to it which will operate between the hot reser- 
voir and this intermediate temperature. 


Based on these arguments, we define a Carnot engine as a heat 
engine that absorbs heat only from the hot reservoir and expels it 
only into the cold reservoir. Figures d-g show a realization of a 
Carnot engine using a piston in a cylinder filled with a monoatomic 
ideal gas. This gas, known as the working fluid, is separate from, 
but exchanges energy with, the hot and cold reservoirs. As proved 
on page 322, this particular Carnot engine has an efficiency given 
by 

TL 
efficiency = 1 — Ty : 
where Ty, is the temperature of the cold reservoir and Ty is the 
temperature of the hot reservoir. 


[efficiency of a Carnot engine] 


Even if you do not wish to dig into the details of the proof, 
the basic reason for the temperature dependence is not so hard to 
understand. Useful mechanical work is done on strokes d and e, in 
which the gas expands. The motion of the piston is in the same 
direction as the gas’s force on the piston, so positive work is done 
on the piston. In strokes f and g, however, the gas does negative 
work on the piston. We would like to avoid this negative work, 
but we must design the engine to perform a complete cycle. Luckily 
the pressures during the compression strokes are lower than the ones 
during the expansion strokes, so the engine doesn’t undo all its work 
with every cycle. The ratios of the pressures are in proportion to 
the ratios of the temperatures, so if Ty; is 20% of Ty, the engine is 
80% efficient. 


We have already proved that any engine that is not a Carnot 
engine is less than optimally efficient, and it is also true that all 


308 Chapter5 Thermodynamics 


Carnot engines operating between a given pair of temperatures Ty 
and Ty, have the same efficiency. (This can be proved by the methods 
of section 5.4.) Thus a Carnot engine is the most efficient possible 
heat engine. 


5.3.3 Entropy 


We would like to have some numerical way of measuring the 
grade of energy in a system. We want this quantity, called entropy, 
to have the following two properties: 


(1) Entropy is additive. When we combine two systems and 
consider them as one, the entropy of the combined system equals 
the sum of the entropies of the two original systems. (Quantities 
like mass and energy also have this property.) 


(2) The entropy of a system is not changed by operating a Carnot 
engine within it. 


It turns out to be simpler and more useful to define changes 
in entropy than absolute entropies. Suppose as an example that a 
system contains some hot matter and some cold matter. It has a 
relatively high grade of energy because a heat engine could be used 
to extract mechanical work from it. But if we allow the hot and 
cold parts to equilibrate at some lukewarm temperature, the grade 
of energy has gotten worse. Thus putting heat into a hotter area 
is more useful than putting it into a cold area. Motivated by these 
considerations, we define a change in entropy as follows: 


AS - 2 


Tr [change in entropy when adding 


heat Q to matter at temperature T; 


AS is negative if heat is taken out] 


A system with a higher grade of energy has a lower entropy. 


Entropy is additive. example 10 


Since changes in entropy are defined by an additive quantity (heat) 


divided by a non-additive one (temperature), entropy is additive. 


Entropy isn’t changed by a Carnot engine. example 11 


The efficiency of a heat engine is defined by 
efficiency = 1 — Q,/Qy ; 
and the efficiency of a Carnot engine is 


efficiency = 1 — T,/Ty ; 


so for a Carnot engine we have Q,/Qy = 7T,/Ty, which can be 
rewritten as Q,/T, = Qy/Ty. The entropy lost by the hot reservoir 


is therefore the same as the entropy gained by the cold one. 
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cas 


h/Entropy can be understood 
using the metaphor of a water 
wheel. Letting the water levels 
equalize is like letting the entropy 
maximize. Taking water from the 
high side and putting it into the 
low side increases the entropy. 
Water levels in this metaphor 
correspond to temperatures in 
the actual definition of entropy. 
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Entropy increases in heat conduction. example 12 
When a hot object gives up energy to a cold one, conservation 
of energy tells us that the amount of heat lost by the hot object 
is the same as the amount of heat gained by the cold one. The 
change in entropy is —Q/Ty + Q/T,, which is positive because 
Tr < Ty: 


Entropy is increased by a non-Carnot engine. example 13 
The efficiency of a non-Carnot engine is less than 1 - 7,/Ty, 
so Q,/Qy > T,/Ty and Q,/T, > Qy/Ty. This means that the 
entropy increase in the cold reservoir is greater than the entropy 
decrease in the hot reservoir. 


A book sliding to a stop example 14 
A book slides across a table and comes to a stop. Once it stops, 
all its kinetic energy has been transformed into heat. As the book 
and table heat up, their entropies both increase, so the total en- 
tropy increases as well. 


All of these examples involved closed systems, and in all of them 
the total entropy either increased or stayed the same. It never de- 
creased. Here are two examples of schemes for decreasing the en- 
tropy of a closed system, with explanations of why they don’t work. 


Using a refrigerator to decrease entropy? example 15 
> A refrigerator takes heat from a cold area and dumps it into a 
hot area. (1) Does this lead to a net decrease in the entropy of 
a closed system? (2) Could you make a Carnot engine more ef- 
ficient by running a refrigerator to cool its low-temperature reser- 
voir and eject heat into its high-temperature reservoir? 


> (1) No. The heat that comes off of the radiator coils is a great 
deal more than the heat the fridge removes from inside; the dif- 
ference is what it costs to run your fridge. The heat radiated from 
the coils is so much more than the heat removed from the inside 
that the increase in the entropy of the air in the room is greater 
than the decrease of the entropy inside the fridge. The most ef- 
ficient refrigerator is actually a Carnot engine running in reverse, 
which leads to neither an increase nor a decrease in entropy. 


(2) No. The most efficient refrigerator is a reversed Carnot en- 
gine. You will not achieve anything by running one Carnot engine 
in reverse and another forward. They will just cancel each other 
out. 


Maxwell’s demon example 16 
> Maxwell imagined a pair of rooms, their air being initially in ther- 
mal equilibrium, having a partition across the middle with a tiny 
door. A miniscule demon is posted at the door with a little ping- 
pong paddle, and his duty is to try to build up faster-moving air 
molecules in room B and slower moving ones in room A. For in- 
stance, when a fast molecule is headed through the door, going 
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from A to B, he lets it by, but when a slower than average molecule 
tries the same thing, he hits it back into room A. Would this de- 
crease the total entropy of the pair of rooms? 


> No. The demon needs to eat, and we can think of his body 
as a little heat engine, and his metabolism is less efficient than a 
Carnot engine, so he ends up increasing the entropy rather than 
decreasing it. 


Observations such as these lead to the following hypothesis, 
known as the second law of thermodynamics: 


The entropy of a closed system always increases, or at best stays 
the same: AS > 0. 


At present our arguments to support this statement may seem 
less than convincing, since they have so much to do with obscure 
facts about heat engines. In the following section we will find a more 
satisfying and fundamental explanation for the continual increase in 
entropy. To emphasize the fundamental and universal nature of the 
second law, here are a few exotic examples. 


Entropy and evolution example 17 
A favorite argument of many creationists who don’t believe in evo- 
lution is that evolution would violate the second law of thermody- 
namics: the death and decay of a living thing releases heat (as 
when a compost heap gets hot) and lessens the amount of en- 
ergy available for doing useful work, while the reverse process, 
the emergence of life from nonliving matter, would require a de- 
crease in entropy. Their argument is faulty, since the second law 
only applies to closed systems, and the earth is not a closed sys- 
tem. The earth is continuously receiving energy from the sun. 


The heat death of the universe example 18 
Living things have low entropy: to demonstrate this fact, observe 
how a compost pile releases heat, which then equilibrates with 
the cooler environment. We never observe dead things to leap 
back to life after sucking some heat energy out of their environ- 
ments! The only reason life was able to evolve on earth was that 
the earth was not a closed system: it got energy from the sun, 
which presumably gained more entropy than the earth lost. 


Victorian philosophers spent a lot of time worrying about the heat 
death of the universe: eventually the universe would have to be- 
come a high-entropy, lukewarm soup, with no life or organized 
motion of any kind. Fortunately (?), we now know a great many 
other things that will make the universe inhospitable to life long 
before its entropy is maximized. Life on earth, for instance, will 
end when the sun evolves into a giant star and vaporizes our 
planet. 
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Hawking radiation example 19 
Any process that could destroy heat (or convert it into noth- 
ing but mechanical work) would lead to a reduction in entropy. 
Black holes are supermassive stars whose gravity is so strong 
that nothing, not even light, can escape from them once it gets 
within a boundary known as the event horizon. Black holes are 
commonly observed to suck hot gas into them. Does this lead to 
a reduction in the entropy of the universe? Of course one could 
argue that the entropy is still there inside the black hole, but being 
able to “hide” entropy there amounts to the same thing as being 
able to destroy entropy. 


The physicist Steven Hawking was bothered by this question, and 
finally realized that although the actual stuff that enters a black 
hole is lost forever, the black hole will gradually lose energy in the 
form of light emitted from just outside the event horizon. This light 
ends up reintroducing the original entropy back into the universe 
at large. 


Discussion Questions 


A __Inthis discussion question, you'll think about a car engine in terms of 
thermodynamics. Note that an internal combustion engine doesn't fit very 
well into the theoretical straightjacket of a heat engine. For instance, a 
heat engine has a high-temperature heat reservoir at a single well-defined 
temperature, 7}. In a typical car engine, however, there are several very 
different temperatures you could imagine using for Ty: the temperature 
of the engine block (~ 100°C), the walls of the cylinder (~ 250°C), or the 
temperature of the exploding air-gas mixture (~ 1000°C, with significant 
changes over a four-stroke cycle). Let’s use Ty ~ 1000°C. 


Burning gas supplies heat energy Q, to your car’s engine. The engine 
does mechanical work W, but also expels heat Q, into the environment 
through the radiator and the exhaust. Conservation of energy gives 


Qy = Q. + W * 


and the relative proportions of Q, and W are usually about 90% to 10%. 
(Actually it depends quite a bit on the type of car, the driving conditions, 
etc.) 


(1) Q,; is obviously undesirable: you pay for it, but all it does is heat the 
neighborhood. Suppose that engineers do a really good job of getting rid 
of the effects that create Q,, such as friction. Could Q, ever be reduced 
to zero, at least theoretically? 


(2) A gallon of gas releases about 140 MJ of heat Qy when burned. Es- 
timate the change in entropy of the universe due to running a typical car 
engine and burning one gallon of gas. (You'll have to estimate how hot 
the environment is. For the sake of argument, assume that the work done 
by the engine, W, remains in the form of mechanical energy, although in 
reality it probably ends up being changed into heat when you step on the 
brakes.) Is your result consistent with the second law of thermodynamics? 


(3) What would happen if you redid the calculation in #2, but assumed 
Q, = 0? Is this consistent with your answer to #1? 
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B Example 11 on page 309 showed that entropy isn’t changed by 
a Carnot engine, by using the expression for the efficiency of a Carnot 
engine. Prove the same thing by applying the definition of entropy to 
each of the four strokes of the Carnot engine. 


5.4 Entropy As a Microscopic Quantity 


5.4.1 A microscopic view of entropy 


To understand why the second law of thermodynamics is always 
true, we need to see what entropy really means at the microscopic 
level. An example that is easy to visualize is the free expansion of 
a monoatomic gas. Figure a/1 shows a box in which all the atoms 
of the gas are confined on one side. We very quickly remove the 
barrier between the two sides, a/2, and some time later, the system 
has reached an equilibrium, a/3. Each snapshot shows both the po- 
sitions and the momenta of the atoms, which is enough information 
to allow us in theory to extrapolate the behavior of the system into 
the future, or the past. However, with a realistic number of atoms, 
rather than just six, this would be beyond the computational power 


of any computer.” 


But suppose we show figure a/2 to a friend without any further 
information, and ask her what she can say about the system’s behav- 
ior in the future. She doesn’t know how the system was prepared. 
Perhaps, she thinks, it was just a strange coincidence that all the 
atoms happened to be in the right half of the box at this particular 
moment. In any case, she knows that this unusual situation won’t 
last for long. She can predict that after the passage of any signifi- 
cant amount of time, a surprise inspection is likely to show roughly 
half the atoms on each side. The same is true if you ask her to say 
what happened in the past. She doesn’t know about the barrier, 
so as far as she’s concerned, extrapolation into the past is exactly 
the same kind of problem as extrapolation into the future. We just 
have to imagine reversing all the momentum vectors, and then all 
our reasoning works equally well for backwards extrapolation. She 
would conclude, then, that the gas in the box underwent an unusual 
fluctuation, b, and she knows that the fluctuation is very unlikely 
to exist very far into the future, or to have existed very far into the 
past. 


What does this have to do with entropy? Well, state a/3 has 
a greater entropy than state a/2. It would be easy to extract me- 
chanical work from a/2, for instance by letting the gas expand while 
pressing on a piston rather than simply releasing it suddenly into the 
void. There is no way to extract mechanical work from state a/3. 
Roughly speaking, our microscopic description of entropy relates to 


?Even with smaller numbers of atoms, there is a problem with this kind of 
brute-force computation, because the tiniest measurement errors in the initial 
state would end up having large effects later on. 


a/Agas expands freely, doubling 
its volume. 


6 


number 
of atoms LI 
onthe 3 } ute 
right 
0 


time 


b/An unusual fluctuation in 
the distribution of the atoms 
between the two sides of the 
box. There has been no external 
manipulation as in figure a/1. 
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c/Earth orbit is becoming clut- 
tered with space junk, and the 
pieces can be thought of as the 
“molecules” comprising an exotic 
kind of gas. These image shows 
the evolution of a cloud of debris 
arising from a 2007 Chinese test 
of an anti-satellite rocket. Pan- 
els 1-4 show the cloud five min- 
utes, one hour, one day, and one 
month after the impact. The en- 
tropy seems to have maximized 
by panel 4. 


R=1 R=2 


Xo 
R=0 R=1 


~ 


d/The phase space for two 
atoms in a box. 


R=2 
R=1 (hidden) 


e/The phase space for three 
atoms in a box. 
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the number of possible states. There are a lot more states like a/3 
than there are states like a/2. Over long enough periods of time 
— long enough for equilibration to occur — the system gets mixed 
up, and is about equally likely to be in any of its possible states, 
regardless of what state it was initially in. We define some number 
that describes an interesting property of the whole system, say the 
number of atoms in the right half of the box, R. A high-entropy 
value of R is one like R = 3, which allows many possible states. We 
are far more likely to encounter R = 3 than a low-entropy value like 
R=0OorR=6. 


5.4.2 Phase space 


There is a problem with making this description of entropy into a 
mathematical definition. The problem is that it refers to the number 
of possible states, but that number is theoretically infinite. To get 
around the problem, we coarsen our description of the system. For 
the atoms in figure a, we don’t really care exactly where each atom 
is. We only care whether it is in the right side or the left side. If a 
particular atom’s left-right position is described by a coordinate x, 
then the set of all possible values of x is a line segment along the 
x axis, containing an infinite number of points. We break this line 
segment down into two halves, each of width Az, and we consider 
two different values of x to be variations on the same state if they 
both lie in the same half. For our present purposes, we can also 
ignore completely the y and z coordinates, and all three momentum 
components, pz, Py, and pz. 


Now let’s do a real calculation. Suppose there are only two atoms 
in the box, with coordinates 7; and x2. We can give all the relevant 
information about the state of the system by specifying one of the 
cells in the grid shown in figure d. This grid is known as the phase 
space of the system. The lower right cell, for instance, describes a 
state in which atom number 1 is in the right side of the box and 
atom number 2 in the left. Since there are two possible states with 
R = 1 and only one state with R = 2, we are twice as likely to 
observe R = 1, and R = 1 has higher entropy than R = 2. 


Figure e shows a corresponding calculation for three atoms, which 
makes the phase space three-dimensional. Here, the R = 1 and 2 
states are three times more likely than R = 0 and 3. Four atoms 
would require a four-dimensional phase space, which exceeds our 
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ability to visualize. Although our present example doesn’t require 

it, a phase space can describe momentum as well as position, as 

shown in figure f. In general, a phase space for a monoatomic gas 

has six dimensions per atom (one for each coordinate and one for p 
each momentum component). 


5.4.3 Microscopic definitions of entropy and temperature 
X 


Two more issues need to be resolved in order to make a micro- 
scopic definition of entropy. f/A phase space for a_ sin- 
gle atom in one dimension, taking 
momentum into account. 


First, if we defined entropy as the number of possible states, 
it would be a multiplicative quantity, not an additive one: if an 
ice cube in a glass of water has M, states available to it, and the 
number of states available to the water is Mo, then the number of 
possible states of the whole system is the product M,M . To get 
around this problem, we take the natural logarithm of the number 
of states, which makes the entropy additive because of the property 
of the logarithm In(M1 M2) = In M, + In Mo. 


The second issue is a more trivial one. The concept of entropy 
was originally invented as a purely macroscopic quantity, and the 
macroscopic definition AS = Q/T, which has units of J/K, has a 
different calibration than would result from defining S$ = In M. The 
calibration constant we need turns out to be simply the Boltzmann 


constant, k. g/Ludwig Boltzmann’s — tomb, 
‘ : eee . inscribed with his equation for 
Microscopic definition of entropy: The entropy of a system is entropy. 


S =klnM, where M is the number of available states.? 


This also leads to a more fundamental definition of temperature. 
Two systems are in thermal equilibrium when they have maximized 
their combined entropy through the exchange of energy. Here the 
energy possessed by one part of the system, FE; or E2, plays the 
same role as the variable R in the examples of free expansion above. 
A maximum of a function occurs when the derivative is zero, so the 
maximum entropy occurs when 


d(S1 + S2) 
dE, 


We assume the systems are only able to exchange heat energy with 
each other, dE, = —dE, so 


dS; = dS 


dE, dE, ’ 
and since the energy is being exchanged in the form of heat we can 
make the equations look more familiar if we write dQ for an amount 
of heat to be transferred into either system: 


dS; = dS» 


dQ dQz 


3This is the same relation as the one on Boltzmann’s tomb, just in a slightly 
different notation. 


=0 
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° number of possible states 


h/A_ two-atom system has 
the highest number of available 
states when the energy is equally 
divided. Equal energy division is 
therefore the most likely possibil- 
ity at any given moment in time. 


© Number of possible states 
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i/When two systems of 10 
atoms each interact, the graph of 
the number of possible states is 
narrower than with only one atom 
in each system. 


In terms of our previous definition of entropy, this is equivalent to 
1/T;, = 1/T2, which makes perfect sense since the systems are in 
thermal equilibrium. According to our new approach, entropy has 
already been defined in a fundamental manner, so we can take this 
as a definition of temperature: 


Taq 
where dS represents the increase in the system’s entropy from adding 
heat dQ to it. 


Examples with small numbers of atoms 


Let’s see how this applies to an ideal, monoatomic gas with a 
small number of atoms. To start with, consider the phase space 
available to one atom. Since we assume the atoms in an ideal gas 
are noninteracting, their positions relative to each other are really ir- 
relevant. We can therefore enumerate the number of states available 
to each atom just by considering the number of momentum vectors 
it can have, without considering its possible locations. The relation- 
ship between momentum and kinetic energy is E = (p2 +pi+p2) /2m, 
so if for a fixed value of its energy, we arrange all of an atom’s possi- 
ble momentum vectors with their tails at the origin, their tips all lie 
on the surface of a sphere in phase space with radius |p| = V2mE. 
The number of possible states for that atom is proportional to the 
sphere’s surface area, which in turn is proportional to the square of 
the sphere’s radius, |p|? = 2mE. 


Now consider two atoms. For any given way of sharing the en- 
ergy between the atoms, EF = E, + Eo, the number of possible 
combinations of states is proportional to EL, Ey. The result is shown 
in figure h. The greatest number of combinations occurs when we 
divide the energy equally, so an equal division gives maximum en- 


tropy. 


By increasing the number of atoms, we get a graph whose peak 
is narrower, i. With more than one atom in each system, the total 
energy is E = (p24 + P1 +021 +Pi2+Py2 t+ P22 t--)/2m. With n 
atoms, a total of 3n momentum coordinates are needed in order to 
specify their state, and such a set of numbers is like a single point 
in a 3n-dimensional space (which is impossible to visualize). For a 
given total energy E, the possible states are like the surface of a 
3n-dimensional sphere, with a surface area proportional to p?”~!, 
or B@"-1)/2_ The graph in figure i, for example, was calculated 
according to the formula | = EMP E Oe 


Since graph i is narrower than graph h, the fluctuations in energy 
sharing are smaller. If we inspect the system at a random moment in 
time, the energy sharing is very unlikely to be more lopsided than 
a 40-60 split. Now suppose that, instead of 10 atoms interacting 
with 10 atoms, we had a 107° atoms interacting with 107° atoms. 
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The graph would be extremely narrow, and it would be a statistical 
certainty that the energy sharing would be nearly perfectly equal. 
This is why we never observe a cold glass of water to change itself 
into an ice cube sitting in some warm water! 


By the way, note that although we’ve redefined temperature, 
these examples show that things are coming out consistent with the 
old definition, since we saw that the old definition of temperature 
could be described in terms of the average energy per atom, and 
here we're finding that equilibration results in each subset of the 
atoms having an equal share of the energy. 


Entropy of a monoatomic ideal gas 


Let’s calculate the entropy of a monoatomic ideal gas of n atoms. 
This is an important example because it allows us to show that 
our present microscopic treatment of thermodynamics is consistent 
with our previous macroscopic approach, in which temperature was 
defined in terms of an ideal gas thermometer. 


The number of possible locations for each atom is V/A, where 
Az is the size of the space cells in phase space. The number of pos- 
sible combinations of locations for the atoms is therefore (V/Ax?)”. 


The possible momenta cover the surface of a 3n-dimensional 
sphere, whose radius is /2mE, and whose surface area is therefore 
proportional to E@"-))/2, In terms of phase-space cells, this area 
corresponds to E@n-1)/2 /Ap®” possible combinations of momenta, 
multiplied by some constant of proportionality which depends on 
m, the atomic mass, and n, the number of atoms. To avoid having 
to calculate this constant of proportionality, we limit ourselves to 
calculating the part of the entropy that does not depend on n, so 
the resulting formula will not be useful for comparing entropies of 
ideal gas samples with different numbers of atoms. 


The final result for the number of available states is 


mn Fp(8n—1)/2 
M= (= :) Apa ; (function of n] 
i 


so the entropy is 


S=nkinV + nk In E + (function of Ax, Ap, and n) ; 


where the distinction between n and n — 1 has been ignored. Using 
PV = nkT and E = (3/2)nkT, we can also rewrite this as 


S= nk InT—nkln P+..., entropy of a monoatomic ideal gas] 


where “...” indicates terms that may depend on Az, Ap, m, and 
n, but that have no effect on comparisons of gas samples with the 


same number of atoms. 
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self-check C 

Why does it make sense that the temperature term has a positive sign 
in the above example, while the pressure term is negative? Why does 
it make sense that the whole thing is proportional ton? —_> Answer, p. 
869 


To show consistency with the macroscopic approach to thermo- 
dynamics, we need to show that these results are consistent with 
the behavior of an ideal-gas thermometer. Using the new definition 
1/T = dS/dQ, we have 1/T = dS/dE, since transferring an amount 
of heat dQ into the gas increases its energy by a corresponding 
amount. Evaluating the derivative, we find 1/T = (3/2)nk/E, or 
E = (3/2)nkT, which is the correct relation for a monoatomic ideal 
gas. 


A mixture of molecules example 20 
> Suppose we have a mixture of two different monoatomic gases, 
say helium and argon. How would we find the entropy of such a 
mixture (say, in terms of V and E)? How would the energy be 
shared between the two types of molecules, i.e., would a more 
massive argon atom have more energy on the average than a 
less massive helium atom, the same, or less? 


> Since entropy is additive, we simply need to add the entropies of 
the two types of atom. However, the expression derived above for 
the entropy omitted the dependence on the mass m of the atom, 
which is different for the two constituents of the gas, so we need 
to go back and figure out how to put that m-dependence back in. 
The only place where we threw away m’s was when we identified 
the radius of the sphere in momentum space with V2mE, but 
then threw away the constant factor of m. In other words, the final 
result can be generalized merely by replacing E everywhere with 
the product mE. Since the log of a product is the sum of the logs, 
the dependence of the final result on m and E can be broken 
apart into two different terms, and we find 


S =nklin V+ Snkinm+ SnkinE +... 


The total entropy of the mixture can then be written as 


S=nkln V+ Nnokln V+ Smkinm + Smokin me 


3 3 
+ 5MkInEy + 5nekin Ep +... 


Now what about the energy sharing? If the total energy is E = 
E, + Eo, then the most ovewhelmingly probable sharing of energy 
will the the one that maximizes the entropy. Notice that the depen- 
dence of the entropy on the masses m; and mp occurs in terms 
that are entirely separate from the energy terms. If we want to 
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maximize S with respect to E; (with Es = E — E, by conservation 
of energy), then we differentiate S with respect to E; and set it 
equal to zero. The terms that contain the masses don’t have any 
dependence on &£,;, so their derivatives are zero, and we find that 
the molecular masses can have no effect on the energy sharing. 
Setting the derivative equal to zero, we have 


a) 3 3 
cary (nikin V +Nnokln Vt smkinm + 5fekInime 


+ 2 ee + 5k In ~ E) +...) 


2 
=5« (2 =) 
2 \E, E-E; 
nN No 
°° EEE 
m — Ne 
E, Ee 


In other words, each gas gets a share of the energy in proportion 
to the number of its atoms, and therefore every atom gets, on 
average, the same amount of energy, regardless of its mass. The 
result for the average energy per atom is exactly the same as for 


an unmixed gas, K = (3/2)kT. 


Equipartition 


Example 20 is a special case of a more general statement called 
the equipartition theorem. Suppose we have only one argon atom, 
named Alice, and one helium atom, named Harry. Their total ki- 
netic energy is H = pz /2m+ pz /2m + p2/2m +p! /2ml + pls /2m! + 
p'?/2m!, where the primes indicate Harry. We have six terms that 
all look alike. The only difference among them is that the constant 
factors attached to the squares of the momenta have different val- 
ues, but we’ve just proved that those differences don’t matter. In 
other words, if we have any system at all whose energy is of the 
form E = (...)p? + (...)p3 +..., with any number of terms, then 
each term holds, on average, the same amount of energy, 5kT. We 
say that the system consisting of Alice and Harry has six degrees of 
freedom. It doesn’t even matter whether the things being squared 
are momenta: if you look back over the logical steps that went into 
the argument, you’ll see that none of them depended on that. In a 
solid, for example, the atoms aren’t free to wander around, but they 
can vibrate from side to side. If an atom moves away from its equi- 
librium position at « = 0 to some other value of x, then its electrical 
energy is (1/2)kx?, where « is the spring constant (written as the 
Greek letter kappa to distinguish it from the Boltzmann constant k). 
We can conclude that each atom in the solid, on average, has 5kT of 
energy in the electrical energy due to its x displacement along the x 
axis, and equal amounts for y and z. This is known as equipartition, 


Section 5.4 Entropy As a Microscopic Quantity 


319 


manometer 


syringe 


j/An experiment for  deter- 
mining the shapes of molecules. 


k/The differing shapes of a 
helium atom (1), a_ nitrogen 
molecule (2), and a_ difluo- 
roethane molecule (3) have 
surprising macroscopic effects. 


meaning equal partitioning, or equal sharing. The equipartition the- 
orem says that if the expression for the energy looks like a sum of 
squared variables, then each degree of freedom has an average en- 
ergy of 5kT. Thus, very generally, we can interpret temperature as 
the average energy per degree of freedom (times k/2). 


An unexpected glimpse of the microcosm 


You may have the feeling at this point that of course Boltzmann 
was right about the literal existence of atoms, but only very so- 
phisticated experiments could vindicate him definitively. After all, 
the microscopic and macroscopic definitions of entropy are equiva- 
lent, so it might seem as though there was no real advantage to the 
microscopic approach. Surprisingly, very simple experiments are ca- 
pable of revealing a picture of the microscopic world, and there is 
no possible macroscopic explanation for their results. 


In 1819, before Boltzmann was born, Clément and Desormes did 
an experiment like the one shown in figure j. The gas in the flask 
is pressurized using the syringe. This heats it slightly, so it is then 
allowed to cool back down to room temperature. Its pressure is 
measured using the manometer. The stopper on the flask is popped 
and then immediately reinserted. Its pressure is now equalized with 
that in the room, and the gas’s expansion has cooled it a little, 
because it did mechanical work on its way out of the flask, causing 
it to lose some of its internal energy EF. The expansion is carried out 
quickly enough so that there is not enough time for any significant 
amount of heat to flow in through the walls of the flask before the 
stopper is reinserted. The gas is now allowed to come back up 
to room temperature (which takes a much longer time), and as a 
result regains a fraction b of its original overpressure. During this 
constant-volume reheating, we have PV = nkT, so the amount of 
pressure regained is a direct indication of how much the gas cooled 
down when it lost an amount of energy AF. 
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If the gas is monoatomic, then we know what to expect for this 
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relationship between energy and temperature: AF = (3/2)nkAT, 
where the factor of 3 came ultimately from the fact that the molecules 
of the gas was in a three-dimensional space, k/1. Moving in this 
space, each molecule can have momentum in the x, y, and z di- 
rections. It has three degrees of freedom. What if the gas is not 
monoatomic? Air, for example, is made of diatomic molecules, k/2. 
There is a subtle difference between the two cases. An individual 
atom of a monoatomic gas is a perfect sphere, so it is exactly the 
same no matter how it is oriented. Because of this perfect symme- 
try, there is thus no way to tell whether it is spinning or not, and 
in fact we find that it can’t rotate. The diatomic gas, on the other 
hand, can rotate end over end about the x or y axis, but cannot 
rotate about the z axis, which is its axis of symmetry. It has a total 
of five degrees of freedom. A polyatomic molecule with a more com- 
plicated, asymmetric shape, k/3, can rotate about all three axis, so 
it has a total of six degrees of freedom. 


Because a polyatomic molecule has more degrees of freedom than 
a monoatomic one, it has more possible states for a given amount of 
energy. That is, its entropy is higher for the same energy. From the 
definition of temperature, 1/T = dS/dE, we conclude that it has a 
lower temperature for the same energy. In other words, it is more 
difficult to heat n molecules of difluoroethane than it is to heat n 
atoms of helium. When the Clément-Desormes experiment is carried 
out, the result b therefore depends on the shape of the molecule! 
Who would have dreamed that such simple observations, correctly 
interpreted, could give us this kind of glimpse of the microcosm? 


Lets go ahead and calculate how this works. Suppose a gas is 
allowed to expand without being able to exchange heat with the 
rest of the universe. The loss of thermal energy from the gas equals 
the work it does as it expands, and using the result of homework 
problem 2 on page 332, the work done in an infinitesimal expansion 
equals PdV, so 

dE + PdV =0 


(If the gas had not been insulated, then there would have been a 
third term for the heat gained or lost by heat conduction.) 


From section 5.2 we have E = (3/2)PV for a monoatomic ideal 
gas. More generally, the equipartition theorem tells us that the 3 
simply needs to be replaced with the number of degrees of freedom 
a, so dE = (a/2)PdV +(a/2)VdP, and the equation above becomes 


a+2 a 
= P P 
0 5 dV 5Vd 


Rearranging, we have 
dV dP 
2)— = -a— 
(a + 2) V as 


Integrating both sides gives 
(a+ 2) nV = —alnP + constant, 
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and taking exponentials on both sides yields 


yor x P-@ 


We now wish to reexpress this in terms of pressure and temper- 
ature. Eliminating V «x (T/P) gives 


Tx P®, 


where 6 = 2/(a + 2) is equal to 2/5, 2/7, or 1/4, respectively, for a 
monoatomic, diatomic, or polyatomic gas. 


Efficiency of the Carnot engine example 21 
As an application, we now prove the result claimed earlier for the 
efficiency of a Carnot engine. First consider the work done dur- 
ing the constant-temperature strokes. Integrating the equation 
dW = PdV, we have W = [{ PdV. Since the thermal energy of 
an ideal gas depends only on its temperature, there is no change 
in the thermal energy of the gas during this constant-temperature 
process. Conservation of energy therefore tells us that work done 
by the gas must be exactly balanced by the amount of heat trans- 
ferred in from the reservoir. 


=W 
- | Pav 


For our proof of the efficiency of the Carnot engine, we need only 
the ratio of Qy to Q;, so we neglect constants of proportionality, 
and simply subsitutde P « T/V, giving 

Py 
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The efficiency of a heat engine is 
efficiency = 1 -— —— 


Making use of the result from the previous proof for a Carnot en- 
gine with a monoatomic ideal gas as its working gas, we have 


Ti In(P4/Pa) 
Ty In(P; /Po)’ 


where the subscripts 1, 2, 3, and 4 refer to figures d—g on page 
308. We have shown above that the temperature is proportional 
to P° on the insulated strokes 2-3 and 4-1, the pressures must be 
related by P2/P3 = P;/P4, which can be rearranged as P,4/P3 = 
P; /P2, and we therefore have 


efficiency = 1 


efficiency = 1 — ft 
Ty 


Chapter5 Thermodynamics 


5.4.4 The arrow of time, or “this way to the Big Bang” 


Now that we have a microscopic understanding of entropy, what 
does that tell us about the second law of thermodynamics? The 
second law defines a forward direction to time, “time’s arrow.” The 
microscopic treatment of entropy, however, seems to have mysteri- 
ously sidestepped that whole issue. A graph like figure b on page 
313, showing a fluctuation away from equilibrium, would look just 
as natural if we flipped it over to reverse the direction of time. Af 
ter all, the basic laws of physics are conservation laws, which don’t 
distinguish between past and future. Our present picture of entropy 
suggests that we restate the second law of thermodynamics as fol- 
lows: low-entropy states are short-lived. An ice cube can’t exist 
forever in warm water. We no longer have to distinguish past from 
future. 


But how do we reconcile this with our strong psychological sense 
of the direction of time, including our ability to remember the past 
but not the future? Why do we observe ice cubes melting in water, 
but not the time-reversed version of the same process? 


The answer is that there is no past-future asymmetry in the laws 
of physics, but there is a past-future asymmetry in the universe. The 
universe started out with the Big Bang. (Some of the evidence for 
the Big Bang theory is given on page 352.) The early universe had 
a very low entropy, and low-entropy states are short-lived. What 
does “short-lived” mean here, however? Hot coffee left in a paper 
cup will equilibrate with the air within ten minutes or so. Hot coffee 
in a thermos bottle maintains its low-entropy state for much longer, 
because the coffee is insulated by a vacuum between the inner and 
outer walls of the thermos. The universe has been mostly vacuum 
for a long time, so it’s well insulated. Also, it takes billions of years 
for a low-entropy normal star like our sun to evolve into the high- 
entropy cinder known as a white dwarf. 


The universe, then, is still in the process of equilibrating, and 
all the ways we have of telling the past from the future are really 
just ways of determining which direction in time points toward the 
Big Bang, i.e., which direction points to lower entropy. The psy- 
chological arrow of time, for instance, is ultimately based on the 
thermodynamic arrow. In some general sense, your brain is like 
a computer, and computation has thermodynamic effects. In even 
the most efficient possible computer, for example, erasing one bit 
of memory decreases its entropy from k1n2 (two possible states) to 
kIn1 (one state), for a drop of about 10773 J/K. One way of de- 
termining the direction of the psychological arrow of time is that 
forward in psychological time is the direction in which, billions of 
years from now, all consciousness will have ceased; if consciousness 
was to exist forever in the universe, then there would have to be a 
never-ending decrease in the universe’s entropy. This can’t happen, 
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because low-entropy states are short-lived. 


Relating the direction of the thermodynamic arrow of time to the 
existence of the Big Bang is a satisfying way to avoid the paradox of 
how the second law can come from basic laws of physics that don’t 
distinguish past from future. There is a remaining mystery, however: 
why did our universe have a Big Bang that was low in entropy? It 
could just as easily have been a maximum-entropy state, and in fact 
the number of possible high-entropy Big Bangs is vastly greater than 
the number of possible low-entropy ones. The question, however, is 
probably not one that can be answered using the methods of science. 
All we can say is that if the universe had started with a maximum- 
entropy Big Bang, then we wouldn’t be here to wonder about it. A 
longer, less mathematical discussion of these concepts, along with 
some speculative ideas, is given in “The Cosmic Origins of Time’s 
Arrow,” Sean M. Carroll, Scientific American, June 2008, p. 48. 


5.4.5 Quantum mechanics and zero entropy 


The previous discussion would seem to imply that absolute en- 
tropies are never well defined, since any calculation of entropy will 
always end up having terms that depend on Ap and Az. For in- 
stance, we might think that cooling an ideal gas to absolute zero 
would give zero entropy, since there is then only one available mo- 
mentum state, but there would still be many possible position states. 
We'll see later in this book, however, that the quantum mechanical 
uncertainty principle makes it impossible to know the location and 
position of a particle simultaneously with perfect accuracy. The 
best we can do is to determine them with an accuracy such that 
the product ApAz is equal to a constant called Planck’s constant. 
According to quantum physics, then, there is a natural minimum 
size for rectangles in phase space, and entropy can be defined in 
absolute terms. Another way of looking at it is that according to 
quantum physics, the gas as a whole has some well-defined ground 
state, which is its state of minimum energy. When the gas is cooled 
to absolute zero, the scene is not at all like what we would picture 
in classical physics, with a lot of atoms lying around motionless. It 
might, for instance, be a strange quantum-mechanical state called 
the Bose-Einstein condensate, which was achieved for the first time 
recently with macroscopic amounts of atoms. Classically, the gas 
has many possible states available to it at zero temperature, since 
the positions of the atoms can be chosen in a variety of ways. The 
classical picture is a bad approximation under these circumstances, 
however. Quantum mechanically there is only one ground state, in 
which each atom is spread out over the available volume in a cloud 
of probability. The entropy is therefore zero at zero temperature. 
This fact, which cannot be understood in terms of classical physics, 
is known as the third law of thermodynamics. 
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5.4.6 Summary of the laws of thermodynamics 


Here is a summary of the laws of thermodynamics: 


The zeroth law of thermodynamics (page 301) If object A is 
at the same temperature as object B, and B is at the same 
temperature as C, then A is at the same temperature as C. 


The first law of thermodynamics (page 296) Energy is 
conserved. 


The second law of thermodynamics (page 311) The entropy 
of a closed system always increases, or at best stays the same: 
AS > 0. 


The third law of thermodynamics (page 324) The entropy of 
a system approaches zero as its temperature approaches abso- 
lute zero. 


From a modern point of view, only the first law deserves to be called 
a fundamental law of physics. Once Boltmann discovered the mi- 
croscopic nature of entropy, the zeroth and second laws could be 
understood as statements about probability: a system containing a 
large number of particles is overwhelmingly likely to do a certain 
thing, simply because the number of possible ways to do it is ex- 
tremely large compared to the other possibilities. The third law 
is also now understood to be a consequence of more basic physical 
principles, but to explain the third law, it’s not sufficient simply to 
know that matter is made of atoms: we also need to understand the 
quantum-mechanical nature of those atoms, discussed in chapter 13. 
Historically, however, the laws of thermodynamics were discovered 
in the eighteenth century, when the atomic theory of matter was 
generally considered to be a hypothesis that couldn’t be tested ex- 
perimentally. Ideally, with the publication of Boltzmann’s work on 
entropy in 1877, the zeroth and second laws would have been imme- 
diately demoted from the status of physical laws, and likewise the 
development of quantum mechanics in the 1920’s would have done 
the same for the third law. 


5.5 More About Heat Engines 


So far, the only heat engine we’ve discussed in any detail has been 
a fictitious Carnot engine, with a monoatomic ideal gas as its work- 
ing gas. As a more realistic example, figure 1 shows one full cycle 
of a cylinder in a standard gas-burning automobile engine. This 
four-stroke cycle is called the Otto cycle, after its inventor, German 
engineer Nikolaus Otto. The Otto cycle is more complicated than a 
Carnot cycle, in a number of ways: 
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1/The Otto cycle. 1. In the ex- 
haust stroke, the piston expels the 
burned air-gas mixture left over 
from the preceding cycle. 2. In 
the intake stroke, the piston sucks 
in fresh air-gas mixture. 3. In 
the compression stroke, the pis- 
ton compresses the mixture, and 
heats it. 4. At the beginning of 
the power stroke, the spark plug 
fires, causing the air-gas mixture 
to burn explosively and heat up 
much more. The heated mix- 
ture expands, and does a large 
amount of positive mechanical 
work on the piston. 


The working gas is physically pumped in and out of the cylin- 
der through valves, rather than being sealed and reused indef- 
initely as in the Carnot engine. 


The cylinders are not perfectly insulated from the engine block, 
so heat energy is lost from each cylinder by conduction. This 
makes the engine less efficient that a Carnot engine, because 
heat is being discharged at a temperature that is not as cool 
as the environment. 


Rather than being heated by contact with an external heat 
reservoir, the air-gas mixture inside each cylinder is heated 
by internal combusion: a spark from a spark plug burns the 
gasoline, releasing heat. 


The working gas is not monoatomic. Air consists of diatomic 
molecules (Nz and Oz), and gasoline of polyatomic molecules 
such as octane (CgHjs). 


The working gas is not ideal. An ideal gas is one in which 
the molecules never interact with one another, but only with 
the walls of the vessel, when they collide with it. In a car en- 
gine, the molecules are interacting very dramatically with one 
another when the air-gas mixture explodes (and less dramat- 
ically at other times as well, since, for example, the gasoline 
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may be in the form of microscopic droplets rather than indi- 
vidual molecules). 


This is all extremely complicated, and it would be nice to have 
some way of understanding and visualizing the important proper- 
ties of such a heat engine without trying to handle every detail at 
once. A good method of doing this is a type of graph known as 
a P-V diagram. As proved in homework problem 2, the equation 
dW = Fdz for mechanical work can be rewritten as dW = PdV in 
the case of work done by a piston. Here P represents the pressure 
of the working gas, and V its volume. Thus, on a graph of P versus 
V, the area under the curve represents the work done. When the 
gas expands, dz is positive, and the gas does positive work. When 
the gas is being compressed, dx is negative, and the gas does neg- 
ative work, i.e., it absorbs energy. Notice how, in the diagram of 
the Carnot engine in the top panel of figure a, the cycle goes clock- 
wise around the curve, and therefore the part of the curve in which 
negative work is being done (arrowheads pointing to the left) are 
below the ones in which positive work is being done. This means 
that over all, the engine does a positive amount of work. This net 
work equals the area under the top part of the curve, minus the 
area under the bottom part of the curve, which is simply the area 
enclosed by the curve. Although the diagram for the Otto engine is 
more complicated, we can at least compare it on the same footing 
with the Carnot engine. The curve forms a figure-eight, because it 
cuts across itself. The top loop goes clockwise, so as in the case 
of the Carnot engine, it represents positive work. The bottom loop 
goes counterclockwise, so it represents a net negative contribution 
to the work. This is because more work is expended in forcing out 
the exhaust than is generated in the intake stroke. 


To make an engine as efficient as possible, we would like to make 
the loop have as much area as possible. What is it that determines 
the actual shape of the curve? First let’s consider the constant- 
temperature expansion stroke that forms the top of the Carnot en- 
gine’s P-V plot. This is analogous to the power stroke of an Otto 
engine. Heat is being sucked in from the hot reservoir, and since 
the working gas is always in thermal equilibrium with the hot reser- 
voir, its temperature is constant. Regardless of the type of gas, we 
therefore have PV = nkT with T held constant, and thus P x V7! 
is the mathematical shape of this curve — a y = 1/ax graph, which 
is a hyperbola. This is all true regardless of whether the working 
gas is monoatomic, diatomic, or polyatomic. (The bottom of the 
loop is likewise of the form P « V~!, but with a smaller constant 
of proportionality due to the lower temperature.) 


Now consider the insulated expansion stroke that forms the right 
side of the curve for the Carnot engine. As shown on page 321, 
the relationship between pressure and temperature in an insulated 


Carnot engine 


constant- 

IP temperature 
expansion 
(heat intake) 


insulated 
compression 


constant-temperature 


Otto cycle 


combustion 


Vv 


insulated 
expansion 


compression (heat outflow) 


a/P-V diagrams for a Carnot 


engine and an Otto engine. 
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b/ Example 23, 
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compression or expansion is T « P®, with b = 2/5, 2/7, or 1/4, 
respectively, for a monoatomic, diatomic, or polyatomic gas. For P 
as a function of V at constant T, the ideal gas law gives P x T/V, 
so P x V~7, where y = 1/(1 — 5) takes on the values 5/3, 7/5, and 
4/3. The number 7 can be interpreted as the ratio Cp/Cy, where 
Cp, the heat capacity at constant pressure, is the amount of heat 
required to raise the temperature of the gas by one degree while 
keeping its pressure constant, and Cy is the corresponding quantity 
under conditions of constant volume. 


The compression ratio example 22 
Operating along a constant-temperature stroke, the amount of 
mechanical work done by a heat engine can be calculated as 
follows: 


PV =nkT 


Setting c = nkT to simplify the writing, 


P=cV"' 
V; 
w= | Pdv 
V; 
Ve 
ei V-'dv 
V; 
=cinV;—clnV; 
= cln(V;/ Vj) 


The ratio V/V; is called the compression ratio of the engine, and 
higher values result in more power along this stroke. Along an 
insulated stroke, we have P « V~Y, with y # 1, so the result for 
the work no longer has this perfect mathematical property of de- 
pending only on the ratio V;/V;. Nevertheless, the compression 
ratio is still a good figure of merit for predicting the performance of 
any heat engine, including an internal combustion engine. High 
compression ratios tend to make the working gas of an internal 
combustion engine heat up so much that it spontaneously ex- 
plodes. When this happens in an Otto-cycle engine, it can cause 
ignition before the sparkplug fires, an undesirable effect known as 
pinging. For this reason, the compression ratio of an Otto-cycle 
automobile engine cannot normally exceed about 10. In a diesel 
engine, however, this effect is used intentionally, as an alternative 
to sparkplugs, and compression ratios can be 20 or more. 


Sound example 23 
Figure b shows a P-V plot for a sound wave. As the pressure 
oscillates up and down, the air is heated and cooled by its com- 
pression and expansion. Heat conduction is a relatively slow pro- 
cess, So typically there is not enough time over each cycle for any 
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significant amount of heat to flow from the hot areas to the cold 
areas. (This is analogous to insulated compression or expansion 
of a heat engine; in general, a compression or expansion of this 
type, with no transfer of heat, is called adiabatic.) The pressure 
and volume of a particular little piece of the air are therefore re- 
lated according to P « VY. The cycle of oscillation consists 
of motion back and forth along a single curve in the P-V plane, 
and since this curve encloses zero volume, no mechanical work 
is being done: the wave (under the assumed ideal conditions) 
propagates without any loss of energy due to friction. 


Measuring y using the “spring of air” example 24 
Figure c shows an experiment that can be used to measure the 
y of a gas. When the mass ™m is inserted into bottle’s neck, 
which has cross-sectional area A, the mass drops until it com- 
presses the air enough so that the pressure is enough to support 
its weight. The observed frequency w of oscillations about this 
equilibrium position yo can be used to extract the y of the gas. 


We make the bottle big enough so that its large surface-to-volume 
ratio prevents the conduction of any significant amount of heat 
through its walls during one cycle, so P « V-Y, and dP/dV = 
—yP/V. Thus, 


The Helmholtz resonator example 25 
When you blow over the top of a beer bottle, you produce a pure 
tone. As you drink more of the beer, the pitch goes down. This 
is similar to example 24, except that instead of a solid mass m 
sitting inside the neck of the bottle, the moving mass is the air 
itself. As air rushes in and out of the bottle, its velocity is highest 
at the bottleneck, and since kinetic energy is proportional to the 
square of the velocity, essentially all of the kinetic energy is that 
of the air that’s in the neck. In other words, we can replace m with 
ALp, where L is the length of the neck, and p is the density of the 
air. Substituting into the earlier result, we find that the resonant 
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d/The resonance curve of a 
1713 Stradivarius violin, mea- 
sured by Carleen Hutchins. There 
are a number of different reso- 
nance peaks, some strong and 
some weak; the ones near 200 
and 400 Hz are vibrations of the 
wood, but the one near 300 Hz 
is a resonance of the air moving 
in and out through those holes 
shaped like the letter F. The white 
lines show the frequencies of the 
four strings. 


frequency is 


This is known as a Helmholtz resonator. As shown in figure d, a 
violin or an acoustic guitar has a Helmholtz resonance, since air 
can move in and out through the f-holes. Problem 10 is a more 
quantitative exploration of this. 


4 
10 


10° 
energy 
of steady- 
state 3 
response 10 
(relative 
scale) 


1h 
10 


0 
10 


° ° So 8 OS fo} oS © So fo} 
5) + ey iS) re 1S) © o tno © 
NX N Nn mom + t+ nm © BS oO a 


1120 
1280 
1440 
1600 
1920 
2240 


frequency (Hz) 


We have already seen, based on the microscopic nature of en- 
tropy, that any Carnot engine has the same efficiency, and the ar- 
gument only employed the assumption that the engine met the def- 
inition of a Carnot cycle: two insulated strokes, and two constant- 
temperature strokes. Since we didn’t have to make any assumptions 
about the nature of the working gas being used, the result is evi- 
dently true for diatomic or polyatomic molecules, or for a gas that is 
not ideal. This result is surprisingly simple and general, and a little 
mysterious — it even applies to possibilities that we have not even 
considered, such as a Carnot engine designed so that the working 
“eas” actually consists of a mixture of liquid droplets and vapor, as 
in a steam engine. How can it always turn out so simple, given the 
kind of mathematical complications that were swept under the rug 
in example 22? A better way to understand this result is by switch- 
ing from P-V diagrams to a diagram of temperature versus entropy, 
as shown in figure e. An infinitesimal transfer of heat dQ gives rise 
to a change in entropy dS = dQ/T, so the area under the curve on 
a T-S plot gives the amount of heat transferred. The area under the 
top edge of the box in figure e, extending all the way down to the 
axis, represents the amount of heat absorbed from the hot reservoir, 
while the smaller area under the bottom edge represents the heat 
wasted into the cold reservoir. By conservation of energy, the area 
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enclosed by the box therefore represents the amount of mechanical 
work being done, as for a P-V diagram. We can now see why the 
efficiency of a Carnot engine is independent of any of the physical 
details: the definition of a Carnot engine guarantees that the T-S 
diagram will be a rectangular box, and the efficiency depends only 
on the relative heights of the top and bottom of the box. 


This chapter is summarized on page 896. Notation and terminology 
are tabulated on pages 885-886. 


Carnot engine 


constant- 
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expansion 
(heat intake) 
\ insulated 
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compression 
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compression (heat outflow) 
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e/A T-S diagram for a Carnot 
engine. 
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Problems 
The symbols V, “, etc. are explained on page 334. 


1 (a) Show that under conditions of standard pressure and tem- 
perature, the volume of a sample of an ideal gas depends only on 
the number of molecules in it. 

(b) One mole is defined as 6.0 x 107° atoms. Find the volume of one 
mole of an ideal gas, in units of liters, at standard temperature and 


pressure (0°C and 101 kPa). Vv 

2 A gas in a cylinder expands its volume by an amount dV, 
pushing out a piston. Show that the work done by the gas on the 
piston is given by dW = PdV. E 
3 (a) A helium atom contains 2 protons, 2 electrons, and 2 
neutrons. Find the mass of a helium atom. Vv 
(b) Find the number of atoms in 1.0 kg of helium. Vv 


(c) Helium gas is monoatomic. Find the amount of heat needed to 
raise the temperature of 1.0 kg of helium by 1.0 degree C. (This is 
known as helium’s heat capacity at constant volume.) Vv 


4 A sample of gas is enclosed in a sealed chamber. The gas 
consists of molecules, which are then split in half through some 
process such as exposure to ultraviolet light, or passing an electric 
spark through the gas. The gas returns to thermal equilibrium with 
the surrounding room. How does its pressure now compare with its 
pressure before the molecules were split? a 


5 Most of the atoms in the universe are in the form of gas that 
is not part of any star or galaxy: the intergalactic medium (IGM). 
The IGM consists of about 107° atoms per cubic centimeter, with 
a typical temperature of about 10° K. These are, in some sense, the 
density and temperature of the universe (not counting light, or the 
exotic particles known as “dark matter”). Calculate the pressure of 
the universe (or, speaking more carefully, the typical pressure due 
to the IGM). Vv o 


6 Estimate the pressure at the center of the Earth, assuming it 
is of constant density throughout. Note that g is not constant with 
respect to depth — as shown in example 19 on page 105, g equals 
Gmr/b? for r, the distance from the center, less than b, the earth’s 
radius. 

(a) State your result in terms of G, m, and b. Vv 
(b) Show that your answer from part a has the right units for pres- 
sure. 

(c) Evaluate the result numerically. Vv 
(d) Given that the earth’s atmosphere is on the order of one thou- 
sandth the earth’s radius, and that the density of the earth is several 
thousand times greater than the density of the lower atmosphere, 
check that your result is of a reasonable order of magnitude. & 
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7 (a) Determine the ratio between the escape velocities from the 
surfaces of the earth and the moon. Vv 
(b) The temperature during the lunar daytime gets up to about 
130°C. In the extremely thin (almost nonexistent) lunar atmosphere, 
estimate how the typical velocity of a molecule would compare with 
that of the same type of molecule in the earth’s atmosphere. As- 
sume that the earth’s atmosphere has a temperature of 0°C. v 
(c) Suppose you were to go to the moon and release some fluo- 
rocarbon gas, with molecular formula C,Fo2n+2. Estimate what is 
the smallest fluorocarbon molecule (lowest 1) whose typical velocity 
would be lower than that of an No molecule on earth in proportion 
to the moon’s lower escape velocity. The moon would be able to 
retain an atmosphere made of these molecules. Vv o 


8 Refrigerators, air conditioners, and heat pumps are heat en- 
gines that work in reverse. You put in mechanical work, and the 
effect is to take heat out of a cooler reservoir and deposit heat in a 
warmer one: Q, +W = Qzy. As with the heat engines discussed pre- 
viously, the efficiency is defined as the energy transfer you want (Qz 
for a refrigerator or air conditioner, Qy for a heat pump) divided 
by the energy transfer you pay for (W). 


Efficiencies are supposed to be unitless, but the efficiency of an 
air conditioner is normally given in terms of an EER rating (or 
a more complex version called an SEER). The EER is defined as 
Q,/W, but expressed in the barbaric units of of Btu/watt-hour. 
A typical EER rating for a residential air conditioner is about 10 
Btu/watt-hour, corresponding to an efficiency of about 3. The stan- 
dard temperatures used for testing an air conditioner’s efficiency are 
80°F (27°C) inside and 95°F’ (35°C) outside. 


(a) What would be the EER rating of a reversed Carnot engine used 
as an air conditioner? Vv 
(b) If you ran a 3-kW residential air conditioner, with an efficiency 
of 3, for one hour, what would be the effect on the total entropy 
of the universe? Is your answer consistent with the second law of 
thermodynamics? vo 


9 Even when resting, the human body needs to do a certain 
amount of mechanical work to keep the heart beating. This quan- 
tity is difficult to define and measure with high precision, and also 
depends on the individual and her level of activity, but it’s estimated 
to be about 1 to 5 watts. Suppose we consider the human body as 
nothing more than a pump. A person who is just lying in bed all 
day needs about 1000 kcal/day worth of food to stay alive. (a) Es- 
timate the person’s thermodynamic efficiency as a pump, and (b) 
compare with the maximum possible efficiency imposed by the laws 
of thermodynamics for a heat engine operating across the difference 
between a body temperature of 37°C and an ambient temperature 
of 22°C. (c) Interpret your answer. > Answer, p. 874 © 
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10 Example 25 on page 329 suggests analyzing the resonance 
of a violin at 300 Hz as a Helmholtz resonance. However, we might 
expect the equation for the frequency of a Helmholtz resonator to 
be a rather crude approximation here, since the f-holes are not long 
tubes, but slits cut through the face of the instrument, which is only 
about 2.5 mm thick. (a) Estimate the frequency that way anyway, 
for a violin with a volume of about 1.6 liters, and f-holes with a total 
area of 10 cm?. (b) A common rule of thumb is that at an open end 
of an air column, such as the neck of a real Helmholtz resonator, 
some air beyond the mouth also vibrates as if it was inside the 
tube, and that this effect can be taken into account by adding 0.4 
times the diameter of the tube for each open end (i.e., 0.8 times the 
diameter when both ends are open). Applying this to the violin’s 
f-holes results in a huge change in L, since the ~ 7 mm width of the 
f-hole is considerably greater than the thickness of the wood. Try 
it, and see if the result is a better approximation to the observed 
frequency of the resonance. > Answer, p. 874 L 


Key to symbols: 
easy © typical challenging Mdifficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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The vibrations of this electric bass string are converted to electrical vibra- 
tions, then to sound vibrations, and finally to vibrations of our eardrums. 


Chapter 6 
Waves 


Dandelion. Cello. Read those two words, and your brain instantly 
conjures a stream of associations, the most prominent of which have 
to do with vibrations. Our mental category of “dandelion-ness” is 
strongly linked to the color of light waves that vibrate about half a 
million billion times a second: yellow. The velvety throb of a cello 
has as its most obvious characteristic a relatively low musical pitch 
— the note you’re spontaneously imagining right now might be one 
whose sound vibrations repeat at a rate of a hundred times a second. 


Evolution seems to have designed our two most important senses 
around the assumption that our environment is made of waves, 
whereas up until now, we’ve mostly taken the view that Nature can 
be understood by breaking her down into smaller and smaller parts, 
ending up with particles as her most fundamental building blocks. 
Does that work for light and sound? Sound waves are disturbances 
in air, which is made of atoms, but light, on the other hand, isn’t a 
vibration of atoms. Light, unlike sound, can travel through a vac- 
uum: if you’re reading this by sunlight, you’re taking advantage of 
light that had to make it through millions of miles of vacuum to get 
to you. Waves, then, are not just a trick that vibrating atoms can 
do. Waves are one of the basic phenomena of the universe. At the 
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a ae 
— 


a/Your finger makes a de- 
pression in the surface of the 
water, 1. The wave patterns 
starts evolving, 2, after you 
remove your finger. 


end of this book, we’ll even see that the things we’ve been calling 
particles, such as electrons, are really waves!! 


6.1 Free Waves 
6.1.1 Wave motion 


Let’s start with an intuition-building exercise that deals with 
waves in matter, since they’re easier than light waves to get your 
hands on. Put your fingertip in the middle of a cup of water and 
then remove it suddenly. You’ll have noticed two results that are 
surprising to most people. First, the flat surface of the water does 
not simply sink uniformly to fill in the volume vacated by your 
finger. Instead, ripples spread out, and the process of flattening out 
occurs over a long period of time, during which the water at the 
center vibrates above and below the normal water level. This type 
of wave motion is the topic of the present section. Second, you’ve 
found that the ripples bounce off of the walls of the cup, in much 
the same way that a ball would bounce off of a wall. In the next 
section we discuss what happens to waves that have a boundary 
around them. Until then, we confine ourselves to wave phenomena 
that can be analyzed as if the medium (e.g., the water) was infinite 
and the same everywhere. 


It isn’t hard to understand why removing your fingertip creates 
ripples rather than simply allowing the water to sink back down 
uniformly. The initial crater, a/1, left behind by your finger has 
sloping sides, and the water next to the crater flows downhill to fill 
in the hole. The water far away, on the other hand, initially has 
no way of knowing what has happened, because there is no slope 
for it to flow down. As the hole fills up, the rising water at the 
center gains upward momentum, and overshoots, creating a little 
hill where there had been a hole originally. The area just outside of 
this region has been robbed of some of its water in order to build 
the hill, so a depressed “moat” is formed, a/2. This effect cascades 
outward, producing ripples. 


There are three main ways in which wave motion differs from 
the motion of objects made of matter. 


1. Superposition 


If you watched the water in the cup carefully, you noticed the 
ghostlike behavior of the reflected ripples coming back toward the 
center of the cup and the outgoing ripples that hadn’t yet been re- 
flected: they passed right through each other. This is the first, and 
the most profound, difference between wave motion and the mo- 


‘Speaking more carefully, I should say that every basic building block of light 
and matter has both wave and particle properties. 


336 Chapter6 Waves 


b/The two circular patterns of 
ripples pass through each other. 
Unlike material objects, wave pat- 
terns can overlap in space, and 
when this happens they combine 
by addition. 


tion of objects: waves do not display any repulsion of each other 
analogous to the normal forces between objects that come in con- 
tact. Two wave patterns can therefore overlap in the same region of 
space, as shown in figure b. Where the two waves coincide, they add 
together. For instance, suppose that at a certain location in at a cer- 
tain moment in time, each wave would have had a crest 3 cm above 
the normal water level. The waves combine at this point to make a 
6-cm crest. We use negative numbers to represent depressions in the 
water. If both waves would have had a troughs measuring —3 cm, 
then they combine to make an extra-deep —6 cm trough. A +3 cm 
crest and a —3 cm trough result in a height of zero, i.e., the waves 
momentarily cancel each other out at that point. This additive rule 
is referred to as the principle of superposition, “superposition” being 
merely a fancy word for “adding.” 


Superposition can occur not just with sinusoidal waves like the 
ones in the figure above but with waves of any shape. The figures 
on the following page show superposition of wave pulses. A pulse is 
simply a wave of very short duration. These pulses consist only of 
a single hump or trough. If you hit a clothesline sharply, you will 
observe pulses heading off in both directions. This is analogous to 
the way ripples spread out in all directions when you make a distur- 
bance at one point on water. The same occurs when the hammer 
on a piano comes up and hits a string. 


Experiments to date have not shown any deviation from the 
principle of superposition in the case of light waves. For other types 
of waves, it is typically a very good approximation for low-energy 
waves. 
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d/As the wave pulse goes 
by, the ribbon tied to the spring 
is not carried along. The motion 
of the wave pattern is to the 
right, but the medium (spring) is 
moving from side to side, not to 
the right. (PSSC Physics) 


c/As the wave pattern passes the rubber duck, the duck stays 
put. The water isn’t moving with the wave. 


2. The medium is not transported with the wave. 


The sequence of three photos in figure c shows a series of water 
waves before it has reached a rubber duck (left), having just passed 
the duck (middle) and having progressed about a meter beyond the 
duck (right). The duck bobs around its initial position, but is not 
carried along with the wave. This shows that the water itself does 
not flow outward with the wave. If it did, we could empty one end of 
a swimming pool simply by kicking up waves! We must distinguish 
between the motion of the medium (water in this case) and the 
motion of the wave pattern through the medium. The medium 
vibrates; the wave progresses through space. 


self-check A 

In figure d, you can detect the side-to-side motion of the spring because 
the spring appears blurry. At a certain instant, represented by a single 
photo, how would you describe the motion of the different parts of the 
spring? Other than the flat parts, do any parts of the spring have zero 
velocity? > Answer, p. 869 


“A worm example 1 
The worm in the figure is moving to the right. The wave pattern, 
a pulse consisting of a compressed area of its body, moves to 
the left. In other words, the motion of the wave pattern is in the 
opposite direction compared to the motion of the medium. 
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The incorrect belief that the medium moves with the wave is often 
reinforced by garbled secondhand knowledge of surfing. Anyone 
who has actually surfed knows that the front of the board pushes 
the water to the sides, creating a wake — the surfer can even 
drag his hand through the water, as in in figure e. If the water was 
moving along with the wave and the surfer, this wouldn’t happen. 
The surfer is carried forward because forward is downhill, not be- 
cause of any forward flow of the water. If the water was flowing 
forward, then a person floating in the water up to her neck would 
be carried along just as quickly as someone on a surfboard. In 
fact, it is even possible to surf down the back side of a wave, al- 
though the ride wouldn’t last very long because the surfer and the 
wave would quickly part company. 


3. A wave’s velocity depends on the medium. 


A material object can move with any velocity, and can be sped 
up or slowed down by a force that increases or decreases its kinetic 
energy. Not so with waves. The speed of a wave, depends on the 
properties of the medium (and perhaps also on the shape of the 
wave, for certain types of waves). Sound waves travel at about 340 
m/s in air, 1000 m/s in helium. If you kick up water waves in a 
pool, you will find that kicking harder makes waves that are taller 
(and therefore carry more energy), not faster. The sound waves 
from an exploding stick of dynamite carry a lot of energy, but are 
no faster than any other waves. In the following section we will give 
an example of the physical relationship between the wave speed and 
the properties of the medium. 


‘Breaking waves example 3 
The velocity of water waves increases with depth. The crest of a 
wave travels faster than the trough, and this can cause the wave 
to break. 


Once a wave is created, the only reason its speed will change is 
if it enters a different medium or if the properties of the medium 
change. It is not so surprising that a change in medium can slow 
down a wave, but the reverse can also happen. A sound wave trav- 
eling through a helium balloon will slow down when it emerges into 
the air, but if it enters another balloon it will speed back up again! 
Similarly, water waves travel more quickly over deeper water, so a 
wave will slow down as it passes over an underwater ridge, but speed 
up again as it emerges into deeper water. 


‘Hull speed example 4 
The speeds of most boats, and of some surface-swimming ani- 
mals, are limited by the fact that they make a wave due to their 
motion through the water. The boat in figure g is going at the 
same speed as its own waves, and can’t go any faster. No mat- 
ter how hard the boat pushes against the water, it can’t make 
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e/Example 2. The surfer is 
dragging his hand in the water. 


f/Example 3: 
wave. 


a_ breaking 


The boat has 


g/Example 4. 
run up against a limit on its speed 
because it can’t climb over its 
own wave. Dolphins get around 
the problem by leaping out of the 
water. 
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h/ Circular and _— linear 
patterns. 


ia 


i/Plane and_ spherical 
patterns. 


wave 


wave 


the wave move ahead faster and get out of the way. The wave’s 
speed depends only on the medium. Adding energy to the wave 
doesn’t speed it up, it just increases its amplitude. 


A water wave, unlike many other types of wave, has a speed that 
depends on its shape: a broader wave moves faster. The shape 
of the wave made by a boat tends to mold itself to the shape of 
the boat’s hull, so a boat with a longer hull makes a broader wave 
that moves faster. The maximum speed of a boat whose speed is 
limited by this effect is therefore closely related to the length of its 
hull, and the maximum speed is called the hull speed. Sailboats 
designed for racing are not just long and skinny to make them 
more streamlined — they are also long so that their hull speeds 
will be high. 


Wave patterns 


If the magnitude of a wave’s velocity vector is preordained, what 
about its direction? Waves spread out in all directions from every 
point on the disturbance that created them. If the disturbance is 
small, we may consider it as a single point, and in the case of water 
waves the resulting wave pattern is the familiar circular ripple, h/1. 
If, on the other hand, we lay a pole on the surface of the water 
and wiggle it up and down, we create a linear wave pattern, h/2. 
For a three-dimensional wave such as a sound wave, the analogous 
patterns would be spherical waves and plane waves, i. 


Infinitely many patterns are possible, but linear or plane waves 
are often the simplest to analyze, because the velocity vector is in 
the same direction no matter what part of the wave we look at. Since 
all the velocity vectors are parallel to one another, the problem is 
effectively one-dimensional. Throughout this chapter and the next, 
we will restrict ourselves mainly to wave motion in one dimension, 
while not hesitating to broaden our horizons when it can be done 
without too much complication. 
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Discussion Questions 


A __ The left panel of the figure shows a sequence of snapshots of two 
positive pulses on a coil spring as they move through each other. In the 
right panel, which shows a positive pulse and a negative one, the fifth 
frame has the spring just about perfectly flat. If the two pulses have es- 
sentially canceled each other out perfectly, then why does the motion pick 
up again? Why doesn’t the spring just stay flat? 
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j / Discussion question A. 


B Sketch two positive wave pulses on a string that are overlapping but 
not right on top of each other, and draw their superposition. Do the same 
for a positive pulse running into a negative pulse. 


CA traveling wave pulse is moving to the right on a string. Sketch the 
velocity vectors of the various parts of the string. Now do the same for a 
pulse moving to the left. 


D In a spherical sound wave spreading out from a point, how would 
the energy of the wave fall off with distance? 
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k/ Hitting a key on a _ piano 
causes a hammer to come up 
from underneath and hit a string 
(actually a set of three). The 
result is a pair of pulses moving 
away from the point of impact. 


1/A pulse on a string splits 
in two and heads off in both 
directions. 


a 
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m/Modeling a_ string as a 
series of masses connected by 
springs. 


6.1.2 Waves on a string 


So far you’ve learned some counterintuitive things about the 
behavior of waves, but intuition can be trained. The first half of 
this subsection aims to build your intuition by investigating a simple, 
one-dimensional type of wave: a wave on a string. If you have ever 
stretched a string between the bottoms of two open-mouthed cans to 
talk to a friend, you were putting this type of wave to work. Stringed 
instruments are another good example. Although we usually think 
of a piano wire simply as vibrating, the hammer actually strikes 
it quickly and makes a dent in it, which then ripples out in both 
directions. Since this chapter is about free waves, not bounded ones, 
we pretend that our string is infinitely long. 


After the qualitative discussion, we will use simple approxima- 
tions to investigate the speed of a wave pulse on a string. This quick 
and dirty treatment is then followed by a rigorous attack using the 
methods of calculus, which turns out to be both simpler and more 
general. 


Intuitive ideas 


Consider a string that has been struck, 1/1, resulting in the cre- 
ation of two wave pulses, 1/2, one traveling to the left and one to 
the right. This is analogous to the way ripples spread out in all 
directions from a splash in water, but on a one-dimensional string, 
“all directions” becomes “both directions.” 


We can gain insight by modeling the string as a series of masses 
connected by springs, m. (In the actual string the mass and the 
springiness are both contributed by the molecules themselves.) If 
we look at various microscopic portions of the string, there will be 
some areas that are flat, 1, some that are sloping but not curved, 2, 
and some that are curved, 3 and 4. In example 1 it is clear that both 
the forces on the central mass cancel out, so it will not accelerate. 
The same is true of 2, however. Only in curved regions such as 3 
and 4 is an acceleration produced. In these examples, the vector 
sum of the two forces acting on the central mass is not zero. The 
important concept is that curvature makes force: the curved areas of 
a wave tend to experience forces resulting in an acceleration toward 
the mouth of the curve. Note, however, that an uncurved portion 
of the string need not remain motionless. It may move at constant 
velocity to either side. 


Approximate treatment 


We now carry out an approximate treatment of the speed at 
which two pulses will spread out from an initial indentation on a 
string. For simplicity, we imagine a hammer blow that creates a tri- 
angular dent, n/1. We will estimate the amount of time, t, required 
until each of the pulses has traveled a distance equal to the width 
of the pulse itself. The velocity of the pulses is then +w/t. 
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As always, the velocity of a wave depends on the properties of 
the medium, in this case the string. The properties of the string can 
be summarized by two variables: the tension, 7, and the mass per 
unit length, j (Greek letter mu). 


If we consider the part of the string encompassed by the initial 
dent as a single object, then this object has a mass of approxi- 
mately ww (mass/length x length=mass). (Here, and throughout 
the derivation, we assume that h is much less than w, so that we can 
ignore the fact that this segment of the string has a length slightly 
greater than w.) Although the downward acceleration of this seg- 
ment of the string will be neither constant over time nor uniform 
across the pulse, we will pretend that it is constant for the sake of 
our simple estimate. Roughly speaking, the time interval between 
n/1 and n/2 is the amount of time required for the initial dent to ac- 
celerate from rest and reach its normal, flattened position. Of course 
the tip of the triangle has a longer distance to travel than the edges, 
but again we ignore the complications and simply assume that the 
segment as a whole must travel a distance h. Indeed, it might seem 
surprising that the triangle would so neatly spring back to a per- 
fectly flat shape. It is an experimental fact that it does, but our 
analysis is too crude to address such details. 


The string is kinked, i.e., tightly curved, at the edges of the 
triangle, so it is here that there will be large forces that do not 
cancel out to zero. There are two forces acting on the triangular 
hump, one of magnitude T acting down and to the right, and one 
of the same magnitude acting down and to the left. If the angle 
of the sloping sides is 0, then the total force on the segment equals 
2T sin @. Dividing the triangle into two right triangles, we see that 
sin 6 equals h divided by the length of one of the sloping sides. Since 
h is much less than w, the length of the sloping side is essentially 
the same as w/2, so we have sin = 2h/w, and F = 4Th/w. The 
acceleration of the segment (actually the acceleration of its center 
of mass) is 


The time required to move a distance h under constant acceleration 
a is found by solving h = (1/2)at? to yield 


t= /2h/a 


= W nod 
2T 
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n/A triangular pulse spreads out. 
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Our final result for the speed of the pulses is 


e=wit 


The remarkable feature of this result is that the velocity of the 
pulses does not depend at all on w or h, i.e., any triangular pulse 
has the same speed. It is an experimental fact (and we will also 
prove rigorously below) that any pulse of any kind, triangular or 
otherwise, travels along the string at the same speed. Of course, 
after so many approximations we cannot expect to have gotten all 
the numerical factors right. The correct result for the speed of the 


pulses is 
T 
v= 4/— 
m 


The importance of the above derivation lies in the insight it 
brings —that all pulses move with the same speed — rather than in 
the details of the numerical result. The reason for our too-high value 
for the velocity is not hard to guess. It comes from the assumption 
that the acceleration was constant, when actually the total force on 
the segment would diminish as it flattened out. 


Treatment using calculus 


After expending considerable effort for an approximate solution, 
we now display the power of calculus with a rigorous and completely 
general treatment that is nevertheless much shorter and easier. Let 
the flat position of the string define the x axis, so that y measures 
how far a point on the string is from equilibrium. The motion of 
the string is characterized by y(zx,t), a function of two variables. 
Knowing that the force on any small segment of string depends 
on the curvature of the string in that area, and that the second 
derivative is a measure of curvature, it is not surprising to find that 
the infinitesimal force dF’ acting on an infinitesimal segment dz is 
given by : 

O 
dF=T apie 
(This can be proved by vector addition of the two infinitesimal forces 
acting on either side.) The symbol 0 stands for a partial derivative, 
e.g., O/Ox means a derivative with respect to x that is evaluated 
while treating t as a constant. The acceleration is then a = dF'/dm, 
or, substituting dm = pda, 


Oy Se Oy 

at? u Ox? 
The second derivative with respect to time is related to the second 
derivative with respect to position. This is no more than a fancy 
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mathematical statement of the intuitive fact developed above, that 
the string accelerates so as to flatten out its curves. 


Before even bothering to look for solutions to this equation, we 
note that it already proves the principle of superposition, because 
the derivative of a sum is the sum of the derivatives. Therefore the 
sum of any two solutions will also be a solution. 


Based on experiment, we expect that this equation will be sat- 
isfied by any function y(z,t) that describes a pulse or wave pattern 
moving to the left or right at the correct speed v. In general, such 
a function will be of the form y = f(x — vt) or y = f(x + vt), where 
f is any function of one variable. Because of the chain rule, each 
derivative with respect to time brings out a factor of v. Evaluating 
the second derivatives on both sides of the equation gives 


Squaring gets rid of the sign, and we find that we have a valid 
solution for any function f, provided that v is given by 


T 
v= 4/— 
[Lb 


6.1.3 Sound and light waves 
Sound waves 


The phenomenon of sound is easily found to have all the char- 
acteristics we expect from a wave phenomenon: 


- Sound waves obey superposition. Sounds do not knock other 
sounds out of the way when they collide, and we can hear more than 
one sound at once if they both reach our ear simultaneously. 


- The medium does not move with the sound. Even standing in 
front of a titanic speaker playing earsplitting music, we do not feel 
the slightest breeze. 


- The velocity of sound depends on the medium. Sound travels 
faster in helium than in air, and faster in water than in helium. 
Putting more energy into the wave makes it more intense, not faster. 
For example, you can easily detect an echo when you clap your hands 
a short distance from a large, flat wall, and the delay of the echo is 
no shorter for a louder clap. 


Although not all waves have a speed that is independent of the 
shape of the wave, and this property therefore is irrelevant to our 
collection of evidence that sound is a wave phenomenon, sound does 
nevertheless have this property. For instance, the music in a large 
concert hall or stadium may take on the order of a second to reach 
someone seated in the nosebleed section, but we do not notice or 
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care, because the delay is the same for every sound. Bass, drums, 
and vocals all head outward from the stage at 340 m/s, regardless 
of their differing wave shapes. 


If sound has all the properties we expect from a wave, then what 
type of wave is it? It is a series of compressions and expansions of the 
air. Even for a very loud sound, the increase or decrease compared 
to normal atmospheric pressure is no more than a part per million, 
so our ears are apparently very sensitive instruments. In a vacuum, 
there is no medium for the sound waves, and so they cannot exist. 
The roars and whooshes of space ships in Hollywood movies are fun, 
but scientifically wrong. 


Light waves 


Entirely similar observations lead us to believe that light is a 
wave, although the concept of light as a wave had a long and tortu- 
ous history. It is interesting to note that Isaac Newton very influen- 
tially advocated a contrary idea about light. The belief that matter 
was made of atoms was stylish at the time among radical thinkers 
(although there was no experimental evidence for their existence), 
and it seemed logical to Newton that light as well should be made of 
tiny particles, which he called corpuscles (Latin for “small objects” ). 
Newton’s triumphs in the science of mechanics, i.e., the study of 
matter, brought him such great prestige that nobody bothered to 
question his incorrect theory of light for 150 years. One persua- 
sive proof that light is a wave is that according to Newton’s theory, 
two intersecting beams of light should experience at least some dis- 
ruption because of collisions between their corpuscles. Even if the 
corpuscles were extremely small, and collisions therefore very infre- 
quent, at least some dimming should have been measurable. In fact, 
very delicate experiments have shown that there is no dimming. 


The wave theory of light was entirely successful up until the 20th 
century, when it was discovered that not all the phenomena of light 
could be explained with a pure wave theory. It is now believed that 
both light and matter are made out of tiny chunks which have both 
wave and particle properties. For now, we will content ourselves 
with the wave theory of light, which is capable of explaining a great 
many things, from cameras to rainbows. 


If light is a wave, what is waving? What is the medium that 
wiggles when a light wave goes by? It isn’t air. A vacuum is impen- 
etrable to sound, but light from the stars travels happily through 
zillions of miles of empty space. Light bulbs have no air inside them, 
but that doesn’t prevent the light waves from leaving the filament. 
For a long time, physicists assumed that there must be a mysteri- 
ous medium for light waves, and they called it the ether (not to be 
confused with the chemical). Supposedly the ether existed every- 
where in space, and was immune to vacuum pumps. The details of 
the story are more fittingly reserved for later in this course, but the 
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end result was that a long series of experiments failed to detect any 
evidence for the ether, and it is no longer believed to exist. Instead, 
light can be explained as a wave pattern made up of electrical and 
magnetic fields. 


6.1.4 Periodic waves 
Period and frequency of a periodic wave 


You choose a radio station by selecting a certain frequency. We 
have already defined period and frequency for vibrations, 


T = period = seconds per cycle 
f = frequency = 1/T = cycles per second 


w = angular frequency = 27 f = radians per second 


but what do they signify in the case of a wave? We can recycle our 
previous definition simply by stating it in terms of the vibrations 
that the wave causes as it passes a receiving instrument at a certain 
point in space. For a sound wave, this receiver could be an eardrum 
or a microphone. If the vibrations of the eardrum repeat themselves 
over and over, i.e., are periodic, then we describe the sound wave 
that caused them as periodic. Likewise we can define the period 
and frequency of a wave in terms of the period and frequency of 
the vibrations it causes. As another example, a periodic water wave 
would be one that caused a rubber duck to bob in a periodic manner 
as they passed by it. 


The period of a sound wave correlates with our sensory impres- 
sion of musical pitch. A high frequency (short period) is a high note. 
The sounds that really define the musical notes of a song are only 
the ones that are periodic. It is not possible to sing a nonperiodic 
sound like “sh” with a definite pitch. 


The frequency of a light wave corresponds to color. Violet is the 
high-frequency end of the rainbow, red the low-frequency end. A 
color like brown that does not occur in a rainbow is not a periodic 
light wave. Many phenomena that we do not normally think of as 
light are actually just forms of light that are invisible because they 
fall outside the range of frequencies our eyes can detect. Beyond the 
red end of the visible rainbow, there are infrared and radio waves. 
Past the violet end, we have ultraviolet, x-rays, and gamma rays. 


Graphs of waves as a function of position 


Some waves, light sound waves, are easy to study by placing a 
detector at a certain location in space and studying the motion as 
a function of time. The result is a graph whose horizontal axis is 
time. With a water wave, on the other hand, it is simpler just to 
look at the wave directly. This visual snapshot amounts to a graph 
of the height of the water wave as a function of position. Any wave 
can be represented in either way. 
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o/A graph of pressure ver- 
sus time for a periodic sound 
wave, the vowel “ah.” 


p/A_ similar graph for a non- 
periodic wave, “sh.” 


q/A strip chart recorder. 


r/A water wave profile cre- 
ated by a series of repeating 
pulses. 


Free Waves 347 


s/Wavelengths of linear and 
circular waves. 


An easy way to visualize this is in terms of a strip chart recorder, 
an obsolescing device consisting of a pen that wiggles back and forth 
as a roll of paper is fed under it. It can be used to record a per- 
son’s electrocardiogram, or seismic waves too small to be felt as a 
noticeable earthquake but detectable by a seismometer. Taking the 
seismometer as an example, the chart is essentially a record of the 
ground’s wave motion as a function of time, but if the paper was set 
to feed at the same velocity as the motion of an earthquake wave, it 
would also be a full-scale representation of the profile of the actual 
wave pattern itself. Assuming, as is usually the case, that the wave 
velocity is a constant number regardless of the wave’s shape, know- 
ing the wave motion as a function of time is equivalent to knowing 
it as a function of position. 


Wavelength 


Any wave that is periodic will also display a repeating pattern 
when graphed as a function of position. The distance spanned by 
one repetition is referred to as one wavelength. The usual notation 
for wavelength is A, the Greek letter lambda. Wavelength is to space 
as period is to time. 


Wave velocity related to frequency and wavelength 


Suppose that we create a repetitive disturbance by kicking the 
surface of a swimming pool. We are essentially making a series of 
wave pulses. The wavelength is simply the distance a pulse is able to 
travel before we make the next pulse. The distance between pulses 
is A, and the time between pulses is the period, T’, so the speed of 
the wave is the distance divided by the time, 


OS ALE 


This important and useful relationship is more commonly written 
in terms of the frequency, 


v=fr 


‘Wavelength of radio waves example 5 
> The speed of light is 3.0 x 108 m/s. What is the wavelength of 
the radio waves emitted by KLON, a station whose frequency is 
88.1 MHz? 


> Solving for wavelength, we have 
A=v/f 


= (3.0 x 108 m/s) /(88.1 x 10° s~') 
=3.4m 
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The size of a radio antenna is closely related to the wavelength 
of the waves it is intended to receive. The match need not be exact 
(since after all one antenna can receive more than one wavelength!), 
but the ordinary “whip” antenna such as a car’s is 1/4 of a wave- 
length. An antenna optimized to receive KLON’s signal (which is 
the only one my car radio is ever tuned to) would have a length of 
(3.4 m)/4 = 0.85 m. 


The equation v = fA defines a fixed relationship between any two 
of the variables if the other is held fixed. The speed of radio waves 
in air is almost exactly the same for all wavelengths and frequencies 
(it is exactly the same if they are in a vacuum), so there is a fixed 
relationship between their frequency and wavelength. Thus we can 
say either “Are we on the same wavelength?” or “Are we on the 
same frequency?” 


A different example is the behavior of a wave that travels from 
a region where the medium has one set of properties to an area 
where the medium behaves differently. The frequency is now fixed, 
because otherwise the two portions of the wave would otherwise 
get out of step, causing a kink or discontinuity at the boundary, 
which would be unphysical. (A more careful argument is that a 
kink or discontinuity would have infinite curvature, and waves tend 
to flatten out their curvature. An infinite curvature would flatten 
out infinitely fast, i-e., it could never occur in the first place.) Since 
the frequency must stay the same, any change in the velocity that 
results from the new medium must cause a change in wavelength. 


The velocity of water waves depends on the depth of the water, 
so based on \ = v/f, we see that water waves that move into a 
region of different depth must change their wavelength, as shown in 
figure u. This effect can be observed when ocean waves come up to 
the shore. If the deceleration of the wave pattern is sudden enough, 
the tip of the wave can curl over, resulting in a breaking wave. 


A note on dispersive waves 


The discussion of wave velocity given here is actually a little bit 
of an oversimplification for a wave whose velocity depends on its 
frequency and wavelength. Such a wave is called a dispersive wave. 
Nearly all the waves we deal with in this course are nondispersive, 
but the issue becomes important in chapter 13, where it is discussed 
in detail. 


Sinusoidal waves 


Sinusoidal waves are the most important special case of periodic 
waves. In fact, many scientists and engineers would be uncomfort- 
able with defining a waveform like the “ah” vowel sound as having 
a definite frequency and wavelength, because they consider only 
sine waves to be pure examples of a certain frequency and wave- 
lengths. Their bias is not unreasonable, since the French mathe- 
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t/ Ultrasound, i.e., sound with 
frequencies higher than the range 
of human hearing, was used to 
make this image of a fetus. The 
resolution of the image is related 
to the wavelength, since details 
smaller than about one wave- 
length cannot be resolved. High 
resolution therefore requires a 
short wavelength, corresponding 
to a high frequency. 


u/A water wave 
into a region with different depth 
will change its wavelength. 


traveling 
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w/The pattern of waves made 
by a point source moving to the 
right across the water. Note 
the shorter wavelength of the 
forward-emitted waves and 
the longer wavelength of the 
backward-going ones. 


matician Fourier showed that any periodic wave with frequency f 
can be constructed as a superposition of sine waves with frequencies 
f, 2f, 3f, ... In this sense, sine waves are the basic, pure building 
blocks of all waves. (Fourier’s result so surprised the mathematical 
community of France that he was ridiculed the first time he publicly 
presented his theorem.) 


However, what definition to use is really a matter of convenience. 
Our sense of hearing perceives any two sounds having the same 
period as possessing the same pitch, regardless of whether they are 
sine waves or not. This is undoubtedly because our ear-brain system 
evolved to be able to interpret human speech and animal noises, 
which are periodic but not sinusoidal. Our eyes, on the other hand, 
judge a color as pure (belonging to the rainbow set of colors) only 
if it is a sine wave. 


Discussion Questions 


A Suppose we superimpose two sine waves with equal amplitudes 
but slightly different frequencies, as shown in the figure. What will the 
superposition look like? What would this sound like if they were sound 
waves? 
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v / Discussion question A. 


6.1.5 The Doppler effect 


Figure w shows the wave pattern made by the tip of a vibrating 
rod which is moving across the water. If the rod had been vibrating 
in one place, we would have seen the familiar pattern of concentric 
circles, all centered on the same point. But since the source of 
the waves is moving, the wavelength is shortened on one side and 
lengthened on the other. This is known as the Doppler effect. 


Note that the velocity of the waves is a fixed property of the 
medium, so for example the forward-going waves do not get an extra 
boost in speed as would a material object like a bullet being shot 
forward from an airplane. 


We can also infer a change in frequency. Since the velocity is 
constant, the equation v = fX tells us that the change in wave- 
length must be matched by an opposite change in frequency: higher 
frequency for the waves emitted forward, and lower for the ones 
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emitted backward. The frequency Doppler effect is the reason for 
the familiar dropping-pitch sound of a race car going by. As the car 
approaches us, we hear a higher pitch, but after it passes us we hear 
a frequency that is lower than normal. 


The Doppler effect will also occur if the observer is moving but 
the source is stationary. For instance, an observer moving toward a 
stationary source will perceive one crest of the wave, and will then be 
surrounded by the next crest sooner than she otherwise would have, 
because she has moved toward it and hastened her encounter with 
it. Roughly speaking, the Doppler effect depends only the relative 
motion of the source and the observer, not on their absolute state 
of motion (which is not a well-defined notion in physics) or on their 
velocity relative to the medium. 


Restricting ourselves to the case of a moving source, and to waves 
emitted either directly along or directly against the direction of mo- 
tion, we can easily calculate the wavelength, or equivalently the 
frequency, of the Doppler-shifted waves. Let u be the velocity of the 
source. The wavelength of the forward-emitted waves is shortened 
by an amount uZ’ equal to the distance traveled by the source over 
the course of one period. Using the definition f = 1/T and the equa- 
tion v = fA, we find for the wavelength ’ of the Doppler-shifted 
wave the equation 

V=(1-=) 


v 
A similar equation can be used for the backward-emitted waves, but 
with a plus sign rather than a minus sign. 


Doppler-shifted sound from a race car example 6 
> If a race car moves at a velocity of 50 m/s, and the velocity of 
sound is 340 m/s, by what percentage are the wavelength and 
frequency of its sound waves shifted for an observer lying along 
its line of motion? 


> For an observer whom the car is approaching, we find 
fae: 4 
Vv 


so the shift in wavelength is 15%. Since the frequency is inversely 
proportional to the wavelength for a fixed value of the speed of 
sound, the frequency is shifted upward by 


1/0.85 = 1.18, 


i.e., a change of 18%. (For velocities that are small compared 
to the wave velocities, the Doppler shifts of the wavelength and 
frequency are about the same.) 


Doppler shift of the light emitted by a race car example 7 
> What is the percent shift in the wavelength of the light waves 
emitted by a race car’s headlights? 
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x/The galaxy M100 in the 
constellation Coma _ Berenices. 
Under higher magnification, the 
milky clouds reveal themselves to 
be composed of trillions of stars. 


y/The telescope at Mount 
Wilson used by Hubble. 


> Looking up the speed of light in the back of the book, v = 3.0 x 
108 m/s, we find 


= <= 0.99999983__, 


i.e., the percentage shift is only 0.000017%. 


The second example shows that under ordinary earthbound cir- 
cumstances, Doppler shifts of light are negligible because ordinary 
things go so much slower than the speed of light. It’s a different 
story, however, when it comes to stars and galaxies, and this leads 
us to a story that has profound implications for our understanding 
of the origin of the universe. 


The Big Bang 


As soon as astronomers began looking at the sky through tele- 
scopes, they began noticing certain objects that looked like clouds 
in deep space. The fact that they looked the same night after night 
meant that they were beyond the earth’s atmosphere. Not know- 
ing what they really were, but wanting to sound official, they called 
them “nebulae,” a Latin word meaning “clouds” but sounding more 
impressive. In the early 20th century, astronomers realized that al- 
though some really were clouds of gas (e.g., the middle “star” of 
Orion’s sword, which is visibly fuzzy even to the naked eye when 
conditions are good), others were what we now call galaxies: virtual 
island universes consisting of trillions of stars (for example the An- 
dromeda Galaxy, which is visible as a fuzzy patch through binoc- 
ulars). Three hundred years after Galileo had resolved the Milky 
Way into individual stars through his telescope, astronomers real- 
ized that the universe is made of galaxies of stars, and the Milky 
Way is simply the visible part of the flat disk of our own galaxy, 
seen from inside. 


This opened up the scientific study of cosmology, the structure 
and history of the universe as a whole, a field that had not been 
seriously attacked since the days of Newton. Newton had realized 
that if gravity was always attractive, never repulsive, the universe 
would have a tendency to collapse. His solution to the problem was 
to posit a universe that was infinite and uniformly populated with 
matter, so that it would have no geometrical center. The gravita- 
tional forces in such a universe would always tend to cancel out by 
symmetry, so there would be no collapse. By the 20th century, the 
belief in an unchanging and infinite universe had become conven- 
tional wisdom in science, partly as a reaction against the time that 
had been wasted trying to find explanations of ancient geological 
phenomena based on catastrophes suggested by biblical events like 
Noah’s flood. 


In the 1920’s astronomer Edwin Hubble began studying the 
Doppler shifts of the light emitted by galaxies. A former college 
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football player with a serious nicotine addiction, Hubble did not 
set out to change our image of the beginning of the universe. His 
autobiography seldom even mentions the cosmological discovery for 
which he is now remembered. When astronomers began to study the 
Doppler shifts of galaxies, they expected that each galaxy’s direction 
and velocity of motion would be essentially random. Some would be 
approaching us, and their light would therefore be Doppler-shifted 
to the blue end of the spectrum, while an equal number would be 
expected to have red shifts. What Hubble discovered instead was 
that except for a few very nearby ones, all the galaxies had red 
shifts, indicating that they were receding from us at a hefty frac- 
tion of the speed of light. Not only that, but the ones farther away 
were receding more quickly. The speeds were directly proportional 
to their distance from us. 


Did this mean that the earth (or at least our galaxy) was the 
center of the universe? No, because Doppler shifts of light only 
depend on the relative motion of the source and the observer. If 
we see a distant galaxy moving away from us at 10% of the speed 
of light, we can be assured that the astronomers who live in that 
galaxy will see ours receding from them at the same speed in the 
opposite direction. The whole universe can be envisioned as a rising 
loaf of raisin bread. As the bread expands, there is more and more 
space between the raisins. The farther apart two raisins are, the 
greater the speed with which they move apart. 


The universe’s expansion is presumably decelerating because of 
gravitational attraction among the galaxies. We do not presently 
know whether there is enough mass in the universe to cause enough 
attraction to halt the expansion eventually. But perhaps more in- 
teresting than the distant future of the universe is what its present 
expansion implies about its past. Extrapolating backward in time 
using the known laws of physics, the universe must have been denser 
and denser at earlier and earlier times. At some point, it must have 
been extremely dense and hot, and we can even detect the radia- 
tion from this early fireball, in the form of microwave radiation that 
permeates space. The phrase Big Bang was originally coined by the 
doubters of the theory to make it sound ridiculous, but it stuck, 
and today essentially all astronomers accept the Big Bang theory 
based on the very direct evidence of the red shifts and the cosmic 
microwave background radiation. 


Finally it should be noted what the Big Bang theory is not. It is 
not an explanation of why the universe exists. Such questions belong 
to the realm of religion, not science. Science can find ever simpler 
and ever more fundamental explanations for a variety of phenom- 
ena, but ultimately science takes the universe as it is according to 
observations. 


Furthermore, there is an unfortunate tendency, even among many 
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z/How do astronomers know 
what mixture of wavelengths a 
star emitted originally, so that 
they can tell how much the 
Doppler shift was? This image 
(obtained by the author with 
equipment costing about $5, and 
no telescope) shows the mixture 
of colors emitted by the star 
Sirius. (If you have the book in 
black and white, blue is on the left 
and red on the right.) The star 
appears white or bluish-white to 
the eye, but any light looks white 
if it contains roughly an equal 
mixture of the rainbow colors, 
i.e., of all the pure sinusoidal 
waves with wavelengths lying in 
the visible range. Note the black 
“gap teeth.” These are the fin- 
gerprint of hydrogen in the outer 
atmosphere of Sirius. These 
wavelengths are selectively ab- 
sorbed by hydrogen. Sirius is in 
our own galaxy, but similar stars 
in other galaxies would have 
the whole pattern shifted toward 
the red end, indicating they are 
moving away from us. 
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aa/Shock waves are cre- 
ated by the X-15 rocket plane, 
flying at 3.5 times the speed of 
sound. 


ab/ This fighter jet has just 
accelerated past the speed of 


sound. The sudden decom- 
pression of the air causes water 
droplets to condense, forming a 
cloud. 


scientists, to speak of the Big Bang theory was a description of the 
very first event in the universe, which caused everything after it. 
Although it is true that time may have had a beginning (Einstein’s 
theory of general relativity admits such a possibility), the methods 
of science can only work within a certain range of conditions such 
as temperature and density. Beyond a temperature of about 10° K, 
the random thermal motion of subatomic particles becomes so rapid 
that its velocity is comparable to the speed of light. Early enough in 
the history of the universe, when these temperatures existed, New- 
tonian physics becomes less accurate, and we must describe nature 
using the more general description given by Einstein’s theory of 
relativity, which encompasses Newtonian physics as a special case. 
At even higher temperatures, beyond about 10** degrees, physicists 
know that Einstein’s theory as well begins to fall apart, but we don’t 
know how to construct the even more general theory of nature that 
would work at those temperatures. No matter how far physics pro- 
gresses, we will never be able to describe nature at infinitely high 
temperatures, since there is a limit to the temperatures we can ex- 
plore by experiment and observation in order to guide us to the 
right theory. We are confident that we understand the basic physics 
involved in the evolution of the universe starting a few minutes after 
the Big Bang, and we may be able to push back to milliseconds or 
microseconds after it, but we cannot use the methods of science to 
deal with the beginning of time itself. 


A note on Doppler shifts of light 


If Doppler shifts depend only on the relative motion of the source 
and receiver, then there is no way for a person moving with the 
source and another person moving with the receiver to determine 
who is moving and who isn’t. Either can blame the Doppler shift 
entirely on the other’s motion and claim to be at rest herself. This is 
entirely in agreement with the principle stated originally by Galileo 
that all motion is relative. 


On the other hand, a careful analysis of the Doppler shifts of 
water or sound waves shows that it is only approximately true, at 
low speeds, that the shifts just depend on the relative motion of the 
source and observer. For instance, it is possible for a jet plane to 
keep up with its own sound waves, so that the sound waves appear 
to stand still to the pilot of the plane. The pilot then knows she 
is moving at exactly the speed of sound. The reason this doesn’t 
disprove the relativity of motion is that the pilot is not really de- 
termining her absolute motion but rather her motion relative to the 
air, which is the medium of the sound waves. 


Einstein realized that this solved the problem for sound or water 
waves, but would not salvage the principle of relative motion in the 
case of light waves, since light is not a vibration of any physical 
medium such as water or air. Beginning by imagining what a beam 
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of light would look like to a person riding a motorcycle alongside it, 
Einstein eventually came up with a radical new way of describing 
the universe, in which space and time are distorted as measured 
by observers in different states of motion. As a consequence of this 
Theory of Relativity, he showed that light waves would have Doppler 
shifts that would exactly, not just approximately, depend only on 
the relative motion of the source and receiver. 


Discussion Questions 


A If an airplane travels at exactly the speed of sound, what would be 
the wavelength of the forward-emitted part of the sound waves it emitted? 
How should this be interpreted, and what would actually happen? What 
happens if it’s going faster than the speed of sound? Can you use this to 
explain what you see in figures aa and ab? 


B If bullets go slower than the speed of sound, why can a supersonic 
fighter plane catch up to its own sound, but not to its own bullets? 


Cc If someone inside a plane is talking to you, should their speech be 
Doppler shifted? 
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a/A_ cross-sectional view of 
a human body, showing the vocal 
tract. 


b/ Circular water waves are 
reflected from a boundary on the 
left. (PSSC Physics) 


6.2 Bounded Waves 


Speech is what separates humans most decisively from animals. No 
other species can master syntax, and even though chimpanzees can 
learn a vocabulary of hand signs, there is an unmistakable difference 
between a human infant and a baby chimp: starting from birth, the 
human experiments with the production of complex speech sounds. 


Since speech sounds are instinctive for us, we seldom think about 
them consciously. How do we control sound waves so skillfully? 
Mostly we do it by changing the shape of a connected set of hollow 
cavities in our chest, throat, and head. Somehow by moving the 
boundaries of this space in and out, we can produce all the vowel 
sounds. Up until now, we have been studying only those properties 
of waves that can be understood as if they existed in an infinite, 
open space with no boundaries. In this chapter we address what 
happens when a wave is confined within a certain space, or when a 
wave pattern encounters the boundary between two different media, 
such as when a light wave moving through air encounters a glass 
windowpane. 


6.2.1 Reflection, transmission, and absorption 
Reflection and transmission 


Sound waves can echo back from a cliff, and light waves are 
reflected from the surface of a pond. We use the word reflection, 
normally applied only to light waves in ordinary speech, to describe 
any such case of a wave rebounding from a barrier. Figure (a) shows 
a circular water wave being reflected from a straight wall. In this 
chapter, we will concentrate mainly on reflection of waves that move 
in one dimension, as in figure c/1. 


Wave reflection does not surprise us. After all, a material object 
such as a rubber ball would bounce back in the same way. But waves 
are not objects, and there are some surprises in store. 


First, only part of the wave is usually reflected. Looking out 
through a window, we see light waves that passed through it, but a 
person standing outside would also be able to see her reflection in 
the glass. A light wave that strikes the glass is partly reflected and 
partly transmitted (passed) by the glass. The energy of the original 
wave is split between the two. This is different from the behavior of 
the rubber ball, which must go one way or the other, not both. 


Second, consider what you see if you are swimming underwater 
and you look up at the surface. You see your own reflection. This 
is utterly counterintuitive, since we would expect the light waves to 
burst forth to freedom in the wide-open air. A material projectile 
shot up toward the surface would never rebound from the water-air 
boundary! 


What is it about the difference between two media that causes 
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waves to be partly reflected at the boundary between them? Is 
it their density? Their chemical composition? Ultimately all that 
matters is the speed of the wave in the two media. A wave is partially 
reflected and partially transmitted at the boundary between media 
in which it has different speeds. For example, the speed of light 
waves in window glass is about 30% less than in air, which explains 
why windows always make reflections. Figure c shows examples of 
wave pulses being reflected at the boundary between two coil springs 
of different weights, in which the wave speed is different. 


Reflections such as b and c/1, where a wave encounters a massive 
fixed object, can usually be understood on the same basis as cases 
like c/2 and c/3 where two media meet. Example c/1, for instance, 
is like a more extreme version of example c/2. If the heavy coil 
spring in c/2 was made heavier and heavier, it would end up acting 
like the fixed wall to which the light spring in c/1 has been attached. 


self-check B 

In figure c/1, the reflected pulse is upside-down, but its depth is just as 
big as the original pulse’s height. How does the energy of the reflected 
pulse compare with that of the original? > Answer, p. 869 


Fish have internal ears. example 8 
Why don’t fish have ear-holes? The speed of sound waves in 
a fish’s body is not much different from their speed in water, so 
sound waves are not strongly reflected from a fish’s skin. They 
pass right through its body, so fish can have internal ears. 
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c/1. A wave on a coil spring, 
initially traveling to the left, is re- 
flected from the fixed end. 2. A 
wave in the lighter spring, where 
the wave speed is greater, trav- 
els to the left and is then partly 
reflected and partly transmitted at 
the boundary with the heavier coil 
spring, which has a lower wave 
speed. The reflection is inverted. 
3. A wave moving to the right 
in the heavier spring is partly re- 
flected at the boundary with the 
lighter spring. The reflection is 
uninverted. (PSSC Physics) 
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d/An uninverted reflection. The 
reflected pulse is reversed front 
to back, but is not upside-down. 


—_ ; 
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e/An inverted reflection. The 
reflected pulse is reversed both 
front to back and top to bottom. 


Whale songs traveling long distances example 9 
Sound waves travel at drastically different soeeds through rock, 
water, and air. Whale songs are thus strongly reflected both at 
both the bottom and the surface. The sound waves can travel 
hundreds of miles, bouncing repeatedly between the bottom and 
the surface, and still be detectable. Sadly, noise pollution from 
ships has nearly shut down this cetacean version of the inter- 
net. 


Long-distance radio communication example 10 
Radio communication can occur between stations on opposite 
sides of the planet. The mechanism is entirely similar to the one 
explained in the previous example, but the three media involved 
are the earth, the atmosphere, and the ionosphere. 


self-check C 

Sonar is a method for ships and submarines to detect each other by 
producing sound waves and listening for echoes. What properties would 
an underwater object have to have in order to be invisible to sonar? > 
Answer, p. 869 


The use of the word “reflection” naturally brings to mind the cre- 
ation of an image by a mirror, but this might be confusing, because 
we do not normally refer to “reflection” when we look at surfaces 
that are not shiny. Nevertheless, reflection is how we see the surfaces 
of all objects, not just polished ones. When we look at a sidewalk, 
for example, we are actually seeing the reflecting of the sun from 
the concrete. The reason we don’t see an image of the sun at our 
feet is simply that the rough surface blurs the image so drastically. 


Inverted and uninverted reflections 


Notice how the pulse reflected back to the right in example c/2 
comes back upside-down, whereas the one reflected back to the left 
in c/3 returns in its original upright form. This is true for other 
waves as well. In general, there are two possible types of reflections, 
a reflection back into a faster medium and a reflection back into a 
slower medium. One type will always be an inverting reflection and 
one noninverting. 


It’s important to realize that when we discuss inverted and un- 
inverted reflections on a string, we are talking about whether the 
wave is flipped across the direction of motion (i.e., upside-down in 
these drawings). The reflected pulse will always be reversed front 
to back, as shown in figures d and e. This is because it is traveling 
in the other direction. The leading edge of the pulse is what gets 
reflected first, so it is still ahead when it starts back to the left — 
it’s just that “ahead” is now in the opposite direction. 


358 Chapter6 Waves 


Absorption 


So far we have tacitly assumed that wave energy remains as wave 
energy, and is not converted to any other form. If this was true, then 
the world would become more and more full of sound waves, which 
could never escape into the vacuum of outer space. In reality, any 
mechanical wave consists of a traveling pattern of vibrations of some 
physical medium, and vibrations of matter always produce heat, as 
when you bend a coathangar back and forth and it becomes hot. 
We can thus expect that in mechanical waves such as water waves, 
sound waves, or waves on a string, the wave energy will gradually 
be converted into heat. This is referred to as absorption. ‘The 
reduction in the wave’s energy can also be described as a reduction in 
amplitude, the relationship between them being, as with a vibrating 
object, E x A?. 


The wave suffers a decrease in amplitude, as shown in figure f. 
The decrease in amplitude amounts to the same fractional change 
for each unit of distance covered. For example, if a wave decreases 
from amplitude 2 to amplitude 1 over a distance of 1 meter, then 
after traveling another meter it will have an amplitude of 1/2. That 
is, the reduction in amplitude is exponential. This can be proved 
as follows. By the principle of superposition, we know that a wave 
of amplitude 2 must behave like the superposition of two identical 
waves of amplitude 1. If a single amplitude-1 wave would die down to 
amplitude 1/2 over a certain distance, then two amplitude-1 waves 
superposed on top of one another to make amplitude 1+1=2 must 
die down to amplitude 1/2+1/2=1 over the same distance. 


self-check D 
As a wave undergoes absorption, it loses energy. Does this mean that 
it slows down? > Answer, p. 869 


In many cases, this frictional heating effect is quite weak. Sound 
waves in air, for instance, dissipate into heat extremely slowly, and 
the sound of church music in a cathedral may reverberate for as much 
as 3 or 4 seconds before it becomes inaudible. During this time it 
has traveled over a kilometer! Even this very gradual dissipation 
of energy occurs mostly as heating of the church’s walls and by the 
leaking of sound to the outside (where it will eventually end up as 
heat). Under the right conditions (humid air and low frequency), a 
sound wave in a straight pipe could theoretically travel hundreds of 
kilometers before being noticeable attenuated. 


In general, the absorption of mechanical waves depends a great 
deal on the chemical composition and microscopic structure of the 
medium. Ripples on the surface of antifreeze, for instance, die out 
extremely rapidly compared to ripples on water. For sound waves 
and surface waves in liquids and gases, what matters is the viscosity 
of the substance, i.e., whether it flows easily like water or mercury 
or more sluggishly like molasses or antifreeze. This explains why 
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f/A pulse traveling through 


a highly absorptive medium. 
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our intuitive expectation of strong absorption of sound in water is 
incorrect. Water is a very weak absorber of sound (viz. whale songs 
and sonar), and our incorrect intuition arises from focusing on the 
wrong property of the substance: water’s high density, which is 
irrelevant, rather than its low viscosity, which is what matters. 


Light is an interesting case, since although it can travel through 
matter, it is not itself a vibration of any material substance. Thus 
we can look at the star Sirius, 10'4 km away from us, and be as- 
sured that none of its light was absorbed in the vacuum of outer 
space during its 9-year journey to us. The Hubble Space Telescope 
routinely observes light that has been on its way to us since the 
early history of the universe, billions of years ago. Of course the 
energy of light can be dissipated if it does pass through matter (and 
the light from distant galaxies is often absorbed if there happen to 
be clouds of gas or dust in between). 


Soundproofing example 11 
Typical amateur musicians setting out to soundproof their garages 
tend to think that they should simply cover the walls with the 
densest possible substance. In fact, sound is not absorbed very 
strongly even by passing through several inches of wood. A better 
strategy for soundproofing is to create a sandwich of alternating 
layers of materials in which the speed of sound is very different, 
to encourage reflection. 


The classic design is alternating layers of fiberglass and plywood. 
The speed of sound in plywood is very high, due to its stiffness, 
while its speed in fiberglass is essentially the same as its speed 
in air. Both materials are fairly good sound absorbers, but sound 
waves passing through a few inches of them are still not going 
to be absorbed sufficiently. The point of combining them is that 
a sound wave that tries to get out will be strongly reflected at 
each of the fiberglass-plywood boundaries, and will bounce back 
and forth many times like a ping pong ball. Due to all the back- 
and-forth motion, the sound may end up traveling a total distance 
equal to ten times the actual thickness of the soundproofing be- 
fore it escapes. This is the equivalent of having ten times the 
thickness of sound-absorbing material. 


Radio transmission example 12 
A radio transmitting station must have a length of wire or cable 
connecting the amplifier to the antenna. The cable and the an- 
tenna act as two different media for radio waves, and there will 
therefore be partial reflection of the waves as they come from the 
cable to the antenna. If the waves bounce back and forth many 
times between the amplifier and the antenna, a great deal of their 
energy will be absorbed. There are two ways to attack the prob- 
lem. One possibility is to design the antenna so that the speed of 
the waves in it is as close as possible to the speed of the waves 
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in the cable; this minimizes the amount of reflection. The other 
method is to connect the amplifier to the antenna using a type 
of wire or cable that does not strongly absorb the waves. Partial 
reflection then becomes irrelevant, since all the wave energy will 
eventually exit through the antenna. 


Discussion Questions 


A A sound wave that underwent a pressure-inverting reflection would 
have its compressions converted to expansions and vice versa. How 
would its energy and frequency compare with those of the original sound? 
Would it sound any different? What happens if you swap the two wires 
where they connect to a stereo speaker, resulting in waves that vibrate in 
the opposite way? 


6.2.2 Quantitative treatment of reflection 


In this subsection we analyze the reasons why reflections occur at 
a speed-changing boundary, predict quantitatively the intensities of 
reflection and transmission, and discuss how to predict for any type 
of wave which reflections are inverting and which are uninverting. 


Why reflection occurs 


To understand the fundamental reasons for what does occur at 
the boundary between media, let’s first discuss what doesn’t happen. 
For the sake of concreteness, consider a sinusoidal wave on a string. 
If the wave progresses from a heavier portion of the string, in which 
its velocity is low, to a lighter-weight part, in which it is high, then 
the equation v = fA tells us that it must change its frequency, or 
its wavelength, or both. If only the frequency changed, then the 
parts of the wave in the two different portions of the string would 
quickly get out of step with each other, producing a discontinuity in 
the wave, g/1. This is unphysical, so we know that the wavelength 
must change while the frequency remains constant, g/2. 


But there is still something unphysical about figure g/2. The 
sudden change in the shape of the wave has resulted in a sharp kink 
at the boundary. This can’t really happen, because the medium 
tends to accelerate in such a way as to eliminate curvature. A sharp 
kink corresponds to an infinite curvature at one point, which would 
produce an infinite acceleration, which would not be consistent with 
the smooth pattern of wave motion envisioned in fig. g/2. Waves 
can have kinks, but not stationary kinks. 


We conclude that without positing partial reflection of the wave, 
we cannot simultaneously satisfy the requirements of (1) continuity 
of the wave, and (2) no sudden changes in the slope of the wave. In 
other words, we assume that both the wave and its derivative are 
continuous functions.) 


Does this amount to a proof that reflection occurs? Not quite. 
We have only proved that certain types of wave motion are not 
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g/1. A change in frequency 
without a change in wavelength 
would produce a discontinuity in 
the wave. 2. A simple change in 
wavelength without a reflection 
would result in a sharp kink in the 
wave. 


Bounded Waves 361 


oe 
=e A 


h/A pulse being partially re- 
flected and partially transmitted 
at the boundary between two 
strings in which the wave speed 
is different. The top drawing 
shows the pulse heading to the 
right, toward the heaver string. 
For clarity, all but the first and 
last drawings are schematic. 
Once the reflected pulse begins 
to emerge from the boundary, 
it adds together with the trailing 
parts of the incident pulse. Their 
sum, shown as a wider line, is 
what is actually observed. 


valid solutions. In the following subsection, we prove that a valid 
solution can always be found in which a reflection occurs. Now in 
physics, we normally assume (but seldom prove formally) that the 
equations of motion have a unique solution, since otherwise a given 
set of initial conditions could lead to different behavior later on, 
but the Newtonian universe is supposed to be deterministic. Since 
the solution must be unique, and we derive below a valid solution 
involving a reflected pulse, we will have ended up with what amounts 
to a proof of reflection. 


Intensity of reflection 


I will now show, in the case of waves on a string, that it is possible 
to satisfy the physical requirements given above by constructing a 
reflected wave, and as a bonus this will produce an equation for 
the proportions of reflection and transmission and a prediction as 
to which conditions will lead to inverted and which to uninverted 
reflection. We assume only that the principle of superposition holds, 
which is a good approximations for waves on a string of sufficiently 
small amplitude. 


Let the unknown amplitudes of the reflected and transmitted 
waves be R and T, respectively. An inverted reflection would be 
represented by a negative value of R. We can without loss of gen- 
erality take the incident (original) wave to have unit amplitude. 
Superposition tells us that if, for instance, the incident wave had 
double this amplitude, we could immediately find a corresponding 
solution simply by doubling R and T. 


Just to the left of the boundary, the height of the wave is given 
by the height 1 of the incident wave, plus the height R of the part 
of the reflected wave that has just been created and begun heading 
back, for a total height of 1+ R. On the right side immediately next 
to the boundary, the transmitted wave has a height JT. To avoid a 
discontinuity, we must have 


14+ k=T 


Next we turn to the requirement of equal slopes on both sides of 
the boundary. Let the slope of the incoming wave be s immediately 
to the left of the junction. If the wave was 100% reflected, and 
without inversion, then the slope of the reflected wave would be —s, 
since the wave has been reversed in direction. In general, the slope 
of the reflected wave equals —sR, and the slopes of the superposed 
waves on the left side add up to s — sR. On the right, the slope 
depends on the amplitude, 7’, but is also changed by the stretching 
or compression of the wave due to the change in speed. If, for 
example, the wave speed is twice as great on the right side, then 
the slope is cut in half by this effect. The slope on the right is 
therefore s(v1/v2)T’, where v is the velocity in the original medium 
and v2 the velocity in the new medium. Equality of slopes gives 


362 Chapter6 Waves 


s—sR= s(v1/v2)T, or 


(2pa 7 
v2 


Solving the two equations for the unknowns R and T gives 


R- V2 — VI 
v2 + V1 
and 
2 
pe 208 
v2 + V1 


The first equation shows that there is no reflection unless the 
two wave speeds are different, and that the reflection is inverted in 
reflection back into a fast medium. 


The energies of the transmitted and reflected wavers always add 
up to the same as the energy of the original wave. There is never 
any abrupt loss (or gain) in energy when a wave crosses a boundary; 
conversion of wave energy to heat occurs for many types of waves, 
but it occurs throughout the medium. The equation for J’, surpris- 
ingly, allows the amplitude of the transmitted wave to be greater 
than 1, ie., greater than that of the incident wave. This does not 
violate conservation of energy, because this occurs when the second 
string is less massive, reducing its kinetic energy, and the trans- 
mitted pulse is broader and less strongly curved, which lessens its 
potential energy. 


Inverted and uninverted reflections in general (optional) 


For waves on a string, reflections back into a faster medium are 
inverted, while those back into a slower medium are uninverted. Is 
this true for all types of waves? The rather subtle answer is that it 
depends on what property of the wave you are discussing. 


Let’s start by considering wave disturbances of freeway traffic. 
Anyone who has driven frequently on crowded freeways has observed 
the phenomenon in which one driver taps the brakes, starting a chain 
reaction that travels backward down the freeway as each person in 
turn exercises caution in order to avoid rear-ending anyone. The 
reason why this type of wave is relevant is that it gives a simple, 
easily visualized example of our description of a wave depends on 
which aspect of the wave we have in mind. In steadily flowing free- 
way traffic, both the density of cars and their velocity are constant 
all along the road. Since there is no disturbance in this pattern of 
constant velocity and density, we say that there is no wave. Now if 
a wave is touched off by a person tapping the brakes, we can either 
describe it as a region of high density or as a region of decreasing 
velocity. 
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j/In the mirror image, the 
areas of positive excess traffic 
density are still positive, but 
the velocities of the cars have 
all been reversed, so areas of 
positive excess velocity have 
been turned into negative ones. 


k/A_ pulse encounters two 
boundaries. 


|/ A sine wave has been reflected 
at two different boundaries, and 
the two reflections interfere. 


The freeway traffic wave is in fact a good model of a sound wave, 
and a sound wave can likewise be described either by the density 
(or pressure) of the air or by its speed. Likewise many other types 
of waves can be described by either of two functions, one of which 
is often the derivative of the other with respect to position. 


Now let’s consider reflections. If we observe the freeway wave in 
a mirror, the high-density area will still appear high in density, but 
velocity in the opposite direction will now be described by a neg- 
ative number. A person observing the mirror image will draw the 
same density graph, but the velocity graph will be flipped across the 
x axis, and its original region of negative slope will now have posi- 
tive slope. Although I don’t know any physical situation that would 
correspond to the reflection of a traffic wave, we can immediately ap- 
ply the same reasoning to sound waves, which often do get reflected, 
and determine that a reflection can either be density-inverting and 
velocity-uninverting or density-uninverting and velocity-inverting. 


This same type of situation will occur over and over as one en- 
counters new types of waves, and to apply the analogy we need 
only determine which quantities, like velocity, become negated in a 
mirror image and which, like density, stay the same. 


A light wave, for instance consists of a traveling pattern of elec- 
tric and magnetic fields. All you need to know in order to analyze the 
reflection of light waves is how electric and magnetic fields behave 
under reflection; you don’t need to know any of the detailed physics 
of electricity and magnetism. An electric field can be detected, for 
example, by the way one’s hair stands on end. The direction of 
the hair indicates the direction of the electric field. In a mirror im- 
age, the hair points the other way, so the electric field is apparently 
reversed in a mirror image. The behavior of magnetic fields, how- 
ever, is a little tricky. The magnetic properties of a bar magnet, 
for instance, are caused by the aligned rotation of the outermost 
orbiting electrons of the atoms. In a mirror image, the direction of 
rotation is reversed, say from clockwise to counterclockwise, and so 
the magnetic field is reversed twice: once simply because the whole 
picture is flipped and once because of the reversed rotation of the 
electrons. In other words, magnetic fields do not reverse themselves 
in a mirror image. We can thus predict that there will be two pos- 
sible types of reflection of light waves. In one, the electric field is 
inverted and the magnetic field uninverted. In the other, the electric 
field is uninverted and the magnetic field inverted. 


6.2.3 Interference effects 


If you look at the front of a pair of high-quality binoculars, you 
will notice a greenish-blue coating on the lenses. This is advertised 
as a coating to prevent reflection. Now reflection is clearly undesir- 
able — we want the light to go in the binoculars — but so far I’ve 
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described reflection as an unalterable fact of nature, depending only 
on the properties of the two wave media. The coating can’t change 
the speed of light in air or in glass, so how can it work? The key is 
that the coating itself is a wave medium. In other words, we have 
a three-layer sandwich of materials: air, coating, and glass. We will 
analyze the way the coating works, not because optical coatings are 
an important part of your education but because it provides a good 
example of the general phenomenon of wave interference effects. 


There are two different interfaces between media: an air-coating 
boundary and a coating-glass boundary. Partial reflection and par- 
tial transmission will occur at each boundary. For ease of visual- 
ization let’s start by considering an equivalent system consisting of 
three dissimilar pieces of string tied together, and a wave pattern 
consisting initially of a single pulse. Figure k/1 shows the incident 
pulse moving through the heavy rope, in which its velocity is low. 
When it encounters the lighter-weight rope in the middle, a faster 
medium, it is partially reflected and partially transmitted. (The 
transmitted pulse is bigger, but nevertheless has only part of the 
original energy.) The pulse transmitted by the first interface is then 
partially reflected and partially transmitted by the second bound- 
ary, k/3. In figure k/4, two pulses are on the way back out to the 
left, and a single pulse is heading off to the right. (There is still a 
weak pulse caught between the two boundaries, and this will rattle 
back and forth, rapidly getting too weak to detect as it leaks energy 
to the outside with each partial reflection.) 


Note how, of the two reflected pulses in k/4, one is inverted and 
one uninverted. One underwent reflection at the first boundary (a 
reflection back into a slower medium is uninverted), but the other 
was reflected at the second boundary (reflection back into a faster 
medium is inverted). 


Now let’s imagine what would have happened if the incoming 
wave pattern had been a long sinusoidal wave train instead of a 
single pulse. The first two waves to reemerge on the left could be 
in phase, 1/1, or out of phase, 1/2, or anywhere in between. The 
amount of lag between them depends entirely on the width of the 
middle segment of string. If we choose the width of the middle string 
segment correctly, then we can arrange for destructive interference 
to occur, 1/2, with cancellation resulting in a very weak reflected 
wave. 


This whole analysis applies directly to our original case of optical 
coatings. Visible light from most sources does consist of a stream of 
short sinusoidal wave-trains such as the ones drawn above. The only 
real difference between the waves-on-a-rope example and the case of 
an optical coating is that the first and third media are air and glass, 
in which light does not have the same speed. However, the general 
result is the same as long as the air and the glass have light-wave 
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m/A_ pulse bounces back 
and forth. 


n/We model a_ guitar string 
attached to the guitar's body at 
both ends as a light-weight string 
attached to extremely heavy 
strings at its ends. 


speeds that either both greater than the coating’s or both less than 
the coating’s. 


The business of optical coatings turns out to be a very arcane 
one, with a plethora of trade secrets and “black magic” techniques 
handed down from master to apprentice. Nevertheless, the ideas 
you have learned about waves in general are sufficient to allow you 
to come to some definite conclusions without any further technical 
knowledge. The self-check and discussion questions will direct you 
along these lines of thought. 


self-check E 

Color corresponds to wavelength of light waves. Is it possible to choose 
a thickness for an optical coating that will produce destructive interfer- 
ence for all colors of light? > Answer, p. 
869 


This example was typical of a wide variety of wave interference 
effects. With a little guidance, you are now ready to figure out 
for yourself other examples such as the rainbow pattern made by a 
compact disc or by a layer of oil on a puddle. 


Discussion Questions 


A Is it possible to get complete destructive interference in an optical 
coating, at least for light of one specific wavelength? 


B Sunlight consists of sinusoidal wave-trains containing on the order 
of a hundred cycles back-to-back, for a length of something like a tenth of 
a millimeter. What happens if you try to make an optical coating thicker 
than this? 


C Suppose you take two microscope slides and lay one on top of the 
other so that one of its edges is resting on the corresponding edge of the 
bottom one. If you insert a sliver of paper or a hair at the opposite end, 
a wedge-shaped layer of air will exist in the middle, with a thickness that 
changes gradually from one end to the other. What would you expect to 
see if the slides were illuminated from above by light of a single color? 
How would this change if you gradually lifted the lower edge of the top 
slide until the two slides were finally parallel? 


D An observation like the one described in discussion question C was 
used by Newton as evidence against the wave theory of light! If Newton 
didn’t know about inverting and noninverting reflections, what would have 
seemed inexplicable to him about the region where the air layer had zero 
or nearly zero thickness? 


6.2.4 Waves bounded on both sides 


In the example of the previous section, it was theoretically true 
that a pulse would be trapped permanently in the middle medium, 
but that pulse was not central to our discussion, and in any case it 
was weakening severely with each partial reflection. Now consider 
a guitar string. At its ends it is tied to the body of the instrument 
itself, and since the body is very massive, the behavior of the waves 
when they reach the end of the string can be understood in the 
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same way as if the actual guitar string was attached on the ends to 
strings that were extremely massive. Reflections are most intense 
when the two media are very dissimilar. Because the wave speed in 
the body is so radically different from the speed in the string, we 
should expect nearly 100% reflection. 


Although this may seem like a rather bizarre physical model of 
the actual guitar string, it already tells us something interesting 
about the behavior of a guitar that we would not otherwise have 
understood. The body, far from being a passive frame for attaching 
the strings to, is actually the exit path for the wave energy in the 
strings. With every reflection, the wave pattern on the string loses 
a tiny fraction of its energy, which is then conducted through the 
body and out into the air. (The string has too little cross-section to 
make sound waves efficiently by itself.) By changing the properties 
of the body, moreover, we should expect to have an effect on the 
manner in which sound escapes from the instrument. This is clearly 
demonstrated by the electric guitar, which has an extremely massive, 
solid wooden body. Here the dissimilarity between the two wave 
media is even more pronounced, with the result that wave energy 
leaks out of the string even more slowly. This is why an electric 
guitar with no electric pickup can hardly be heard at all, and it is 
also the reason why notes on an electric guitar can be sustained for 
longer than notes on an acoustic guitar. 


If we initially create a disturbance on a guitar string, how will 
the reflections behave? In reality, the finger or pick will give the 
string a triangular shape before letting it go, and we may think of 
this triangular shape as a very broad “dent” in the string which 
will spread out in both directions. For simplicity, however, let’s just 
imagine a wave pattern that initially consists of a single, narrow 
pulse traveling up the neck, m/1. After reflection from the top end, 
it is inverted, m/3. Now something interesting happens: figure m/5 
is identical to figure m/1. After two reflections, the pulse has been 
inverted twice and has changed direction twice. It is now back where 
it started. The motion is periodic. This is why a guitar produces 
sounds that have a definite sensation of pitch. 


self-check F 

Notice that from m/1 to m/5, the pulse has passed by every point on the 
string exactly twice. This means that the total distance it has traveled 
equals 2 L, where L is the length of the string. Given this fact, what are 
the period and frequency of the sound it produces, expressed in terms 
of L and v, the velocity of the wave? > Answer, p. 869 


Note that if the waves on the string obey the principle of super- 
position, then the velocity must be independent of amplitude, and 
the guitar will produce the same pitch regardless of whether it is 
played loudly or softly. In reality, waves on a string obey the prin- 
ciple of superposition approximately, but not exactly. The guitar, 
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o/The period of this double- 
pulse pattern is half of what we’d 


otherwise expect. 
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q/Graphs of loudness _ ver- 
sus frequency for the vowel “ah,” 
sung as three different musical 
notes. G is consonant with D, 
since every overtone of G that 
is close to an overtone of D 
(marked “*”) is at exactly the 
same frequency. G and Ci are 
dissonant together, since some 
of the overtones of G (marked 
“x”) are close to, but not right on 
top of, those of Ct. 


like just about any acoustic instrument, is a little out of tune when 
played loudly. (The effect is more pronounced for wind instruments 
than for strings, but wind players are able to compensate for it.) 


Now there is only one hole in our reasoning. Suppose we some- 
how arrange to have an initial setup consisting of two identical pulses 
heading toward each other, as in figure (g). They will pass through 
each other, undergo a single inverting reflection, and come back to 
a configuration in which their positions have been exactly inter- 
changed. This means that the period of vibration is half as long. 
The frequency is twice as high. 


This might seem like a purely academic possibility, since nobody 
actually plays the guitar with two picks at once! But in fact it is an 
example of a very general fact about waves that are bounded on both 
sides. A mathematical theorem called Fourier’s theorem states that 
any wave can be created by superposing sine waves. Figure p shows 
how even by using only four sine waves with appropriately chosen 
amplitudes, we can arrive at a sum which is a decent approximation 
to the realistic triangular shape of a guitar string being plucked. 
The one-hump wave, in which half a wavelength fits on the string, 
will behave like the single pulse we originally discussed. We call 
its frequency f.. The two-hump wave, with one whole wavelength, 
is very much like the two-pulse example. For the reasons discussed 
above, its frequency is 2f,. Similarly, the three-hump and four-hump 
waves have frequencies of 3f> and 4fo. 


Theoretically we would need to add together infinitely many 
such wave patterns to describe the initial triangular shape of the 
string exactly, although the amplitudes required for the very high 
frequency parts would be very small, and an excellent approximation 
could be achieved with as few as ten waves. 


We thus arrive at the following very general conclusion. When- 
ever a wave pattern exists in a medium bounded on both sides by 
media in which the wave speed is very different, the motion can be 
broken down into the motion of a (theoretically infinite) series of sine 
waves, with frequencies fo, 2fo, 3fo, ... Except for some technical 
details, to be discussed below, this analysis applies to a vast range of 
sound-producing systems, including the air column within the hu- 
man vocal tract. Because sounds composed of this kind of pattern 
of frequencies are so common, our ear-brain system has evolved so 
as to perceive them as a single, fused sensation of tone. 


Musical applications 


Many musicians claim to be able to pick out by ear several of the 
frequencies 2 fo, 3fo, ..., called overtones or harmonics of the funda- 
mental fo, but they are kidding themselves. In reality, the overtone 
series has two important roles in music, neither of which depends 
on this fictitious ability to “hear out” the individual overtones. 
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s / Standing waves on a rope. (PSSC Physics.) 


First, the relative strengths of the overtones is an important 
part of the personality of a sound, called its timbre (rhymes with 
“amber” ). The characteristic tone of the brass instruments, for ex- 
ample, is a sound that starts out with a very strong harmonic series 
extending up to very high frequencies, but whose higher harmonics 
die down drastically as the attack changes to the sustained portion 
of the note. 


Second, although the ear cannot separate the individual harmon- 
ics of a single musical tone, it is very sensitive to clashes between 
the overtones of notes played simultaneously, i.e., in harmony. We 
tend to perceive a combination of notes as being dissonant if they 
have overtones that are close but not the same. Roughly speaking, 
strong overtones whose frequencies differ by more than 1% and less 
than 10% cause the notes to sound dissonant. It is important to re- 
alize that the term “dissonance” is not a negative one in music. No 
matter how long you search the radio dial, you will never hear more 
than three seconds of music without at least one dissonant combi- 
nation of notes. Dissonance is a necessary ingredient in the creation 
of a musical cycle of tension and release. Musically knowledgeable 
people do not usually use the word “dissonant” as a criticism of mu- 
sic, and if they do, what they are really saying is that the dissonance 
has been used in a clumsy way, or without providing any contrast 
between dissonance and consonance. 
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r/lf you take a sine wave 
and make a copy of it shifted 
over, their sum is still a sine wave. 
The same is not true for a square 


wave. 
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t/ Surprisingly, sound waves 
undergo partial reflection at the 
open ends of tubes as well as 
closed ones. 


u/Graphs of excess density 
versus position for the lowest- 
frequency standing waves of 
three types of air columns. Points 
on the axis have normal air 
density. 


Standing waves 


Figure s shows sinusoidal wave patterns made by shaking a rope. 
I used to enjoy doing this at the bank with the pens on chains, back 
in the days when people actually went to the bank. You might think 
that I and the person in the photos had to practice for a long time 
in order to get such nice sine waves. In fact, a sine wave is the only 
shape that can create this kind of wave pattern, called a standing 
wave, which simply vibrates back and forth in one place without 
moving. The sine wave just creates itself automatically when you 
find the right frequency, because no other shape is possible. 


If you think about it, it’s not even obvious that sine waves should 
be able to do this trick. After all, waves are supposed to travel at a 
set speed, aren’t they? The speed isn’t supposed to be zero! Well, we 
can actually think of a standing wave as a superposition of a moving 
sine wave with its own reflection, which is moving the opposite way. 
Sine waves have the unique mathematical property that the sum of 
sine waves of equal wavelength is simply a new sine wave with the 
same wavelength. As the two sine waves go back and forth, they 
always cancel perfectly at the ends, and their sum appears to stand 
still. 


Standing wave patterns are rather important, since atoms are 
really standing-wave patterns of electron waves. You are a standing 
wave! 


Standing-wave patterns of air columns 


The air column inside a wind instrument behaves very much 
like the wave-on-a-string example we’ve been concentrating on so 
far, the main difference being that we may have either inverting or 
noninverting reflections at the ends. 


Some organ pipes are closed at both ends. The speed of sound 
is different in metal than in air, so there is a strong reflection at 
the closed ends, and we can have standing waves. These reflections 
are both density-noninverting, so we get symmetric standing-wave 
patterns, such as the one shown in figure u/1. 


Figure t shows the sound waves in and around a bamboo Japanese 
flute called a shakuhachi, which is open at both ends of the air col- 
umn. We can only have a standing wave pattern if there are re- 
flections at the ends, but that is very counterintuitive — why is 
there any reflection at all, if the sound wave is free to emerge into 
open space, and there is no change in medium? Recall the reason 
why we got reflections at a change in medium: because the wave- 
length changes, so the wave has to readjust itself from one pattern 
to another, and the only way it can do that without developing a 
kink is if there is a reflection. Something similar is happening here. 
The only difference is that the wave is adjusting from being a plane 
wave to being a spherical wave. The reflections at the open ends 


370 Chapter6 Waves 


are density-inverting, u/2, so the wave pattern is pinched off at the 
ends. Comparing panels 1 and 2 of the figure, we see that although 
the wave pattens are different, in both cases the wavelength is the 
same: in the lowest-frequency standing wave, half a wavelength fits 
inside the tube. Thus, it isn’t necessary to memorize which type of 
reflection is inverting and which is inverting. It’s only necessary to 
know that the tubes are symmetric. 


Finally, we can have an asymmetric tube: closed at one end and 
open at the other. A common example is the pan pipes, v, which are 
closed at the bottom and open at the top. The standing wave with 
the lowest frequency is therefore one in which 1/4 of a wavelength 
fits along the length of the tube, as shown in figure u/3. 


Sometimes an instrument’s physical appearance can be mislead- 
ing. A concert flute, w, is closed at the mouth end and open at 
the other, so we would expect it to behave like an asymmetric air 
column; in reality, it behaves like a symmetric air column open at 
both ends, because the embouchure hole (the hole the player blows 
over) acts like an open end. The clarinet and the saxophone look 
similar, having a moutpiece and reed at one end and an open end 
at the other, but they act different. In fact the clarinet’s air col- 
umn has patterns of vibration that are asymmetric, the saxophone 
symmetric. The discrepancy comes from the difference between the 
conical tube of the sax and the cylindrical tube of the clarinet. The 
adjustment of the wave pattern from a plane wave to a spherical 
wave is more gradual at the flaring bell of the saxophone. 


self-check G 

Draw a graph of pressure versus position for the first overtone of the air 
column in a tube open at one end and closed at the other. This will be 
the next-to-longest possible wavelength that allows for a point of maxi- 
mum vibration at one end and a point of no vibration at the other. How 
many times shorter will its wavelength be compared to the frequency 
of the lowest-frequency standing wave, shown in the figure? Based on 
this, how many times greater will its frequency be? —> Answer, p. 869 


‘The speed of sound example 13 
We can get a rough and ready derivation of the equation for the 
speed of sound by analyzing the standing waves in a cylindrical 
air column as a special type of Helmholtz resonance (example 
25 on page 329), in which the cavity happens to have the same 
cross-sectional area as the neck. Roughly speaking, the regions 
of maximum density variation act like the cavity. The regions of 
minimum density variation, on the other hand, are the places 
where the velocity of the air is varying the most; these regions 
throttle back the speed of the vibration, because of the inertia of 
the moving air. If the cylinder has cross-sectional area A, then the 
“cavity” and “neck” parts of the wave both have lengths of some- 
thing like A/2, and the volume of the “cavity” is about AA/2. We 
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v/A pan pipe is an asym- 
metric air column, open at the top 
and closed at the bottom. 


w/A_ concert flute looks like 
an asymmetric air column, open 
at the mouth end and closed at 
the other. However, its patterns of 
vibration are symmetric, because 
the embouchure hole acts like an 
open end. 
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get v = fA = (...)\/yPo/p, where the factor (...) represents nu- 
merical stuff that we can’t possibly hope to have gotten right with 
such a crude argument. The correct result is in fact v = ,\/yP,/p. 
Isaac Newton attempted the same calculation, but didn’t under- 
stand the thermodynamic effects involved, and therefore got a 
result that didn’t have the correct factor of y. 


This chapter is summarized on page 897. Notation and terminology 
are tabulated on pages 885-886. 
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Problems 
The symbols Vv, “, etc. are explained on page 376. 


1 The musical note middle C has a frequency of 262 Hz. What 
are its period and wavelength? 


2 The following is a graph of the height of a water wave as a 
function of position, at a certain moment in time. 


Trace this graph onto another piece of paper, and then sketch below 
it the corresponding graphs that would be obtained if 

(a) the amplitude and frequency were doubled while the velocity 
remained the same; 

(b) the frequency and velocity were both doubled while the ampli- 
tude remained unchanged; 

(c) the wavelength and amplitude were reduced by a factor of three 
while the velocity was doubled. 


Problem by Arnold Arons. z “7 ™ 


3 (a) The graph shows the height of a water wave pulse as a 


function of position. Draw a graph of height as a function of time Problem 3 
for a specific point on the water. Assume the pulse is traveling to 
the right. 


(b) Repeat part a, but assume the pulse is traveling to the left. 

(c) Now assume the original graph was of height as a function of 
time, and draw a graph of height as a function of position, assuming 
the pulse is traveling to the right. 

(d) Repeat part c, but assume the pulse is traveling to the left. 
Problem by Arnold Arons. g 


4 At a particular moment in time, a wave on a string has a shape 
described by y = 3.5cos(0.737a + 0.457t + 0.377). The stuff inside 
the cosine is in radians. Assume that the units of the numerical 
constants are such that x, y, and t are in SI units. ob Hint, p. 865 
(a) Is the wave moving in the positive x or the negative x direction? 
(b) Find the wave’s period, frequency, wavelength. 

(c) Find the wave’s velocity. 

(d) Find the maximum velocity of any point on the string, and 


compare with the magnitude and direction of the wave’s velocity. 
Vv oo 


5 The figure shows one wavelength of a steady sinusoidal wave 
traveling to the right along a string. Define a coordinate system 
in which the positive x axis points to the right and the positive y 
axis up, such that the flattened string would have y = 0. Copy 
the figure, and label with “y = 0” all the appropriate parts of the 
string. Similarly, label with “v = 0” all parts of the string whose 
velocities are zero, and with “a = 0” all parts whose accelerations 
are zero. There is more than one point whose velocity is of the 
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greatest magnitude. Pick one of these, and indicate the direction of 
its velocity vector. Do the same for a point having the maximum 
magnitude of acceleration. 

Problem by Arnold Arons. z 


6 Find an equation for the relationship between the Doppler- 
shifted frequency of a wave and the frequency of the original wave, 
for the case of a stationary observer and a source moving directly 
toward or away from the observer. 


7 Suggest a quantitative experiment to look for any deviation 
from the principle of superposition for surface waves in water. Try 
to make your experiment simple and practical. E 


8 The simplest trick with a lasso is to spin a flat loop in a 
horizontal plane. The whirling loop of a lasso is kept under tension 
mainly due to its own rotation. Although the spoke’s force on the 
loop has an inward component, we'll ignore it. The purpose of this 
problem, which is based on one by A.P. French, is to prove a cute 
fact about wave disturbances moving around the loop. As far as I 
know, this fact has no practical implications for trick roping! Let the 
loop have radius r and mass per unit length yz, and let its angular 
Prohlanis: velocity be w. 
(a) Find the tension, T, in the loop in terms of r, jz, and w. Assume 
the loop is a perfect circle, with no wave disturbances on it yet. 
> Hint, p. 865 pb Answer, p. 875 
(b) Find the velocity of a wave pulse traveling around the loop. 
Discuss what happens when the pulse moves is in the same direction 
as the rotation, and when it travels contrary to the rotation. ~ 


loop 


9 A string hangs vertically, free at the bottom and attached at 
the top. 

(a) Find the velocity of waves on the string as a function of the 
distance from the bottom 

(b) Find the acceleration of waves on the string. p> Answer, p. 875 
(c) Interpret your answers to parts a and b for the case where a 
pulse comes down and reaches the end of the string. What happens 
next? Check your answer against experiment and conservation of 
energy. ns 


10 Singing that is off-pitch by more than about 1% sounds bad. 
How fast would a singer have to be moving relative to a the rest of 
a band to make this much of a change in pitch due to the Doppler 
effect? 


11 Light travels faster in warmer air. Use this fact to explain 
the formation of a mirage appearing like the shiny surface of a pool 
of water when there is a layer of hot air above a road. 


12 (a) Compute the amplitude of light that is reflected back 
into air at an air-water interface, relative to the amplitude of the 
incident wave. Assume that the light arrives in the direction directly 
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perpendicular to the surface.The speeds of light in air and water are 
3.0 x 10° and 2.2 x 108 m/s, respectively. 

(b) Find the energy of the reflected wave as a fraction of the incident 
energy. > Hint, p. 865 Vv 


13 A concert flute produces its lowest note, at about 262 Hz, 
when half of a wavelength fits inside its tube. Compute the length 
of the flute. > Answer, p. 875 © 


14 (a) A good tenor saxophone player can play all of the fol- 
lowing notes without changing her fingering, simply by altering the 
tightness of her lips: Eb (150 Hz), Eb (300 Hz), Bb (450 Hz), and 
E> (600 Hz). How is this possible? (I’m not asking you to analyze 
the coupling between the lips, the reed, the mouthpiece, and the air 
column, which is very complicated.) 

(b) Some saxophone players are known for their ability to use this 
technique to play “freak notes,” i.e., notes above the normal range 
of the instrument. Why isn’t it possible to play notes below the 
normal range using this technique? & 


15 The table gives the frequencies of the notes that make up 
the key of F major, starting from middle C and going up through 
all seven notes. 

(a) Calculate the first four or five harmonics of C and G, and deter- 
mine whether these two notes will be consonant or dissonant. 

(b) Do the same for C and Bb. (Recall that harmonics that differ 
by about 1-10% cause dissonance.) & 


16 A Fabry-Perot interferometer, shown in the figure being used 
to measure the diameter of a thin filament, consists of two glass 
plates with an air gap between them. As a the top plate is moved 
up or down with a screw, the light passing through the plates goes 
through a cycle of constructive and destructive interference, which is 
mainly due to interference between rays that pass straight through 
and those that are reflected twice back into the air gap. (Although 
the dimensions in this drawing are distorted for legibility, the glass 
plates would really be much thicker than the length of the wave- 
trains of light, so no interference effects would be observed due to 
reflections within the glass.) 

(a) If the top plate is cranked down so that the thickness, d, of the 
air gap is much less than the wavelength X of the light, what is the 
phase relationship between the two rays? (Recall that the phase 
can be inverted by a reflection.) With d = 0, is the interference 
constructive or destructive? 

(b) If d is slowly increased from zero, what is the first value of d for 
which the interference is the same as at d = 0? Express your answer 
in terms of X. 

(c) Suppose the apparatus is first set up as shown in the figure. The 
filament is then removed, and n cycles of brightening and dimming 
are counted while the top plate is brought down to d = 0. What is 


261.6 Hz 
293.7 
329.6 
349.2 
392.0 
440.0 

dp 466.2 


DeoQoymgoa 


Problem 15. 


<A 


Problem 16. 
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the thickness of the filament, in terms of n and A? 
Based on a problem by D.J. Raymond. z 


17 (a) A wave pulse moves into a new medium, where its ve- 
locity is greater by a factor a. Find an expression for the fraction, 
f, of the wave energy that is transmitted, in terms of a. Note that, 
as discussed in the text, you cannot simply find f by squaring the 
amplitude of the transmitted wave. > Answer, p. 875 
(b) Suppose we wish to transmit a pulse from one medium to an- 
other, maximizing the fraction of the wave energy transmitted. To 
do so, we sandwich another layer in between them, so that the wave 
moves from the initial medium, where its velocity is v;, through the 
intermediate layer, where it is v2, and on into the final layer, where 
it becomes v3. What is the optimal value of v2? (Assume that the 
middle layer is thicker than the length of the pulse, so there are no 
interference effects. Also, although there will be later echoes that 
are transmitted after multiple reflections back and forth across the 
middle layer, you are only to optimize the strength of the transmit- 
ted pulse that is first to emerge. In other words, it’s simply a matter 
of applying your answer from part a twice to find the amount that 
finally gets through.) > Answer, p. 875 i 


Key to symbols: 
easy © typical challenging Mdifficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Chapter 7 
Relativity 


7.1 Time Is Not Absolute 


When Einstein first began to develop the theory of relativity, around 
1905, the only real-world observations he could draw on were am- 
biguous and indirect. Today, the evidence is part of everyday life. 
For example, every time you use a GPS receiver, a, you’re using 
Einstein’s theory of relativity. Somewhere between 1905 and today, 
technology became good enough to allow conceptually simple ex- 
periments that students in the early 20th century could only discuss 
in terms like “Imagine that we could...” A good jumping-on point 
is 1971. In that year, J.C. Hafele and R.E. Keating brought atomic 
clocks aboard commercial airliners, b, and went around the world, 
once from east to west and once from west to east. Hafele and 
Keating observed that there was a discrepancy between the times 
measured by the traveling clocks and the times measured by similar 
clocks that stayed home at the U.S. Naval Observatory in Wash- 
ington. The east-going clock lost time, ending up off by —59 + 10 
nanoseconds, while the west-going one gained 273 +7 ns. 


7.1.1 The correspondence principle 


This establishes that time doesn’t work the way Newton believed 
it did when he wrote that “Absolute, true, and mathematical time, 
of itself, and from its own nature flows equably without regard to 
anything external...” We are used to thinking of time as absolute 
and universal, so it is disturbing to find that it can flow at a different 
rate for observers in different frames of reference. Nevertheless, the 
effects that Hafele and Keating observed were small. This makes 
sense: Newton’s laws have already been thoroughly tested by ex- 
periments under a wide variety of conditions, so a new theory like 
relativity must agree with Newton’s to a good approximation, within 
the Newtonian theory’s realm of applicability. This requirement of 
backward-compatibility is known as the correspondence principle. 


7.1.2 Causality 


It’s also reassuring that the effects on time were small compared 
to the three-day lengths of the plane trips. There was therefore no 
opportunity for paradoxical scenarios such as one in which the east- 
going experimenter arrived back in Washington before he left and 
then convinced himself not to take the trip. A theory that maintains 


a/ This Global Positioning 
System (GPS) system, running 
on a smartphone attached to a 
bike’s handlebar, depends on 
Einstein’s theory of relativity. 
Time flows at a different rates 
aboard a GPS satellite than it 
does on the bike, and the GPS 
software has to take this into 
account. 


b/ The clock took up two seats, 
and two tickets were bought for it 
under the name of “Mr. Clock.” 
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c/Newton’s laws do not dis- 
tinguish past from future. The 
football could travel in either 
direction while obeying Newton’s 
laws. 


d/All three clocks are mov- 
ing to the east. Even though the 
west-going plane is moving to the 
west relative to the air, the air 
is moving to the east due to the 
earth’s rotation. 


this kind of orderly relationship between cause and effect is said to 
satisfy causality. 


Causality is like a water-hungry front-yard lawn in Los Angeles: 
we know we want it, but it’s not easy to explain why. Even in plain 
old Newtonian physics, there is no clear distinction between past 
and future. In figure c, number 18 throws the football to number 
25, and the ball obeys Newton’s laws of motion. If we took a video 
of the pass and played it backward, we would see the ball flying from 
25 to 18, and Newton’s laws would still be satisfied. Nevertheless, 
we have a strong psychological impression that there is a forward 
arrow of time. I can remember what the stock market did last year, 
but I can’t remember what it will do next year. Joan of Arc’s mil- 
itary victories against England caused the English to burn her at 
the stake; it’s hard to accept that Newton’s laws provide an equally 
good description of a process in which her execution in 1431 caused 
her to win a battle in 1429. There is no consensus at this point 
among physicists on the origin and significance of time’s arrow, and 
for our present purposes we don’t need to solve this mystery. In- 
stead, we merely note the empirical fact that, regardless of what 
causality really means and where it really comes from, its behavior 
is consistent. Specifically, experiments show that if an observer in a 
certain frame of reference observes that event A causes event B, then 
observers in other frames agree that A causes B, not the other way 
around. This is merely a generalization about a large body of ex- 
perimental results, not a logically necessary assumption. If Keating 
had gone around the world and arrived back in Washington before 
he left, it would have disproved this statement about causality. 


7.1.3 Time distortion arising from motion and gravity 


Hafele and Keating were testing specific quantitative predictions 
of relativity, and they verified them to within their experiment’s 
error bars. Let’s work backward instead, and inspect the empirical 
results for clues as to how time works. 


The two traveling clocks experienced effects in opposite direc- 
tions, and this suggests that the rate at which time flows depends 
on the motion of the observer. The east-going clock was moving in 
the same direction as the earth’s rotation, so its velocity relative to 
the earth’s center was greater than that of the clock that remained 
in Washington, while the west-going clock’s velocity was correspond- 
ingly reduced. The fact that the east-going clock fell behind, and 
the west-going one got ahead, shows that the effect of motion is to 
make time go more slowly. This effect of motion on time was pre- 
dicted by Einstein in his original 1905 paper on relativity, written 
when he was 26. 


If this had been the only effect in the Hafele-Keating experiment, 
then we would have expected to see effects on the two flying clocks 
that were equal in size. Making up some simple numbers to keep the 
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arithmetic transparent, suppose that the earth rotates from west to 
east at 1000 km/hr, and that the planes fly at 300 km/hr. Then the 
speed of the clock on the ground is 1000 km/hr, the speed of the 
clock on the east-going plane is 1300 km/hr, and that of the west- 
going clock 700 km/hr. Since the speeds of 700, 1000, and 1300 
km/hr have equal spacing on either side of 1000, we would expect 
the discrepancies of the moving clocks relative to the one in the lab 
to be equal in size but opposite in sign. 


In fact, the two effects are unequal in size: —59 ns and 273 ns. 
This implies that there is a second effect involved, simply due to the 
planes’ being up in the air. Relativity predicts that time’s rate of 
flow also changes with height in a gravitational field. Einstein didn’t 
figure out how to incorporate gravity into relativity until 1915, after 
much frustration and many false starts. The simpler version of the 
theory without gravity is known as special relativity, the full version 
as general relativity. We’ll restrict ourselves to special relativity in 
this book, and that means that what we want to focus on right now 
is the distortion of time due to motion, not gravity. 


We can now see in more detail how to apply the correspondence 
principle. The behavior of the three clocks in the Hafele-Keating 
experiment shows that the amount of time distortion increases as 
the speed of the clock’s motion increases. Newton lived in an era 
when the fastest mode of transportation was a galloping horse, and 
the best pendulum clocks would accumulate errors of perhaps a 
minute over the course of several days. A horse is much slower 
than a jet plane, so the distortion of time would have had a relative 
size of only ~ 107~!° — much smaller than the clocks were capable 
of detecting. At the speed of a passenger jet, the effect is about 
10-!*, and state-of-the-art atomic clocks in 1971 were capable of 
measuring that. A GPS satellite travels much faster than a jet air- 
plane, and the effect on the satellite turns out to be ~ 107!°. The 
general idea here is that all physical laws are approximations, and 
approximations aren’t simply right or wrong in different situations. 
Approximations are better or worse in different situations, and the 
question is whether a particular approximation is good enough in a 
given situation to serve a particular purpose. The faster the motion, 
the worse the Newtonian approximation of absolute time. Whether 
the approximation is good enough depends on what you’re trying 
to accomplish. The correspondence principle says that the approxi- 
mation must have been good enough to explain all the experiments 
done in the centuries before Einstein came up with relativity. 


By the way, don’t get an inflated idea of the importance of the 
Hafele-Keating experiment. Special relativity had already been con- 
firmed by a vast and varied body of experiments decades before 1971. 
The only reason I’m giving such a prominent role to this experiment, 
which was actually more important as a test of general relativity, is 
that it is conceptually very direct. 


e/The correspondence _ prin- 
ciple requires that the relativistic 
distortion of time become small 
for small velocities. 


Section 7.1. Time Is Not Absolute 379 


a/Two events are given as 
points on a graph of position 
versus time. Joan of Arc helps to 
restore Charles VII to the throne. 
At a later time and a different 
position, Joan of Arc is sentenced 
to death. 


b/A change of units distorts 
an x-t graph. This graph depicts 
exactly the same events as figure 
a. The only change is that the x 
and t coordinates are measured 
using different units, so the grid is 
compressed in t and expanded in 
x. 


c/A convention we'll use to 
represent a distortion of time and 
space. 


7.2 Distortion of Space and Time 
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7.2.1 The Lorentz transformation 


Relativity says that when two observers are in different frames of 
reference, each observer considers the other one’s perception of time 
to be distorted. We'll also see that something similar happens to 
their observations of distances, so both space and time are distorted. 
What exactly is this distortion? How do we even conceptualize it? 


The idea isn’t really as radical as it might seem at first. We 
can visualize the structure of space and time using a graph with 
position and time on its axes. These graphs are familiar by now, 
but we’re going to look at them in a slightly different way. Before, we 
used them to describe the motion of objects. The grid underlying 
the graph was merely the stage on which the actors played their 
parts. Now the background comes to the foreground: it’s time and 
space themselves that we’re studying. We don’t necessarily need 
to have a line or a curve drawn on top of the grid to represent a 
particular object. We may, for example, just want to talk about 
events, depicted as points on the graph as in figure a. A distortion 
of the Cartesian grid underlying the graph can arise for perfectly 
ordinary reasons that Isaac Newton would have readily accepted. 
For example, we can simply change the units used to measure time 
and position, as in figure b. 


We’re going to have quite a few examples of this type, so I'll 
adopt the convention shown in figure c for depicting them. Figure 
c summarizes the relationship between figures a and b in a more 
compact form. The gray rectangle represents the original coordinate 
grid of figure a, while the grid of black lines represents the new 
version from figure b. Omitting the grid from the gray rectangle 
makes the diagram easier to decode visually. 


Our goal of unraveling the mysteries of special relativity amounts 
to nothing more than finding out how to draw a diagram like c 
in the case where the two different sets of coordinates represent 
measurements of time and space made by two different observers, 
each in motion relative to the other. Galileo and Newton thought 
they knew the answer to this question, but their answer turned 
out to be only approximately right. To avoid repeating the same 
mistakes, we need to clearly spell out what we think are the basic 
properties of time and space that will be a reliable foundation for 
our reasoning. I want to emphasize that there is no purely logical 
way of deciding on this list of properties. The ones I'll list are simply 
a summary of the patterns observed in the results from a large body 
of experiments. Furthermore, some of them are only approximate. 
For example, property 1 below is only a good approximation when 
the gravitational field is weak, so it is a property that applies to 
special relativity, not to general relativity. 
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Experiments show that: 


1. No point in time or space has properties that make it different 
from any other point. 


2. Likewise, all directions in space have the same properties. 


3. Motion is relative, i.e., all inertial frames of reference are 
equally valid. 


4. Causality holds, in the sense described on page 377. 


5. Time depends on the state of motion of the observer. 


Most of these are not very subversive. Properties 1 and 2 date 
back to the time when Galileo and Newton started applying the 
same universal laws of motion to the solar system and to the earth; 
this contradicted Aristotle, who believed that, for example, a rock 
would naturally want to move in a certain special direction (down) 
in order to reach a certain special location (the earth’s surface). 
Property 3 is the reason that Einstein called his theory “relativity,” 
but Galileo and Newton believed exactly the same thing to be true, 
as dramatized by Galileo’s run-in with the Church over the question 
of whether the earth could really be in motion around the sun. 
Property 4 would probably surprise most people only because it 
asserts in such a weak and specialized way something that they feel 
deeply must be true. The only really strange item on the list is 5, 
but the Hafele-Keating experiment forces it upon us. 


If it were not for property 5, we could imagine that figure d 
would give the correct transformation between frames of reference 
in motion relative to one another. Let’s say that observer 1, whose 
grid coincides with the gray rectangle, is a hitch-hiker standing by 
the side of a road. Event A is a raindrop hitting his head, and 
event B is another raindrop hitting his head. He says that A and B 
occur at the same location in space. Observer 2 is a motorist who 
drives by without stopping; to him, the passenger compartment of 
his car is at rest, while the asphalt slides by underneath. He says 
that A and B occur at different points in space, because during the 
time between the first raindrop and the second, the hitch-hiker has 
moved backward. On the other hand, observer 2 says that events A 
and C occur in the same place, while the hitch-hiker disagrees. The 
slope of the grid-lines is simply the velocity of the relative motion 
of each observer relative to the other. 


Figure d has familiar, comforting, and eminently sensible behav- 
ior, but it also happens to be wrong, because it violates property 
5. The distortion of the coordinate grid has only moved the vertical 
lines up and down, so both observers agree that events like B and 
C are simultaneous. If this was really the way things worked, then 
all observers could synchronize all their clocks with one another for 


d/A Galilean version of the 
relationship between two frames 
of reference. As in all such 
graphs in this chapter, the original 
coordinates, represented by the 
gray rectangle, have a time axis 
that goes to the right, and a 
position axis that goes straight 


up. 
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e/A transformation that leads 
to disagreements about whether 
two events occur at the same 
time and place. This is not just 
a matter of opinion. Either the 
arrow hit the bull’s-eye or it didn't. 


f / A nonlinear transformation. 


once and for all, and the clocks would never get out of sync. This 
contradicts the results of the Hafele-Keating experiment, in which 
all three clocks were initially synchronized in Washington, but later 
went out of sync because of their different states of motion. 


It might seem as though we still had a huge amount of wiggle 
room available for the correct form of the distortion. It turns out, 
however, that properties 1-5 are sufficient to prove that there is only 
one answer, which is the one found by Einstein in 1905. To see why 
this is, let’s work by a process of elimination. 


Figure e shows a transformation that might seem at first glance 
to be as good a candidate as any other, but it violates property 
3, that motion is relative, for the following reason. In observer 2’s 
frame of reference, some of the grid lines cross one another. This 
means that observers 1 and 2 disagree on whether or not certain 
events are the same. For instance, suppose that event A marks the 
arrival of an arrow at the bull’s-eye of a target, and event B is the 
location and time when the bull’s-eye is punctured. Events A and B 
occur at the same location and at the same time. If one observer says 
that A and B coincide, but another says that they don’t, we have 
a direct contradiction. Since the two frames of reference in figure 
e give contradictory results, one of them is right and one is wrong. 
This violates property 3, because all inertial frames of reference are 
supposed to be equally valid. To avoid problems like this, we clearly 
need to make sure that none of the grid lines ever cross one another. 


The next type of transformation we want to kill off is shown in 
figure f, in which the grid lines curve, but never cross one another. 
The trouble with this one is that it violates property 1, the unifor- 
mity of time and space. The transformation is unusually “twisty” 
at A, whereas at B it’s much more smooth. This can’t be correct, 
because the transformation is only supposed to depend on the rela- 
tive state of motion of the two frames of reference, and that given 
information doesn’t single out a special role for any particular point 
in spacetime. If, for example, we had one frame of reference rotating 
relative to the other, then there would be something special about 
the axis of rotation. But we’re only talking about inertial frames of 
reference here, as specified in property 3, so we can’t have rotation; 
each frame of reference has to be moving in a straight line at con- 
stant speed. For frames related in this way, there is nothing that 
could single out an event like A for special treatment compared to 
B, so transformation f violates property 1. 


The examples in figures e and f show that the transformation 
we’re looking for must be linear, meaning that it must transform 
lines into lines, and furthermore that it has to take parallel lines to 
parallel lines. Einstein wrote in his 1905 paper that “...on account 
of the property of homogeneity [property 1] which we ascribe to time 
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and space, the [transformation] must be linear.”! Applying this to 
our diagrams, the original gray rectangle, which is a special type 
of parallelogram containing right angles, must be transformed into 
another parallelogram. There are three types of transformations, 
figure g, that have this property. Case I is the Galilean transforma- 
tion of figure d on page 381, which we’ve already ruled out. 


g/ Three types of transformations that preserve parallelism. Their 
distinguishing feature is what they do to simultaneity, as shown by what 
happens to the left edge of the original rectangle. In |, the left edge 
remains vertical, so simultaneous events remain simultaneous. In Il, the 
left edge turns counterclockwise. In Ill, it turns clockwise. 


Case IT can also be discarded. Here every point on the grid ro- 
tates counterclockwise. What physical parameter would determine 
the amount of rotation? The only thing that could be relevant would 
be v, the relative velocity of the motion of the two frames of reference 
with respect to one another. But if the angle of rotation was pro- 
portional to v, then for large enough velocities the grid would have 
left and right reversed, and this would violate property 4, causality: 
one observer would say that event A caused a later event B, but 
another observer would say that B came first and caused A. 


The only remaining possibility is case II, which I’ve redrawn 
in figure h with a couple of changes. This is the one that Einstein 
predicted in 1905. The transformation is known as the Lorentz 
transformation, after Hendrik Lorentz (1853-1928), who partially 
anticipated Einstein’s work, without arriving at the correct inter- 
pretation. The distortion is a kind of smooshing and stretching, 
as suggested by the hands. Also, we’ve already seen in figures a-c 
on page 380 that we’re free to stretch or compress everything as 
much as we like in the horizontal and vertical directions, because 
this simply corresponds to choosing different units of measurement 
for time and distance. In figure h I’ve chosen units that give the 
whole drawing a convenient symmetry about a 45-degree diagonal 
line. Ordinarily it wouldn’t make sense to talk about a 45-degree 
angle on a graph whose axes had different units. But in relativity, 
the symmetric appearance of the transformation tells us that space 


” 


‘A. Einstein, “On the Electrodynamics of Moving Bodies,” Annalen der 


Physik 17 (1905), p. 891, tr. Saha and Bose, 1920 


h/In the units that are most 
convenient for relativity, the trans- 
formation has symmetry about a 
45-degree diagonal line. 
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i/ Interpretation of the Lorentz 


transformation. The slope in- 
dicated in the figure gives the 
relative velocity of the two frames 
of reference. Events A and B that 
were simultaneous in frame 1 
are not simultaneous in frame 2, 
where event A occurs to the right 
of the t = O line represented by 
the left edge of the grid, but event 
B occurs to its left. 


and time ought to be treated on the same footing, and measured in 
the same units. 


As in our discussion of the Galilean transformation, slopes are 
interpreted as velocities, and the slope of the near-horizontal lines 
in figure i is interpreted as the relative velocity of the two observers. 
The difference between the Galilean version and the relativistic one 
is that now there is smooshing happening from the other side as 
well. Lines that were vertical in the original grid, representing si- 
multaneous events, now slant over to the right. This tells us that, as 
required by property 5, different observers do not agree on whether 
events that occur in different places are simultaneous. The Hafele- 
Keating experiment tells us that this non-simultaneity effect is fairly 
small, even when the velocity is as big as that of a passenger jet, 
and this is what we would have anticipated by the correspondence 
principle. The way that this is expressed in the graph is that if we 
pick the time unit to be the second, then the distance unit turns out 
to be hundreds of thousands of miles. In these units, the velocity 
of a passenger jet is an extremely small number, so the slope v in 
figure i is extremely small, and the amount of distortion is tiny — 
it would be much too small to see on this scale. 


The only thing left to determine about the Lorentz transforma- 
tion is the size of the transformed parallelogram relative to the size 
of the original one. Although the drawing of the hands in figure h 
may suggest that the grid deforms like a framework made of rigid 
coat-hanger wire, that is not the case. If you look carefully at the 
figure, you’ll see that the edges of the smooshed parallelogram are 
actually a little longer than the edges of the original rectangle. In 
fact what stays the same is not lengths but areas, as proved in the 
caption to figure j. 


7.2.2 The + factor 


With a little algebra and geometry (homework problem 7, page 
402), one can use the equal-area property to show that the factor 
(Greek letter gamma) defined in figure k is given by the equation 


1 
1= a 


If you’ve had good training in physics, the first thing you probably 
think when you look at this equation is that it must be nonsense, 
because its units don’t make sense. How can we take something 
with units of velocity squared, and subtract it from a unitless 1? 
But remember that this is expressed in our special relativistic units, 
in which the same units are used for distance and time. In this 
system, velocities are always unitless. This sort of thing happens 
frequently in physics. For instance, before James Joule discovered 
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j/ Proof that Lorentz transformations don’t change area: We first subject a square to a transformation 
with velocity v, and this increases its area by a factor R(v), which we want to prove equals 1. We chop the 
resulting parallelogram up into little squares and finally apply a —v transformation; this changes each little 
square’s area by a factor R(—v), so the whole figure’s area is also scaled by R(—v). The final result is to restore 
the square to its original shape and area, so A(v)A(—v) = 1. But A(v) = A(—v) by property 2 of spacetime on 
page 381, which states that all directions in space have the same properties, so A(v) = 1. 


conservation of energy, nobody knew that heat and mechanical en- 
ergy were different forms of the same thing, so instead of measuring 
them both in units of joules as we would do now, they measured 
heat in one unit (such as calories) and mechanical energy in another 
(such as foot-pounds). In ordinary metric units, we just need an 
extra convension factor c, and the equation becomes 


Here’s why we care about y. Figure k defines it as the ratio of two 
times: the time between two events as expressed in one coordinate 
system, and the time between the same two events as measured in 
the other one. The interpretation is: 


Time dilation 

A clock runs fastest in the frame of reference of an observer 
who is at rest relative to the clock. An observer in motion 
relative to the clock at speed v perceives the clock as running k/ The y factor. 
more slowly by a factor of +. 
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|/ Apparatus used for the test 
of relativistic time dilation de- 
scribed in example 1. The promi- 
nent black and white blocks are 
large magnets surrounding a cir- 
cular pipe with a vacuum inside. 
(c) 1974 by CERN. 
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m/Muons accelerated to nearly 
c undergo radioactive decay 
much more slowly than they 
would according to an observer 
at rest with respect to the muons. 
The first two data-points (unfilled 
circles) were subject to large 
systematic errors. 
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Since the Lorentz transformation treats time and distance entirely 
symmetrically, we could just as well have defined y using the upright 
x axis in figure k, and we therefore have a similar interpretation in 
terms of space: 


Length contraction 

A meter-stick appears longest to an observer who is at rest 
relative to it. An observer moving relative to the meter-stick 
at v observes the stick to be shortened by a factor of +. 


self-check A 
What is y when v = 0? What does this mean? 


> Answer, p. 869 


Large time dilation example 1 
The time dilation effect in the Hafele-Keating experiment was very 
small. If we want to see a large time dilation effect, we can’t do 
it with something the size of the atomic clocks they used; the ki- 
netic energy would be greater than the total megatonnage of all 
the world’s nuclear arsenals. We can, however, accelerate sub- 
atomic particles to speeds at which y is large. For experimental 
particle physicists, relativity is something you do all day before 
heading home and stopping off at the store for milk. An early, low- 
precision experiment of this kind was performed by Rossi and Hall 
in 1941, using naturally occurring cosmic rays. Figure | shows a 
1974 experiment? of a similar type which verified the time dilation 
predicted by relativity to a precision of about one part per thou- 
sand. 


Particles called muons (named after the Greek letter u, “myoo”) 
were produced by an accelerator at CERN, near Geneva. A muon 


Bailey at al., Nucl. Phys. B150(1979) 1 
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is essentially a heavier version of the electron. Muons undergo 
radioactive decay, lasting an average of only 2.197 us before they 
evaporate into an electron and two neutrinos. The 1974 experi- 
ment was actually built in order to measure the magnetic proper- 
ties of muons, but it produced a high-precision test of time dilation 
as a byproduct. Because muons have the same electric charge 
as electrons, they can be trapped using magnetic fields. Muons 
were injected into the ring shown in figure I, circling around it un- 
til they underwent radioactive decay. At the speed at which these 
muons were traveling, they had y = 29.33, so on the average they 
lasted 29.33 times longer than the normal lifetime. In other words, 
they were like tiny alarm clocks that self-destructed at a randomly 
selected time. Figure m shows the number of radioactive decays 
counted, as a function of the time elapsed after a given stream of 
muons was injected into the storage ring. The two dashed lines 
show the rates of decay predicted with and without relativity. The 
relativistic line is the one that agrees with experiment. 


An example of length contraction example 2 
Figure n shows an artist’s rendering of the length contraction for 
the collision of two gold nuclei at relativistic speeds in the RHIC 
accelerator in Long Island, New York, which went on line in 2000. 
The gold nuclei would appear nearly spherical (or just slightly 
lengthened like an American football) in frames moving along with 
them, but in the laboratory’s frame, they both appear drastically 
foreshortened as they approach the point of collision. The later 
pictures show the nuclei merging to form a hot soup, in which 
experimenters hope to observe a new form of matter. 


n/Colliding nuclei show relativis- 
tic length contraction. 
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o/Example 3: In the garage’s frame of reference, the bus is moving, and can fit in the garage due to 
its length contraction. In the bus’s frame of reference, the garage is moving, and can’t hold the bus due to its 


length contraction. 


The garage paradox example 3 
One of the most famous of all the so-called relativity paradoxes 
has to do with our incorrect feeling that simultaneity is well de- 
fined. The idea is that one could take a schoolbus and drive it at 
relativistic speeds into a garage of ordinary size, in which it nor- 
mally would not fit. Because of the length contraction, the bus 
would supposedly fit in the garage. The driver, however, will per- 
ceive the garage as being contracted and thus even less able to 
contain the bus. 


The paradox is resolved when we recognize that the concept of 
fitting the bus in the garage “all at once” contains a hidden as- 
sumption, the assumption that it makes sense to ask whether the 
front and back of the bus can simultaneously be in the garage. 
Observers in different frames of reference moving at high rela- 
tive speeds do not necessarily agree on whether things happen 
simultaneously. As shown in figure 0, the person in the garage’s 
frame can shut the door at an instant B he perceives to be si- 
multaneous with the front bumper’s arrival A at the back wall of 
the garage, but the driver would not agree about the simultaneity 
of these two events, and would perceive the door as having shut 
long after she plowed through the back wall. 
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7.2.3. The universal speed c 


Let’s think a little more about the role of the 45-degree diagonal 
in the Lorentz transformation. Slopes on these graphs are inter- 
preted as velocities. This line has a slope of 1 in relativistic units, 
but that slope corresponds to c in ordinary metric units. We al- 
ready know that the relativistic distance unit must be extremely 
large compared to the relativistic time unit, so c must be extremely 
large. Now note what happens when we perform a Lorentz transfor- 
mation: this particular line gets stretched, but the new version of 
the line lies right on top of the old one, and its slope stays the same. 
In other words, if one observer says that something has a velocity 
equal to c, every other observer will agree on that velocity as well. 
(The same thing happens with —c.) 


Velocities don’t simply add and subtract. 


This is counterintuitive, since we expect velocities to add and 
subtract in relative motion. If a dog is running away from me at 5 
m/s relative to the sidewalk, and I run after it at 3 m/s, the dog’s 
velocity in my frame of reference is 2 m/s. According to everything 
we have learned about motion, the dog must have different speeds 
in the two frames: 5 m/s in the sidewalk’s frame and 2 m/s in 
mine. But velocities are measured by dividing a distance by a time, 
and both distance and time are distorted by relativistic effects, so 
we actually shouldn’t expect the ordinary arithmetic addition of 
velocities to hold in relativity; it’s an approximation that’s valid at 
velocities that are small compared to c. 


A universal speed limit 


For example, suppose Janet takes a trip in a spaceship, and 
accelerates until she is moving at 0.6c relative to the earth. She 
then launches a space probe in the forward direction at a speed 
relative to her ship of 0.6c. We might think that the probe was then 
moving at a velocity of 1.2c, but in fact the answer is still less than 
c (problem 1, page 401). This is an example of a more general fact 
about relativity, which is that c represents a universal speed limit. 
This is required by causality, as shown in figure p. 


Light travels at c. 


Now consider a beam of light. We’re used to talking casually 
about the “speed of light,” but what does that really mean? Motion 
is relative, so normally if we want to talk about a velocity, we have 
to specify what it’s measured relative to. A sound wave has a certain 
speed relative to the air, and a water wave has its own speed relative 
to the water. If we want to measure the speed of an ocean wave, for 
example, we should make sure to measure it in a frame of reference 
at rest relative to the water. But light isn’t a vibration of a physical 
medium; it can propagate through the near-perfect vacuum of outer 
space, as when rays of sunlight travel to earth. This seems like a 
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p/A proof that causality im- 
poses a universal speed limit. In 
the original frame of reference, 
represented by the square, event 
A happens a little before event B. 
In the new frame, shown by the 
parallelogram, A happens after 
t = 0, but B happens before t = 0; 
that is, B happens before A. The 
time ordering of the two events 
has been reversed. This can only 
happen because events A and B 
are very close together in time 
and fairly far apart in space. The 
line segment connecting A and 
B has a slope greater than 1, 
meaning that if we wanted to be 
present at both events, we would 
have to travel at a speed greater 
than c (which equals 1 in the 
units used on this graph). You will 
find that if you pick any two points 
for which the slope of the line 
segment connecting them is less 
than 1, you can never get them to 
straddle the new t¢ = 0 line in this 
funny, time-reversed way. Since 
different observers disagree on 
the time order of events like A 
and B, causality requires that 
information never travel from 
A to B or from B to A; if it did, 
then we would have time-travel 
paradoxes. The conclusion is that 
cis the maximum speed of cause 
and effect in relativity. 


Distortion of Space and Time 389 


paradox: light is supposed to have a specific speed, but there is no 
way to decide what frame of reference to measure it in. The way 
out of the paradox is that light must travel at a velocity equal to c. 
Since all observers agree on a velocity of c, regardless of their frame 
of reference, everything is consistent. 


The Michelson-Morley experiment 


The constancy of the speed of light had in fact already been 
observed when Einstein was an 8-year-old boy, but because nobody 
could figure out how to interpret it, the result was largely ignored. 

q/ The Michelson-Morley experi- In 1887 Michelson and Morley set up a clever apparatus to measure 
ment, shown in photographs, and 


: je any difference in the speed of light beams traveling east-west and 
drawings from the original 1887 


haber. 4s- sAsainipliied arene north-south. The motion of the earth around the sun at 110,000 
ing of the apparatus. A beam of km/hour (about 0.01% of the speed of light) is to our west during the 
light from the source, s, is par- day. Michelson and Morley believed that light was a vibration of a 
tially reflected and partially trans- mysterious medium called the ether, so they expected that the speed 
mitted by the half-silvered mirror of light would be a fixed value relative to the ether. As the earth 
hy. The two half-intensity parts of moved through the ether, they thought they would observe an effect 
the beam are reflected by the mir- on the velocity of light along an east-west line. For instance, if they 
rors at a and b, reunited, and ob- released a beam of light in a westward direction during the day, they 
sotved in tie: ileseope; tI ne expected that it would move away from them at less than the normal 


earth’s surface was supposed to wae 
be moving through the aes ‘hen speed because the earth was chasing it through the ether. They were 


the times taken by the two light surprised when they found that the expected 0.01% change in the 
waves to pass through the mov- speed of light did not occur. 

ing ether would be unequal, and 
the resulting time lag would be 
detectable by observing the inter- 
ference between the waves when a 
they were reunited. 2. In the real Se 
apparatus, the light beams were SER Y 
reflected multiple times. The ef- yk We 
fective length of each arm was “ “ey 
increased to 11 meters, which SSUSSe GT 
greatly improved its sensitivity to YORE 
the small expected difference in 
the speed of light. 3. In an 2 : See SoS 
earlier version of the experiment, ne : eae SQ 
they had run into problems with | eae eee 
its “extreme sensitiveness to vi- ae 
bration,’ which was “so great that , 
it was impossible to see the in- 1 2 
terference fringes except at brief 
intervals ...even at two o’clock 
in the morning.” They therefore 
mounted the whole thing on a 
massive stone floating in a pool of 
mercury, which also made it pos- 
sible to rotate it easily. 4. A photo 
of the apparatus. 
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Discussion Questions 


A A person in a spaceship moving at 99.99999999% of the speed 
of light relative to Earth shines a flashlight forward through dusty air, so 
the beam is visible. What does she see? What would it look like to an 
observer on Earth? 


B A question that students often struggle with is whether time and 
space can really be distorted, or whether it just seems that way. Compare 
with optical illusions or magic tricks. How could you verify, for instance, 
that the lines in the figure are actually parallel? Are relativistic effects the 
same, or not? 


C  Onaspaceship moving at relativistic speeds, would a lecture seem 
even longer and more boring than normal? 


D Mechanical clocks can be affected by motion. For example, it was 
a significant technological achievement to build a clock that could sail 
aboard a ship and still Keep accurate time, allowing longitude to be deter- 
mined. How is this similar to or different from relativistic time dilation? 


E Figure n from page 387, depicting the collision of two nuclei at the 
RHIC accelerator, is reproduced below. What would the shapes of the two 
nuclei look like to a microscopic observer riding on the left-hand nucleus? 
To an observer riding on the right-hand one? Can they agree on what is 
happening? If not, why not — after all, shouldn’t they see the same thing 
if they both compare the two nuclei side-by-side at the same instant in 
time? 


r/Discussion question E: colliding nuclei show relativistic length 
contraction. 


= 


Discussion question B 
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F If you stick a piece of foam rubber out the window of your car while 
driving down the freeway, the wind may compress it a little. Does it make 
sense to interpret the relativistic length contraction as a type of strain 
that pushes an object’s atoms together like this? How does this relate to 
discussion question E? 


7.3 Dynamics 


So far we have said nothing about how to predict motion in relativ- 
ity. Do Newton’s laws still work? Do conservation laws still apply? 
The answer is yes, but many of the definitions need to be modified, 
and certain entirely new phenomena occur, such as the conversion 
of mass to energy and energy to mass, as described by the famous 


equation E = mc?. 


7.3.1 Momentum 


Consider the following scheme for traveling faster than the speed 
of light. The basic idea can be demonstrated by dropping a ping- 
pong ball and a baseball stacked on top of each other like a snowman. 
They separate slightly in mid-air, and the baseball therefore has time 
to hit the floor and rebound before it collides with the ping-pong 
ball, which is still on the way down. The result is a surprise if you 
haven’t seen it before: the ping-pong ball flies off at high speed and 
hits the ceiling! A similar fact is known to people who investigate 
the scenes of accidents involving pedestrians. If a car moving at 
90 kilometers per hour hits a pedestrian, the pedestrian flies off at 
nearly double that speed, 180 kilometers per hour. Now suppose 
the car was moving at 90 percent of the speed of light. Would the 
pedestrian fly off at 180% of c? 


To see why not, we have to back up a little and think about 
where this speed-doubling result comes from. For any collision, there 
is a special frame of reference, the center-of-mass frame, in which 
the two colliding objects approach each other, collide, and rebound 
with their velocities reversed. In the center-of-mass frame, the total 
momentum of the objects is zero both before and after the collision. 
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Figure a/1 shows such a frame of reference for objects of very 
unequal mass. Before the collision, the large ball is moving relatively 
slowly toward the top of the page, but because of its greater mass, 
its momentum cancels the momentum of the smaller ball, which is 
moving rapidly in the opposite direction. The total momentum is 
zero. After the collision, the two balls just reverse their directions of 
motion. We know that this is the right result for the outcome of the 
collision because it conserves both momentum and kinetic energy, 
and everything not forbidden is mandatory, i.e., in any experiment, 
there is only one possible outcome, which is the one that obeys all 
the conservation laws. 


self-check B 
How do we know that momentum and kinetic energy are conserved in 
figure a/1? > Answer, p. 869 


Let’s make up some numbers as an example. Say the small ball 
has a mass of 1 kg, the big one 8 kg. In frame 1, let’s make the 
velocities as follows: 


before the collision after the collision 
° | -0.8 0.8 
O | 0.1 -0.1 


Figure a/2 shows the same collision in a frame of reference where 
the small ball was initially at rest. To find all the velocities in this 
frame, we just add 0.8 to all the ones in the previous table. 


before the collision after the collision 


a / An unequal collision, viewed in 
the center-of-mass frame, 1, and 
in the frame where the small ball 
is initially at rest, 2. The motion 
is shown as it would appear on 
the film of an old-fashioned movie 
camera, with an equal amount of 
time separating each frame from 
the next. Film 1 was made by 
a camera that tracked the center 
of mass, film 2 by one that was 
initially tracking the small ball, 
and kept on moving at the same 
speed after the collision. 


° 10 
O | 0.9 


1.6 
0.7 
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b/An 8-kg ball moving at 83% 
of the speed of light hits a 1-kg 
ball. The balls appear foreshort- 
ened due to the relativistic distor- 
tion of space. 


In this frame, as expected, the small ball flies off with a velocity, 
1.6, that is almost twice the initial velocity of the big ball, 0.9. 


If all those velocities were in meters per second, then that’s ex- 
actly what happened. But what if all these velocities were in units 
of the speed of light? Now it’s no longer a good approximation just 
to add velocities. We need to combine them according to the rel- 
ativistic rules. For instance, the technique used in problem 1 can 
be used to show that combining a velocity of 0.8 times the speed 
of light with another velocity of 0.8 results in 0.98, not 1.6. The 
results are very different: 


before the collision after the collision 
° 10 0.98 
O | 0.83 0.76 


We can interpret this as follows. Figure a/1 is one in which the 
big ball is moving fairly slowly. This is very nearly the way the 
scene would be seen by an ant standing on the big ball. According 
to an observer in frame b, however, both balls are moving at nearly 
the speed of light after the collision. Because of this, the balls 
appear foreshortened, but the distance between the two balls is also 
shortened. To this observer, it seems that the small ball isn’t pulling 
away from the big ball very fast. 


Now here’s what’s interesting about all this. The outcome shown 
in figure a/2 was supposed to be the only one possible, the only 
one that satisfied both conservation of energy and conservation of 
momentum. So how can the different result shown in figure b be 
possible? The answer is that relativistically, momentum must not 
equal mv. The old, familiar definition is only an approximation 
that’s valid at low speeds. If we observe the behavior of the small 
ball in figure b, it looks as though it somehow had some extra inertia. 
It’s as though a football player tried to knock another player down 
without realizing that the other guy had a three-hundred-pound bag 
full of lead shot hidden under his uniform — he just doesn’t seem 
to react to the collision as much as he should. This extra inertia is 
described by redefining momentum as 


p=myv 
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At very low velocities, y is close to 1, and the result is very nearly 
mv, as demanded by the correspondence principle. But at very high 
velocities, y gets very big — the small ball in figure b has a ¥y of 
5.0, and therefore has five times more inertia than we would expect 
nonrelativistically. 


This also explains the answer to another paradox often posed 
by beginners at relativity. Suppose you keep on applying a steady 
force to an object that’s already moving at 0.9999c. Why doesn’t 
it just keep on speeding up past c? The answer is that force is the 
rate of change of momentum. At 0.9999c, an object already has a ~ 
of 71, and therefore has already sucked up 71 times the momentum 
you’d expect at that speed. As its velocity gets closer and closer to 
c, its y approaches infinity. To move at c, it would need an infinite 
momentum, which could only be caused by an infinite force. 


7.3.2. Equivalence of mass and energy 


Now we’re ready to see why mass and energy must be equivalent 
as claimed in the famous E = mc?. So far we’ve only considered 
collisions in which none of the kinetic energy is converted into any 
other form of energy, such as heat or sound. Let’s consider what 
happens if a blob of putty moving at velocity v hits another blob 
that is initially at rest, sticking to it. The nonrelativistic result is 
that to obey conservation of momentum the two blobs must fly off 
together at v/2. Half of the initial kinetic energy has been converted 
to heat. 


Relativistically, however, an interesting thing happens. A hot 
object has more momentum than a cold object! This is because 
the relativistically correct expression for momentum is myv, and 
the more rapidly moving atoms in the hot object have higher values 
of y. In our collision, the final combined blob must therefore be 
moving a little more slowly than the expected v/2, since otherwise 
the final momentum would have been a little greater than the initial 
momentum. To an observer who believes in conservation of momen- 
tum and knows only about the overall motion of the objects and not 
about their heat content, the low velocity after the collision would 
seem to be the result of a magical change in the mass, as if the mass 
of two combined, hot blobs of putty was more than the sum of their 
individual masses. 


Now we know that the masses of all the atoms in the blobs must 
be the same as they always were. The change is due to the change in 
y with heating, not to a change in mass. The heat energy, however, 
seems to be acting as if it was equivalent to some extra mass. 


3A double-mass object moving at half the speed does not have the same 
kinetic energy. Kinetic energy depends on the square of the velocity, so cutting 
the velocity in half reduces the energy by a factor of 1/4, which, multiplied by 
the doubled mass, makes 1/2 the original energy. 
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But this whole argument was based on the fact that heat is a 
form of kinetic energy at the atomic level. Would E = mc? apply to 
other forms of energy as well? Suppose a rocket ship contains some 
electrical energy stored in a battery. If we believed that E = mc? 
applied to forms of kinetic energy but not to electrical energy, then 
we would have to believe that the pilot of the rocket could slow 
the ship down by using the battery to run a heater! This would 
not only be strange, but it would violate the principle of relativity, 
because the result of the experiment would be different depending 
on whether the ship was at rest or not. The only logical conclusion is 
that all forms of energy are equivalent to mass. Running the heater 
then has no effect on the motion of the ship, because the total 
energy in the ship was unchanged; one form of energy (electrical) 
was simply converted to another (heat). 


The equation E = mc? tells us how much energy is equivalent 
to how much mass: the conversion factor is the square of the speed 
of light, c. Since c a big number, you get a really really big number 
when you multiply it by itself to get c?. This means that even a small 
amount of mass is equivalent to a very large amount of energy. 


c/ Example 4, page 397. 
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Gravity bending light example 4 
Gravity is a universal attraction between things that have mass, 
and since the energy in a beam of light is equivalent to a some 
very small amount of mass, we expect that light will be affected 
by gravity, although the effect should be very small. The first im- 
portant experimental confirmation of relativity came in 1919 when 
stars next to the sun during a solar eclipse were observed to have 
shifted a little from their ordinary position. (If there was no eclipse, 
the glare of the sun would prevent the stars from being observed.) 
Starlight had been deflected by the sun’s gravity. Figure c is a 
photographic negative, so the circle that appears bright is actu- 
ally the dark face of the moon, and the dark area is really the 
bright corona of the sun. The stars, marked by lines above and 
below then, appeared at positions slightly different than their nor- 
mal ones. 


Black holes example 5 
A star with sufficiently strong gravity can prevent light from leav- 
ing. Quite a few black holes have been detected via their gravita- 
tional forces on neighboring stars or clouds of gas and dust. 


You’ve learned about conservation of mass and conservation of 
energy, but now we see that they’re not even separate conservation 
laws. As a consequence of the theory of relativity, mass and en- 
ergy are equivalent, and are not separately conserved — one can 
be converted into the other. Imagine that a magician waves his 
wand, and changes a bowl of dirt into a bowl of lettuce. You’d be 
impressed, because you were expecting that both dirt and lettuce 
would be conserved quantities. Neither one can be made to vanish, 
or to appear out of thin air. However, there are processes that can 
change one into the other. A farmer changes dirt into lettuce, and 
a compost heap changes lettuce into dirt. At the most fundamen- 
tal level, lettuce and dirt aren’t really different things at all; they’re 
just collections of the same kinds of atoms — carbon, hydrogen, and 
so on. Because mass and energy are like two different sides of the 
same coin, we may speak of mass-energy, a single conserved quantity, 
found by adding up all the mass and energy, with the appropriate 
conversion factor: E + mc?. 


A rusting nail example 6 
> An iron nail is left in a cup of water until it turns entirely to rust. 
The energy released is about 0.5 MJ. In theory, would a suffi- 
ciently precise scale register a change in mass? If so, how much? 


> The energy will appear as heat, which will be lost to the envi- 
ronment. The total mass-energy of the cup, water, and iron will 
indeed be lessened by 0.5 Mu. (If it had been perfectly insulated, 
there would have been no change, since the heat energy would 
have been trapped in the cup.) The speed of light is c = 3 x 108 


LIGHTS ALL ASKEW 
IN THE HEAVENS 


Men of Science More or Less 
Agog Over Results of Eclipse 
Observations. 


EINSTEIN THEORY TRIUMPHS 


Stars Not Where They Seemed 
or Were Calculated to be, 
but Nobody Need Worry. 


A BOOK FOR 12 WISE MEN 


No More in All the Worid Could 
Comprehend It, Said Einstein When 
His Daring Publishers Accepted It. 


d/A New York Times head- 
line from November 10, 1919, 
describing the observations 
discussed in example 4. 
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e/ Top: A PET scanner. Middle: 
Each positron annihilates with an 
electron, producing two gamma- 
rays that fly off back-to-back. 
When two gamma rays are ob- 
served simultaneously in the ring 
of detectors, they are assumed to 
come from the same annihilation 
event, and the point at which they 
were emitted must lie on the line 
connecting the two detectors. 
Bottom: A scan of a person’s 
torso. The body has concentrated 
the radioactive tracer around the 
stomach, indicating an abnormal 
medical condition. 


meters per second, so converting to mass units, we have 
M= C2 
_ 0.5 x 106 J 
(3 x 108 m/s)* 
= 6 x 107!” kilograms 


The change in mass is too small to measure with any practical 
technique. This is because the square of the speed of light is 
such a large number. 


‘Electron-positron annihilation example 7 
Natural radioactivity in the earth produces positrons, which are 
like electrons but have the opposite charge. A form of antimat- 
ter, positrons annihilate with electrons to produce gamma rays, a 
form of high-frequency light. Such a process would have been 
considered impossible before Einstein, because conservation of 
mass and energy were believed to be separate principles, and 
this process eliminates 100% of the original mass. The amount 
of energy produced by annihilating 1 kg of matter with 1 kg of 
antimatter is 


E = mc? 


2 
= (2 kg) (3.0 x 108 m/s) 
=2x107J 


which is on the same order of magnitude as a day’s energy con- 
sumption for the entire world’s population! 


Positron annihilation forms the basis for the medical imaging tech- 
nique called a PET (positron emission tomography) scan, in which 
a positron-emitting chemical is injected into the patient and map- 
ped by the emission of gamma rays from the parts of the body 
where it accumulates. 


One commonly hears some misinterpretations of E = mc’, one 


being that the equation tells us how much kinetic energy an object 
would have if it was moving at the speed of light. This wouldn’t 
make much sense, both because the equation for kinetic energy has 
1/2 in it, KE = (1/2)mv”, and because a material object can’t be 
made to move at the speed of light. However, this naturally leads 
to the question of just how much mass-energy a moving object has. 
We know that when the object is at rest, it has no kinetic energy, so 
its mass-energy is simply equal to the energy-equivalent of its mass, 


me?, 


E = mc? when v = 0 ; 


where the symbol E stands for mass-energy. (You can write this 
symbol yourself by writing an E, and then adding an extra line to 
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it. Have fun!) The point of using the new symbol is simply to 
remind ourselves that we’re talking about relativity, so an object at 
rest has E = mc’, not E = 0 as we’d assume in classical physics. 


Suppose we start accelerating the object with a constant force. 
A constant force means a constant rate of transfer of momentum, 
but p = myv approaches infinity as v approaches c, so the object 
will only get closer and closer to the speed of light, but never reach 
it. Now what about the work being done by the force? The force 
keeps doing work and doing work, which means that we keep on 
using up energy. Mass-energy is conserved, so the energy being 
expended must equal the increase in the object’s mass-energy. We 
can continue this process for as long as we like, and the amount of 
mass-energy will increase without limit. We therefore conclude that 
an object’s mass-energy approaches infinity as its speed approaches 
the speed of light, 


E+ oo when v > c¢ 


Now that we have some idea what to expect, what is the actual 
equation for the mass-energy? As proved in my book Simple Nature, 
it is 


E = myc? 
self-check C 
Verify that this equation has the two properties we wanted. > 
Answer, p. 869 
KE compared to mc? at low speeds example 8 


> An object is moving at ordinary nonrelativistic speeds. Compare 
its kinetic energy to the energy mc? it has purely because of its 
mass. 


> The speed of light is a very big number, so mc? is a huge num- 
ber of joules. The object has a gigantic amount of energy be- 
cause of its mass, and only a relatively small amount of additional 
kinetic energy because of its motion. 


Another way of seeing this is that at low speeds, y is only a tiny 
bit greater than 1, so E is only a tiny bit greater than mc?. 


The correspondence principle for mass-energy example 9 
> Show that the equation E = myc? obeys the correspondence 
principle. 


> As we accelerate an object from rest, its mass-energy becomes 
greater than its resting value. Classically, we interpret this excess 
mass-energy as the object’s kinetic energy, 


KE = E(v) — E(v = 0) 
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Dynamics 


399 


400 


Expressing y as (1 — v2/o2)-1/? and making use of the approx- 
imation (1 + €)? = 1+ pe for small e, we have y = 1 + v2/2c?, 
SO 


Ths 
=m 
a 


which is the classical expression. As demanded by the corre- 
spondence principle, relativity agrees with classical physics at 
speeds that are small compared to the speed of light. 
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Problems 
The symbols Vv, “, etc. are explained on page 405. 


1 The figure illustrates a Lorentz transformation using the 
conventions employed in section 7.2. For simplicity, the transfor- 
mation chosen is one that lengthens one diagonal by a factor of 2. 
Since Lorentz transformations preserve area, the other diagonal is 
shortened by a factor of 2. Let the original frame of reference, de- 
picted with the square, be A, and the new one B. (a) By measuring 
with a ruler on the figure, show that the velocity of frame B rela- 
tive to frame A is 0.6c. (b) Print out a copy of the page. With a 
ruler, draw a third parallelogram that represents a second succes- 
sive Lorentz transformation, one that lengthens the long diagonal 
by another factor of 2. Call this third frame C. Use measurements 
with a ruler to determine frame C’s velocity relative to frame A. 
Does it equal double the velocity found in part a? Explain why it 
should be expected to turn out the way it does. Vv of 
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2 Astronauts in three different spaceships are communicating 
with each other. Those aboard ships A and B agree on the rate at 
which time is passing, but they disagree with the ones on ship C. 
(a) Alice is aboard ship A. How does she describe the motion of her 
own ship, in its frame of reference? 

(b) Describe the motion of the other two ships according to Alice. 
(c) Give the description according to Betty, whose frame of reference 
is ship B. 

(d) Do the same for Cathy, aboard ship C. ie 


3 What happens in the equation for 7 when you put in a negative 
number for v? Explain what this means physically, and why it makes 
sense. z 


4 The Voyager 1 space probe, launched in 1977, is moving faster 
relative to the earth than any other human-made object, at 17,000 
meters per second. 

(a) Calculate the probe’s 4. 

(b) Over the course of one year on earth, slightly less than one year 
passes on the probe. How much less? (There are 31 million seconds 
in a year.) Vv 


5 In example 1 on page 386, I remarked that accelerating a 
macroscopic (i.e., not microscopic) object to close to the speed of 
light would require an unreasonable amount of energy. Suppose that 
the starship Enterprise from Star Trek has a mass of 8.0 x 10’ kg, 
about the same as the Queen Elizabeth 2. Compute the kinetic 
energy it would have to have if it was moving at half the speed of 
light. Compare with the total energy content of the world’s nuclear 
arsenals, which is about 107! J. vo 


6 The earth is orbiting the sun, and therefore is contracted 
relativistically in the direction of its motion. Compute the amount 
by which its diameter shrinks in this direction. v 


~ 


7  Inthis homework problem, you'll fill in the steps of the algebra 
required in order to find the equation for y on page 384. To keep the 
algebra simple, let the time t in figure k equal 1, as suggested in the 
figure accompanying this homework problem. The original square 
then has an area of 1, and the transformed parallelogram must also 
have an area of 1. (a) Prove that point P is at x = v7, so that its 
(t, x) coordinates are (y, vy). (b) Find the (¢, x) coordinates of point 
Q. (c) Find the length of the short diagonal connecting P and Q. 
(d) Average the coordinates of P and Q to find the coordinates of 
the midpoint C of the parallelogram, and then find distance OC. (e) 
Find the area of the parallelogram by computing twice the area of 
triangle PQO. [Hint: You can take PQ to be the base of the triangle.] 
(f) Set this area equal to 1 and solve for y to prove 7 = 1/V1— v?. 
of 


~_ 


Problem 7. 
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8 (a) A free neutron (as opposed to a neutron bound into an 
atomic nucleus) is unstable, and undergoes beta decay (which you 
may want to review). The masses of the particles involved are as 
follows: 


neutron 1.67495 x 107?” ke 
proton 1.67265 x 10-2” kg 
electron 0.00091 x 10-27 kg 


antineutrino < 107% kg 


Find the energy released in the decay of a free neutron. Vv 
(b) Neutrons and protons make up essentially all of the mass of the 
ordinary matter around us. We observe that the universe around us 
has no free neutrons, but lots of free protons (the nuclei of hydrogen, 
which is the element that 90% of the universe is made of). We find 
neutrons only inside nuclei along with other neutrons and protons, 
not on their own. 


If there are processes that can convert neutrons into protons, 
we might imagine that there could also be proton-to-neutron con- 
versions, and indeed such a process does occur sometimes in nuclei 
that contain both neutrons and protons: a proton can decay into a 
neutron, a positron, and a neutrino. A positron is a particle with 
the same properties as an electron, except that its electrical charge 
is positive (see chapter 7). A neutrino, like an antineutrino, has 
negligible mass. 


Although such a process can occur within a nucleus, explain why 
it cannot happen to a free proton. (If it could, hydrogen would be 


radioactive, and you wouldn’t exist!) E 
9 (a) Find a relativistic equation for the velocity of an object 
in terms of its mass and momentum (eliminating 7). v 


(b) Show that your result is approximately the same as the classical 
value, p/m, at low velocities. 
(c) Show that very large momenta result in speeds close to the speed 


of light. 7 
10 (a) Show that for v = (3/5)c, 7 comes out to be a simple 
fraction. 

(b) Find another value of v for which 7 is a simple fraction. E 


11 An object moving at a speed very close to the speed of light 
is referred to as ultrarelativistic. Ordinarily (luckily) the only ul- 
trarelativistic objects in our universe are subatomic particles, such 
as cosmic rays or particles that have been accelerated in a particle 
accelerator. 

(a) What kind of number is y for an ultrarelativistic particle? 

(b) Repeat example 8 on page 399, but instead of very low, nonrel- 
ativistic speeds, consider ultrarelativistic speeds. 

(c) Find an equation for the ratio E/p. The speed may be relativis- 
tic, but don’t assume that it’s ultrarelativistic. Vv 
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(d) Simplify your answer to part c for the case where the speed is 
ultrarelativistic. Vv 
(e) We can think of a beam of light as an ultrarelativistic object — 
it certainly moves at a speed that’s sufficiently close to the speed 
of light! Suppose you turn on a one-watt flashlight, leave it on for 
one second, and then turn it off. Compute the momentum of the 


recoiling flashlight, in units of kg-m/s. v 
(f) Discuss how your answer in part e relates to the correspondence 
principle. x 


12 As discussed in chapter 6, the speed at which a disturbance 
travels along a string under tension is given by v = \/T/j, where pu 
is the mass per unit length, and T is the tension. 

(a) Suppose a string has a density p, and a cross-sectional area A. 
Find an expression for the maximum tension that could possibly 
exist in the string without producing v > c, which is impossible 
according to relativity. Express your answer in terms of p, A, and 
c. The interpretation is that relativity puts a limit on how strong 
any material can be. v 
(b) Every substance has a tensile strength, defined as the force 
per unit area required to break it by pulling it apart. The ten- 
sile strength is measured in units of N/m?, which is the same as the 
pascal (Pa), the mks unit of pressure. Make a numerical estimate 
of the maximum tensile strength allowed by relativity in the case 
where the rope is made out of ordinary matter, with a density on 
the same order of magnitude as that of water. (For comparison, 
kevlar has a tensile strength of about 4 x 10° Pa, and there is spec- 
ulation that fibers made from carbon nanotubes could have values 
as high as 6 x 10! Pa.) Vv 
(c) A black hole is a star that has collapsed and become very dense, 
so that its gravity is too strong for anything ever to escape from it. 
For instance, the escape velocity from a black hole is greater than 
c, so a projectile can’t be shot out of it. Many people, when they 
hear this description of a black hole in terms of an escape velocity 
greater than c, wonder why it still wouldn’t be possible to extract 
an object from a black hole by other means than launching it out 
as a projectile. For example, suppose we lower an astronaut into a 
black hole on a rope, and then pull him back out again. Why might 
this not work? 


13 (a) A charged particle is surrounded by a uniform electric 
field. Starting from rest, it is accelerated by the field to speed v after 
traveling a distance d. Now it is allowed to continue for a further 
distance 3d, for a total displacement from the start of 4d. What 
speed will it reach, assuming classical physics? 

(b) Find the relativistic result for the case of v = c/2. - 


14 In the equation for the relativistic addition of velocities u 
and v, consider the limit in which wu approaches 1, but v simulta- 
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neously approaches —1. Give both a physical and a mathematical 
interpretation. g 


15 Expand the equation K = m(y-— 1) in a Taylor series, and 
find the first two nonvaninishing terms. Show that the first term is 
the classical expression for kinetic energy. & 


16 Expand the relativistic equation for momentum in a Taylor 
series, and find the first two nonvaninishing terms. Show that the 
first term is the classical expression. & 


Key to symbols: 
easy . typical challenging Mdifficult © very difficult 
Vv An answer check is available at www.lightandmatter.com. 


Problems 


405 


406 


Exercises 
Exercise 7A: The Michelson-Morley Experiment 


a ee 
— | 
— — 
IK 


P= Ile tlc 


A 
laboratory's x,t frame of reference sun's x',t' frame (lab moving to the right) 


In this exercise you will analyze the Michelson-Morley experiment, and find what the results 
should have been according to Galilean relativity and Einstein’s theory of relativity. A beam of 
light coming from the west (not shown) comes to the half-silvered mirror A. Half the light goes 
through to the east, is reflected by mirror C, and comes back to A. The other half is reflected 
north by A, is reflected by B, and also comes back to A. When the beams reunite at A, part 
of each ends up going south, and these parts interfere with one another. If the time taken 
for a round trip differs by, for example, half the period of the wave, there will be destructive 
interference. 


The point of the experiment was to search for a difference in the experimental results between 
the daytime, when the laboratory was moving west relative to the sun, and the nighttime, when 
the laboratory was moving east relative to the sun. Galilean relativity and Einstein’s theory 
of relativity make different predictions about the results. According to Galilean relativity, the 
speed of light cannot be the same in all reference frames, so it is assumed that there is one special 
reference frame, perhaps the sun’s, in which light travels at the same speed in all directions; 
in other frames, Galilean relativity predicts that the speed of light will be different in different 
directions, e.g., slower if the observer is chasing a beam of light. There are four different ways 
to analyze the experiment: 


e Laboratory’s frame of reference, Galilean relativity. This is not a useful way to analyze 
the experiment, since one does not know how fast light will travel in various directions. 


e Sun’s frame of reference, Galilean relativity. We assume that in this special frame of 
reference, the speed of light is the same in all directions: we call this speed c. In this 
frame, the laboratory moves with velocity v, and mirrors A, B, and C move while the light 
beam is in flight. 


Laboratory’s frame of reference, Einstein’s theory of relativity. The analysis is extremely 
simple. Let the length of each arm be L. Then the time required to get from A to either 
mirror is L/c, so each beam’s round-trip time is 2L/c. 


e Sun’s frame of reference, Einstein’s theory of relativity. We analyze this case by starting 
with the laboratory’s frame of reference and then transforming to the sun’s frame. 
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Groups 1-4 work in the sun’s frame of reference according to Galilean relativity. 


Group 1 finds time AC. Group 2 finds time CA. Group 3 finds time AB. Group 4 finds time 
BA. 


Groups 5 and 6 transform the lab-frame results into the sun’s frame according to Einstein’s 
theory. 


Group 5 transforms the x and t when ray ACA gets back to A into the sun’s frame of reference, 
and group 6 does the same for ray ABA. 


Discussion: 


Michelson and Morley found no change in the interference of the waves between day and night. 
Which version of relativity is consistent with their results? 


What does each theory predict if v approaches c? 
What if the arms are not exactly equal in length? 


Does it matter if the “special” frame is some frame other than the sun’s? 
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Exercise 7B: Sports in Slowlightland 


In Slowlightland, the speed of light is 20 mi/hr + 32 km/hr ~ 9 m/s. Think of an example of 
how relativistic effects would work in sports. Things can get very complex very quickly, so try 
to think of a simple example that focuses on just one of the following effects: 


- relativistic momentum 

- relativistic kinetic energy 

- relativistic addition of velocities 

- time dilation and length contraction 

- Doppler shifts of light 

- equivalence of mass and energy 

- time it takes for light to get to an athlete’s eye 
- deflection of light rays by gravity 
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Chapter 8 
Atoms and Electromagnetism 


8.1 The Electric Glue 


Where the telescope ends, the microscope begins. Which of the two 
has the grander view? Victor Hugo 


His father died during his mother’s pregnancy. Rejected by her 
as a boy, he was packed off to boarding school when she remarried. 
He himself never married, but in middle age he formed an intense 
relationship with a much younger man, a relationship that he ter- 
minated when he underwent a psychotic break. Following his early 
scientific successes, he spent the rest of his professional life mostly 
in frustration over his inability to unlock the secrets of alchemy. 


The man being described is Isaac Newton, but not the triumphant 
Newton of the standard textbook hagiography. Why dwell on the 
sad side of his life? To the modern science educator, Newton’s life- 
long obsession with alchemy may seem an embarrassment, a distrac- 
tion from his main achievement, the creation the modern science of 
mechanics. To Newton, however, his alchemical researches were nat- 
urally related to his investigations of force and motion. What was 
radical about Newton’s analysis of motion was its universality: it 
succeeded in describing both the heavens and the earth with the 
same equations, whereas previously it had been assumed that the 
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sun, moon, stars, and planets were fundamentally different from 
earthly objects. But Newton realized that if science was to describe 
all of nature in a unified way, it was not enough to unite the human 
scale with the scale of the universe: he would not be satisfied until 
he fit the microscopic universe into the picture as well. 


It should not surprise us that Newton failed. Although he was a 
firm believer in the existence of atoms, there was no more experimen- 
tal evidence for their existence than there had been when the ancient 
Greeks first posited them on purely philosophical grounds. Alchemy 
labored under a tradition of secrecy and mysticism. Newton had 
already almost single-handedly transformed the fuzzyheaded field 
of “natural philosophy” into something we would recognize as the 
modern science of physics, and it would be unjust to criticize him 
for failing to change alchemy into modern chemistry as well. The 
time was not ripe. The microscope was a new invention, and it was 
cutting-edge science when Newton’s contemporary Hooke discovered 
that living things were made out of cells. 


8.1.1 The quest for the atomic force 


Newton was not the first of the age of reason. He was the last 
of the magicians. John Maynard Keynes 


Newton’s quest 


Nevertheless it will be instructive to pick up Newton’s train of 
thought and see where it leads us with the benefit of modern hind- 
sight. In uniting the human and cosmic scales of existence, he had 
reimagined both as stages on which the actors were objects (trees 
and houses, planets and stars) that interacted through attractions 
and repulsions. He was already convinced that the objects inhab- 
iting the microworld were atoms, so it remained only to determine 
what kinds of forces they exerted on each other. 


His next insight was no less brilliant for his inability to bring it to 
fruition. He realized that the many human-scale forces — friction, 
sticky forces, the normal forces that keep objects from occupying 
the same space, and so on — must all simply be expressions of a 
more fundamental force acting between atoms. Tape sticks to paper 
because the atoms in the tape attract the atoms in the paper. My 
house doesn’t fall to the center of the earth because its atoms repel 
the atoms of the dirt under it. 


Here he got stuck. It was tempting to think that the atomic force 
was a form of gravity, which he knew to be universal, fundamental, 
and mathematically simple. Gravity, however, is always attractive, 
so how could he use it to explain the existence of both attractive 
and repulsive atomic forces? The gravitational force between ob- 
jects of ordinary size is also extremely small, which is why we never 
notice cars and houses attracting us gravitationally. It would be 
hard to understand how gravity could be responsible for anything 
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as vigorous as the beating of a heart or the explosion of gunpowder. 
Newton went on to write a million words of alchemical notes filled 
with speculation about some other force, perhaps a “divine force” or 
“vegetative force” that would for example be carried by the sperm 
to the egg. 


Luckily, we now know enough to investigate a different suspect 
as a candidate for the atomic force: electricity. Electric forces are 
often observed between objects that have been prepared by rubbing 
(or other surface interactions), for instance when clothes rub against 
each other in the dryer. A useful example is shown in figure a/1: 
stick two pieces of tape on a tabletop, and then put two more pieces 
on top of them. Lift each pair from the table, and then separate 
them. The two top pieces will then repel each other, a/2, as will 
the two bottom pieces. A bottom piece will attract a top piece, 
however, a/3. Electrical forces like these are similar in certain ways 
to gravity, the other force that we already know to be fundamental: 


e Electrical forces are universal. Although some substances, 
such as fur, rubber, and plastic, respond more strongly to 
electrical preparation than others, all matter participates in 
electrical forces to some degree. There is no such thing as a 
“nonelectric” substance. Matter is both inherently gravita- 
tional and inherently electrical. 


e Experiments show that the electrical force, like the gravita- 
tional force, is an inverse square force. That is, the electrical 
force between two spheres is proportional to 1/r?, where r is 
the center-to-center distance between them. 


Furthermore, electrical forces make more sense than gravity as 
candidates for the fundamental force between atoms, because we 
have observed that they can be either attractive or repulsive. 


8.1.2 Charge, electricity and magnetism 
Charge 


“Charge” is the technical term used to indicate that an object 
has been prepared so as to participate in electrical forces. This is 
to be distinguished from the common usage, in which the term is 
used indiscriminately for anything electrical. For example, although 
we speak colloquially of “charging” a battery, you may easily verify 
that a battery has no charge in the technical sense, e.g., it does not 
exert any electrical force on a piece of tape that has been prepared 
as described in the previous section. 
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a/Four pieces of tape are 
prepared, 1, as described in the 
text. Depending on which com- 
bination is tested, the interaction 
can be either repulsive, 2, or 
attractive, 3. 
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Two types of charge 


We can easily collect reams of data on electrical forces between 
different substances that have been charged in different ways. We 
find for example that cat fur prepared by rubbing against rabbit 
fur will attract glass that has been rubbed on silk. How can we 
make any sense of all this information? A vast simplification is 
achieved by noting that there are really only two types of charge. 
Suppose we pick cat fur rubbed on rabbit fur as a representative of 
type A, and glass rubbed on silk for type B. We will now find that 
there is no “type C.” Any object electrified by any method is either 
A-like, attracting things A attracts and repelling those it repels, or 
B-like, displaying the same attractions and repulsions as B. The two 
types, A and B, always display opposite interactions. If A displays 
an attraction with some charged object, then B is guaranteed to 
undergo repulsion with it, and vice-versa. 


The indexcoulomb (unit)coulomb 


Although there are only two types of charge, each type can come 
in different amounts. The metric unit of charge is the coulomb 
(rhymes with “drool on”), defined as follows: 


One Coulomb (C) is the amount of charge such that a force of 
9.0 x 10° N occurs between two pointlike objects with charges 
of 1 C separated by a distance of 1 m. 


The notation for an amount of charge is g. The numerical factor 
in the definition is historical in origin, and is not worth memoriz- 
ing. The definition is stated for pointlike, i.e., very small, objects, 
because otherwise different parts of them would be at different dis- 
tances from each other. 


A model of two types of charged particles 


Experiments show that all the methods of rubbing or otherwise 
charging objects involve two objects, and both of them end up get- 
ting charged. If one object acquires a certain amount of one type of 
charge, then the other ends up with an equal amount of the other 
type. Various interpretations of this are possible, but the simplest 
is that the basic building blocks of matter come in two flavors, one 
with each type of charge. Rubbing objects together results in the 
transfer of some of these particles from one object to the other. In 
this model, an object that has not been electrically prepared may ac- 
tually possesses a great deal of both types of charge, but the amounts 
are equal and they are distributed in the same way throughout it. 
Since type A repels anything that type B attracts, and vice versa, 
the object will make a total force of zero on any other object. The 
rest of this chapter fleshes out this model and discusses how these 
mysterious particles can be understood as being internal parts of 
atoms. 
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Use of positive and negative signs for charge 


Because the two types of charge tend to cancel out each other’s 
forces, it makes sense to label them using positive and negative signs, 
and to discuss the total charge of an object. It is entirely arbitrary 
which type of charge to call negative and which to call positive. 
Benjamin Franklin decided to describe the one we’ve been calling 
“A” as negative, but it really doesn’t matter as long as everyone is 
consistent with everyone else. An object with a total charge of zero 
(equal amounts of both types) is referred to as electrically neutral. 


self-check A 

Criticize the following statement: “There are two types of charge, attrac- 
tive and repulsive.” > Answer, p. 
869 


A large body of experimental observations can be summarized 
as follows: 


Coulomb’s law: The magnitude of the force acting between point- 
like charged objects at a center-to-center distance r is given by the 
equation 


ail {q2| 
Fee 
where the constant k equals 9.0 x 109 N-m?/C?. The force is attrac- 
tive if the charges are of different signs, and repulsive if they have 


the same sign. 


Clever modern techniques have allowed the 1/r? form of Coulomb’s 


law to be tested to incredible accuracy, showing that the exponent 
is in the range from 1.9999999999999998 to 2.0000000000000002. 


Note that Coulomb’s law is closely analogous to Newton’s law 
of gravity, where the magnitude of the force is Gm ymz2/r?, except 
that there is only one type of mass, not two, and gravitational forces 
are never repulsive. Because of this close analogy between the two 
types of forces, we can recycle a great deal of our knowledge of 
gravitational forces. For instance, there is an electrical equivalent 
of the shell theorem: the electrical forces exerted externally by a 
uniformly charged spherical shell are the same as if all the charge 
was concentrated at its center, and the forces exerted internally are 
Zero. 


Conservation of indexcharge!conservation ofcharge 


An even more fundamental reason for using positive and nega- 
tive signs for electrical charge is that experiments show that charge 
is conserved according to this definition: in any closed system, the 
total amount of charge is a constant. This is why we observe that 
rubbing initially uncharged substances together always has the re- 
sult that one gains a certain amount of one type of charge, while 
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b/A charged piece of tape 
attracts uncharged pieces of 
paper from a distance, and they 
leap up to it. 


c/The paper has zero total 
charge, but it does have charged 
particles in it that can move. 


the other acquires an equal amount of the other type. Conservation 
of charge seems natural in our model in which matter is made of 
positive and negative particles. If the charge on each particle is a 
fixed property of that type of particle, and if the particles themselves 
can be neither created nor destroyed, then conservation of charge is 
inevitable. 


Electrical forces involving neutral objects 


As shown in figure b, an electrically charged object can attract 
objects that are uncharged. How is this possible? The key is that 
even though each piece of paper has a total charge of zero, it has at 
least some charged particles in it that have some freedom to move. 
Suppose that the tape is positively charged, c. Mobile particles 
in the paper will respond to the tape’s forces, causing one end of 
the paper to become negatively charged and the other to become 
positive. The attraction between the paper and the tape is now 
stronger than the repulsion, because the negatively charged end is 
closer to the tape. 


self-check B 
What would have happened if the tape was negatively charged? > 
Answer, p. 869 


The path ahead 


We have begun to encounter complex electrical behavior that we 
would never have realized was occurring just from the evidence of our 
eyes. Unlike the pulleys, blocks, and inclined planes of mechanics, 
the actors on the stage of electricity and magnetism are invisible 
phenomena alien to our everyday experience. For this reason, the 
flavor of the second half of your physics education is dramatically 
different, focusing much more on experiments and techniques. Even 
though you will never actually see charge moving through a wire, 
you can learn to use an ammeter to measure the flow. 


Students also tend to get the impression from their first semester 
of physics that it is a dead science. Not so! We are about to pick 
up the historical trail that leads directly to the cutting-edge physics 
research you read about in the newspaper. The atom-smashing ex- 
periments that began around 1900, which we will be studying in this 
chapter, were not that different from the ones of the year 2000 — 
just smaller, simpler, and much cheaper. 


Magnetic forces 


A detailed mathematical treatment of magnetism won’t come 
until much later in this book, but we need to develop a few simple 
ideas about magnetism now because magnetic forces are used in the 
experiments and techniques we come to next. Everyday magnets 
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come in two general types. Permanent magnets, such as the ones 
on your refrigerator, are made of iron or substances like steel that 
contain iron atoms. (Certain other substances also work, but iron 
is the cheapest and most common.) The other type of magnet, 
an example of which is the ones that make your stereo speakers 
vibrate, consist of coils of wire through which electric charge flows. 
Both types of magnets are able to attract iron that has not been 
magnetically prepared, for instance the door of the refrigerator. 


A single insight makes these apparently complex phenomena 
much simpler to understand: magnetic forces are interactions be- 
tween moving charges, occurring in addition to the electric forces. 
Suppose a permanent magnet is brought near a magnet of the coiled- 
wire type. The coiled wire has moving charges in it because we force 
charge to flow. The permanent magnet also has moving charges in 
it, but in this case the charges that naturally swirl around inside the 
iron. (What makes a magnetized piece of iron different from a block 
of wood is that the motion of the charge in the wood is random 
rather than organized.) The moving charges in the coiled-wire mag- 
net exert a force on the moving charges in the permanent magnet, 
and vice-versa. 


The mathematics of magnetism is significantly more complex 
than the Coulomb force law for electricity, which is why we will 
wait until chapter 11 before delving deeply into it. Two simple facts 
will suffice for now: 


(1) If a charged particle is moving in a region of space near where 
other charged particles are also moving, their magnetic force on it 
is directly proportional to its velocity. 


(2) The magnetic force on a moving charged particle is always 
perpendicular to the direction the particle is moving. 


A magnetic compass example 1 
The Earth is molten inside, and like a pot of boiling water, it roils 
and churns. To make a drastic oversimplification, electric charge 
can get carried along with the churning motion, so the Earth con- 
tains moving charge. The needle of a magnetic compass is itself 
a small permanent magnet. The moving charge inside the earth 
interacts magnetically with the moving charge inside the compass 
needle, causing the compass needle to twist around and point 
north. 


A television tube example 2 
A TV picture is painted by a stream of electrons coming from 
the back of the tube to the front. The beam scans across the 
whole surface of the tube like a reader scanning a page of a book. 
Magnetic forces are used to steer the beam. As the beam comes 
from the back of the tube to the front, up-down and left-right forces 
are needed for steering. But magnetic forces cannot be used 
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to get the beam up to speed in the first place, since they can 
only push perpendicular to the electrons’ direction of motion, not 
forward along it. 


Discussion Questions 


A _ Ifthe electrical attraction between two pointlike objects at a distance 
of 1 mis 9x 10° N, why can’t we infer that their charges are +1 and —1 C? 
What further observations would we need to do in order to prove this? 


B- Anelectrically charged piece of tape will be attracted to your hand. 
Does that allow us to tell whether the mobile charged particles in your 
hand are positive or negative, or both? 


8.1.3 Atoms 
I was brought up to look at the atom as a nice, hard fellow, red 
or grey in color according to taste. Rutherford 


Indexatomismatomism 


The Greeks have been kicked around a lot in the last couple of 
millennia: dominated by the Romans, bullied during the crusades 
by warlords going to and from the Holy Land, and occupied by 
Turkey until recently. It’s no wonder they prefer to remember their 
salad days, when their best thinkers came up with concepts like 
democracy and atoms. Greece is democratic again after a period 
of military dictatorship, and an atom is proudly pictured on one of 
their coins. That’s why it hurts me to have to say that the ancient 
Greek hypothesis that matter is made of atoms was pure guess- 
work. There was no real experimental evidence for atoms, and the 
18th-century revival of the atom concept by Dalton owed little to 
the Greeks other than the name, which means “unsplittable.” Sub- 
tracting even more cruelly from Greek glory, the name was shown 
to be inappropriate in 1897 when physicist J.J. Thomson proved ex- 
perimentally that atoms had even smaller things inside them, which 
could be extracted. (Thomson called them “electrons.”) The “un- 
splittable” was splittable after all. 


But that’s getting ahead of our story. What happened to the 
atom concept in the intervening two thousand years? Educated peo- 
ple continued to discuss the idea, and those who were in favor of it 
could often use it to give plausible explanations for various facts and 
phenomena. One fact that was readily explained was conservation 
of mass. For example, if you mix 1 kg of water with 1 kg of dirt, 
you get exactly 2 kg of mud, no more and no less. The same is true 
for the a variety of processes such as freezing of water, fermenting 
beer, or pulverizing sandstone. If you believed in atoms, conserva- 
tion of mass made perfect sense, because all these processes could 
be interpreted as mixing and rearranging atoms, without changing 
the total number of atoms. Still, this is nothing like a proof that 
atoms exist. 


If atoms did exist, what types of atoms were there, and what dis- 
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tinguished the different types from each other? Was it their sizes, 
their shapes, their weights, or some other quality? The chasm be- 
tween the ancient and modern atomisms becomes evident when we 
consider the wild speculations that existed on these issues until the 
present century. The ancients decided that there were four types of 
atoms, earth, water, air and fire; the most popular view was that 
they were distinguished by their shapes. Water atoms were spher- 
ical, hence water’s ability to flow smoothly. Fire atoms had sharp 
points, which was why fire hurt when it touched one’s skin. (There 
was no concept of temperature until thousands of years later.) The 
drastically different modern understanding of the structure of atoms 
was achieved in the course of the revolutionary decade stretching 
1895 to 1905. The main purpose of this chapter is to describe those 
momentous experiments. 


Atoms, light, and everything else 


Although I tend to ridicule ancient Greek philosophers like Aris- 
totle, let’s take a moment to praise him for something. If you read 
Aristotle’s writings on physics (or just skim them, which is all I’ve 
done), the most striking thing is how careful he is about classifying 
phenomena and analyzing relationships among phenomena. The hu- 
man brain seems to naturally make a distinction between two types 
of physical phenomena: objects and motion of objects. When a 
phenomenon occurs that does not immediately present itself as one 
of these, there is a strong tendency to conceptualize it as one or 
the other, or even to ignore its existence completely. For instance, 
physics teachers shudder at students’ statements that “the dynamite 
exploded, and force came out of it in all directions.” In these exam- 
ples, the nonmaterial concept of force is being mentally categorized 
as if it was a physical substance. The statement that “winding the 
clock stores motion in the spring” is a miscategorization of electrical 
energy as a form of motion. An example of ignoring the existence 
of a phenomenon altogether can be elicited by asking people why 
we need lamps. The typical response that “the lamp illuminates 
the room so we can see things,” ignores the necessary role of light 
coming into our eyes from the things being illuminated. 


If you ask someone to tell you briefly about atoms, the likely 
response is that “everything is made of atoms,” but we’ve now seen 
that it’s far from obvious which “everything” this statement would 
properly refer to. For the scientists of the early 1900s who were 
trying to investigate atoms, this was not a trivial issue of defini- 
tions. There was a new gizmo called the vacuum tube, of which the 
only familiar example today is the picture tube of a TV. In short 
order, electrical tinkerers had discovered a whole flock of new phe- 
nomena that occurred in and around vacuum tubes, and given them 
picturesque names like “x-rays,” “cathode rays,” “Hertzian waves,” 
and “N-rays.” These were the types of observations that ended up 
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telling us that we know about matter, but fierce controversies ensued 
over whether these were themselves forms of matter. 


Let’s bring ourselves up to the level of classification of phenom- 
ena employed by physicists in the year 1900. They recognized three 
categories: 


e Matter has mass, can have kinetic energy, and can travel 
through a vacuum, transporting its mass and kinetic energy 
with it. Matter is conserved, both in the sense of conservation 
of mass and conservation of the number of atoms of each ele- 
ment. Atoms can’t occupy the same space as other atoms, so 
a convenient way to prove something is not a form of matter 
is to show that it can pass through a solid material, in which 
the atoms are packed together closely. 


Light has no mass, always has energy, and can travel through a 
vacuum, transporting its energy with it. Two light beams can 
penetrate through each other and emerge from the collision 
without being weakened, deflected, or affected in any other 
way. Light can penetrate certain kinds of matter, e.g., glass. 


e The third category is everything that doesn’t fit the defini- 
tion of light or matter. This catch-all category includes, for 
example, time, velocity, heat, and force. 


The chemical indexelements, chemicalelements 


How would one find out what types of atoms there were? To- 
day, it doesn’t seem like it should have been very difficult to work 
out an experimental program to classify the types of atoms. For 
each type of atom, there should be a corresponding element, i.e., a 
pure substance made out of nothing but that type of atom. Atoms 
are supposed to be unsplittable, so a substance like milk could not 
possibly be elemental, since churning it vigorously causes it to split 
up into two separate substances: butter and whey. Similarly, rust 
could not be an element, because it can be made by combining two 
substances: iron and oxygen. Despite its apparent reasonableness, 
no such program was carried out until the eighteenth century. The 
ancients presumably did not do it because observation was not uni- 
versally agreed on as the right way to answer questions about nature, 
and also because they lacked the necessary techniques or the tech- 
niques were the province of laborers with low social status, such as 
smiths and miners. Alchemists were hindered by atomism’s repu- 
tation for subversiveness, and by a tendency toward mysticism and 
secrecy. (The most celebrated challenge facing the alchemists, that 
of converting lead into gold, is one we now know to be impossible, 
since lead and gold are both elements.) 


By 1900, however, chemists had done a reasonably good job of 
finding out what the elements were. They also had determined the 


418 Chapter 8 Atoms and Electromagnetism 


ratios of the different atoms’ masses fairly accurately. A typical 
technique would be to measure how many grams of sodium (Na) 
would combine with one gram of chlorine (Cl) to make salt (NaCl). 
(This assumes you’ve already decided based on other evidence that 
salt consisted of equal numbers of Na and Cl atoms.) The masses of 
individual atoms, as opposed to the mass ratios, were known only 
to within a few orders of magnitude based on indirect evidence, and 
plenty of physicists and chemists denied that individual atoms were 
anything more than convenient symbols. 


Making sense of the elements 


As the information accumulated, the challenge was to find a 
way of systematizing it; the modern scientist’s aesthetic sense rebels 
against complication. This hodgepodge of elements was an embar- 
rassment. One contemporary observer, William Crookes, described 
the elements as extending “before us as stretched the wide Atlantic 
before the gaze of Columbus, mocking, taunting and murmuring 
strange riddles, which no man has yet been able to solve.” It wasn’t 
long before people started recognizing that many atoms’ masses were 
nearly integer multiples of the mass of hydrogen, the lightest ele- 
ment. A few excitable types began speculating that hydrogen was 
the basic building block, and that the heavier elements were made 
of clusters of hydrogen. It wasn’t long, however, before their parade 
was rained on by more accurate measurements, which showed that 
not all of the elements had atomic masses that were near integer 
multiples of hydrogen, and even the ones that were close to being 
integer multiples were off by one percent or so. 
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Chemistry professor Dmitri Mendeleev, preparing his lectures in 
1869, wanted to find some way to organize his knowledge for his stu- 
dents to make it more understandable. He wrote the names of all 
the elements on cards and began arranging them in different ways 
on his desk, trying to find an arrangement that would make sense of 
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the muddle. The row-and-column scheme he came up with is essen- 
tially our modern periodic table. The columns of the modern version 
represent groups of elements with similar chemical properties, and 
each row is more massive than the one above it. Going across each 
row, this almost always resulted in placing the atoms in sequence 
by weight as well. What made the system significant was its predic- 
tive value. There were three places where Mendeleev had to leave 
gaps in his checkerboard to keep chemically similar elements in the 
same column. He predicted that elements would exist to fill these 
gaps, and extrapolated or interpolated from other elements in the 
same column to predict their numerical properties, such as masses, 
boiling points, and densities. Mendeleev’s professional stock sky- 
rocketed when his three elements (later named gallium, scandium 
and germanium) were discovered and found to have very nearly the 
properties he had predicted. 


One thing that Mendeleev’s table made clear was that mass was 
not the basic property that distinguished atoms of different ele- 
ments. To make his table work, he had to deviate from ordering 
the elements strictly by mass. For instance, iodine atoms are lighter 
than tellurium, but Mendeleev had to put iodine after tellurium so 
that it would lie in a column with chemically similar elements. 


Direct proof that atoms existed 


The success of the kinetic theory of heat was taken as strong evi- 
dence that, in addition to the motion of any object as a whole, there 
is an invisible type of motion all around us: the random motion of 
atoms within each object. But many conservatives were not con- 
vinced that atoms really existed. Nobody had ever seen one, after 
all. It wasn’t until generations after the kinetic theory of heat was 
developed that it was demonstrated conclusively that atoms really 
existed and that they participated in continuous motion that never 
died out. 


The smoking gun to prove atoms were more than mathematical 
abstractions came when some old, obscure observations were reex- 
amined by an unknown Swiss patent clerk named Albert Einstein. 
A botanist named Brown, using a microscope that was state of the 
art in 1827, observed tiny grains of pollen in a drop of water on a 
microscope slide, and found that they jumped around randomly for 
no apparent reason. Wondering at first if the pollen he’d assumed to 
be dead was actually alive, he tried looking at particles of soot, and 
found that the soot particles also moved around. The same results 
would occur with any small grain or particle suspended in a liquid. 
The phenomenon came to be referred to as Brownian motion, and 
its existence was filed away as a quaint and thoroughly unimportant 
fact, really just a nuisance for the microscopist. 


It wasn’t until 1906 that Einstein found the correct interpreta- 
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tion for Brown’s observation: the water molecules were in continuous 
random motion, and were colliding with the particle all the time, 
kicking it in random directions. After all the millennia of speculation 
about atoms, at last there was solid proof. Einstein’s calculations 
dispelled all doubt, since he was able to make accurate predictions 
of things like the average distance traveled by the particle in a cer- 
tain amount of time. (Einstein received the Nobel Prize not for his 
theory of relativity but for his papers on Brownian motion and the 
photoelectric effect.) 


Discussion Questions 


A Howcould knowledge of the size of an individual aluminum atom be 
used to infer an estimate of its mass, or vice versa? 


B How could one test Einstein’s interpretation of Brownian motion by 
observing it at different temperatures? 


8.1.4 Quantization of charge 


Proving that atoms actually existed was a big accomplishment, 
but demonstrating their existence was different from understanding 
their properties. Note that the Brown-Einstein observations had 
nothing at all to do with electricity, and yet we know that matter 
is inherently electrical, and we have been successful in interpreting 
certain electrical phenomena in terms of mobile positively and nega- 
tively charged particles. Are these particles atoms? Parts of atoms? 
Particles that are entirely separate from atoms? It is perhaps pre- 
mature to attempt to answer these questions without any conclusive 
evidence in favor of the charged-particle model of electricity. 


Strong support for the charged-particle model came from a 1911 


experiment by physicist Robert Millikan at the University of Chicago. 


Consider a jet of droplets of perfume or some other liquid made by 
blowing it through a tiny pinhole. The droplets emerging from the 
pinhole must be smaller than the pinhole, and in fact most of them 
are even more microscopic than that, since the turbulent flow of air 
tends to break them up. Millikan reasoned that the droplets would 
acquire a little bit of electric charge as they rubbed against the chan- 
nel through which they emerged, and if the charged-particle model 
of electricity was right, the charge might be split up among so many 
minuscule liquid drops that a single drop might have a total charge 
amounting to an excess of only a few charged particles — perhaps 
an excess of one positive particle on a certain drop, or an excess of 
two negative ones on another. 


Millikan’s ingenious apparatus, g, consisted of two metal plates, 
which could be electrically charged as needed. He sprayed a cloud of 
oil droplets into the space between the plates, and selected one drop 
through a microscope for study. First, with no charge on the plates, 
he would determine the drop’s mass by letting it fall through the 
air and measuring its terminal velocity, i.e., the velocity at which 
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the force of air friction canceled out the force of gravity. The force 
of air drag on a slowly moving sphere had already been found by 
experiment to be bur?, where b was a constant. Setting the total 
force equal to zero when the drop is at terminal velocity gives 


bur? — mg = 0 ‘ 


and setting the known density of oil equal to the drop’s mass divided 
by its volume gives a second equation, 


m 


ie $nr3 

Everything in these equations can be measured directly except for 
m and r, so these are two equations in two unknowns, which can be 
solved in order to determine how big the drop is. 


Next Millikan charged the metal plates, adjusting the amount 
of charge so as to exactly counteract gravity and levitate the drop. 
If, for instance, the drop being examined happened to have a total 
charge that was negative, then positive charge put on the top plate 
would attract it, pulling it up, and negative charge on the bottom 
plate would repel it, pushing it up. (Theoretically only one plate 
would be necessary, but in practice a two-plate arrangement like this 
gave electrical forces that were more uniform in strength throughout 
the space where the oil drops were.) The amount of charge on the 
plates required to levitate the charged drop gave Millikan a handle 
on the amount of charge the drop carried. The more charge the 
drop had, the stronger the electrical forces on it would be, and the 
less charge would have to be put on the plates to do the trick. Un- 
fortunately, expressing this relationship using Coulomb’s law would 
have been impractical, because it would require a perfect knowledge 
of how the charge was distributed on each plate, plus the ability 
to perform vector addition of all the forces being exerted on the 
drop by all the charges on the plate. Instead, Millikan made use of 
the fact that the electrical force experienced by a pointlike charged 
object at a certain point in space is proportional to its charge, 


F 
— = constant 
qd 


With a given amount of charge on the plates, this constant could be 
determined for instance by discarding the oil drop, inserting between 
the plates a larger and more easily handled object with a known 
charge on it, and measuring the force with conventional methods. 
(Millikan actually used a slightly different set of techniques for de- 
termining the constant, but the concept is the same.) The amount 
of force on the actual oil drop had to equal mg, since it was just 
enough to levitate it, and once the calibration constant had been 
determined, the charge of the drop could then be found based on its 
previously determined mass. 
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The table on the left shows a few of the results from Millikan’s 
1911 paper. (Millikan took data on both negatively and positively 
charged drops, but in his paper he gave only a sample of his data on 
negatively charged drops, so these numbers are all negative.) Even a 
quick look at the data leads to the suspicion that the charges are not 
simply a series of random numbers. For instance, the second charge 
is almost exactly equal to half the first one. Millikan explained the 
observed charges as all being integer multiples of a single number, 
1.64x 107! CG. In the second column, dividing by this constant gives 
numbers that are essentially integers, allowing for the random errors 
present in the experiment. Millikan states in his paper that these 
results were a 


... direct and tangible demonstration ...of the correct- 
ness of the view advanced many years ago and supported 
by evidence from many sources that all electrical charges, 
however produced, are exact multiples of one definite, 
elementary electrical charge, or in other words, that an 
electrical charge instead of being spread uniformly over 
the charged surface has a definite granular structure, 
consisting, in fact, of ...specks, or atoms of electric- 
ity, all precisely alike, peppered over the surface of the 
charged body. 


In other words, he had provided direct evidence for the charged- 
particle model of electricity and against models in which electricity 
was described as some sort of fluid. The basic charge is notated e, 
and the modern value is e = 1.60 x 107!9 C. The word “quantized” 
is used in physics to describe a quantity that can only have certain 
numerical values, and cannot have any of the values between those. 
In this language, we would say that Millikan discovered that charge 
is quantized. The charge e is referred to as the quantum of charge. 


A historical note on millikan’s fraud 


Very few undergraduate physics textbooks mention the well- 
documented fact that although Millikan’s conclusions were correct, 
he was guilty of scientific fraud. His technique was difficult and 
painstaking to perform, and his original notebooks, which have been 
preserved, show that the data were far less perfect than he claimed 
in his published scientific papers. In his publications, he stated cat- 
egorically that every single oil drop observed had had a charge that 
was a multiple of e, with no exceptions or omissions. But his note- 
books are replete with notations such as “beautiful data, keep,” and 
“bad run, throw out.” Millikan, then, appears to have earned his 
Nobel Prize by advocating a correct position with dishonest descrip- 
tions of his data. 


Why do textbook authors fail to mention Millikan’s fraud? It 
may be that they think students are too unsophisticated to cor- 
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rectly evaluate the implications of the fact that scientific fraud has 
sometimes existed and even been rewarded by the scientific estab- 
lishment. Maybe they are afraid students will reason that fudging 
data is OK, since Millikan got the Nobel Prize for it. But falsifying 
history in the name of encouraging truthfulness is more than a little 
ironic. English teachers don’t edit Shakespeare’s tragedies so that 
the bad characters are always punished and the good ones never 
suffer! 


self-check C 
Is money quantized? What is the quantum of money? > Answer, p. 
870 


8.1.5 The electron 
Indexcathode rayscathode rays 


Nineteenth-century physicists spent a lot of time trying to come 
up with wild, random ways to play with electricity. The best ex- 
periments of this kind were the ones that made big sparks or pretty 
colors of light. 


One such parlor trick was the cathode ray. To produce it, you 
first had to hire a good glassblower and find a good vacuum pump. 
The glassblower would create a hollow tube and embed two pieces of 
metal in it, called the electrodes, which were connected to the out- 
side via metal wires passing through the glass. Before letting him 
seal up the whole tube, you would hook it up to a vacuum pump, 
and spend several hours huffing and puffing away at the pump’s 
hand crank to get a good vacuum inside. Then, while you were still 
pumping on the tube, the glassblower would melt the glass and seal 
the whole thing shut. Finally, you would put a large amount of pos- 
itive charge on one wire and a large amount of negative charge on 
the other. Metals have the property of letting charge move through 
them easily, so the charge deposited on one of the wires would 
quickly spread out because of the repulsion of each part of it for 
every other part. This spreading-out process would result in nearly 
all the charge ending up in the electrodes, where there is more room 
to spread out than there is in the wire. For obscure historical rea- 
sons a negative electrode is called a cathode and a positive one is 
an anode. 


Figure i shows the light-emitting stream that was observed. If, 
as shown in this figure, a hole was made in the anode, the beam 
would extend on through the hole until it hit the glass. Drilling a 
hole in the cathode, however would not result in any beam coming 
out on the left side, and this indicated that the stuff, whatever it 
was, was coming from the cathode. The rays were therefore chris- 
tened “cathode rays.” (The terminology is still used today in the 
term “cathode ray tube” or “CRT” for the picture tube of a TV or 
computer monitor.) 
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Were cathode rays a form of light, or of matter? 


Were cathode rays a form of light, or matter? At first no one re- 
ally cared what they were, but as their scientific importance became 
more apparent, the light-versus-matter issue turned into a contro- 
versy along nationalistic lines, with the Germans advocating light 
and the English holding out for matter. The supporters of the ma- 
terial interpretation imagined the rays as consisting of a stream of 
atoms ripped from the substance of the cathode. 


One of our defining characteristics of matter is that material 
objects cannot pass through each other. Experiments showed that 
cathode rays could penetrate at least some small thickness of matter, 
such as a metal foil a tenth of a millimeter thick, implying that they 
were a form of light. 


Other experiments, however, pointed to the contrary conclusion. 
Light is a wave phenomenon, and one distinguishing property of 
waves is demonstrated by speaking into one end of a paper towel 
roll. The sound waves do not emerge from the other end of the 
tube as a focused beam. Instead, they begin spreading out in all 
directions as soon as they emerge. This shows that waves do not 
necessarily travel in straight lines. If a piece of metal foil in the shape 
of a star or a cross was placed in the way of the cathode ray, then 
a “shadow” of the same shape would appear on the glass, showing 
that the rays traveled in straight lines. This straight-line motion 
suggested that they were a stream of small particles of matter. 


These observations were inconclusive, so what was really needed 
was a determination of whether the rays had mass and weight. The 
trouble was that cathode rays could not simply be collected in a cup 
and put on a scale. When the cathode ray tube is in operation, one 
does not observe any loss of material from the cathode, or any crust 
being deposited on the anode. 


Nobody could think of a good way to weigh cathode rays, so the 
next most obvious way of settling the light/matter debate was to 
check whether the cathode rays possessed electrical charge. Light 
was known to be uncharged. If the cathode rays carried charge, 
they were definitely matter and not light, and they were presum- 
ably being made to jump the gap by the simultaneous repulsion of 
the negative charge in the cathode and attraction of the positive 
charge in the anode. The rays would overshoot the anode because 
of their momentum. (Although electrically charged particles do not 
normally leap across a gap of vacuum, very large amounts of charge 
were being used, so the forces were unusually intense.) 


Thomson’s experiments 


Physicist J.J. Thomson at Cambridge carried out a series of 
definitive experiments on cathode rays around the year 1897. By 
turning them slightly off course with electrical forces, k, he showed 
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k / Thomson’s experiment proving 
cathode rays had electric charge 
(redrawn from his original paper). 
The cathode, C, and anode, A, 
are as in any cathode ray tube. 
The rays pass through a slit in 
the anode, and a second slit, B, 
is interposed in order to make 
the beam thinner and eliminate 
rays that were not going straight. 
Charging plates D and E shows 
that cathode rays have charge: 
they are attracted toward the pos- 
itive plate D and repelled by the 
negative plate E. 


that they were indeed electrically charged, which was strong evi- 
dence that they were material. Not only that, but he proved that 
they had mass, and measured the ratio of their mass to their charge, 
m/q. Since their mass was not zero, he concluded that they were 
a form of matter, and presumably made up of a stream of micro- 
scopic, negatively charged particles. When Millikan published his 
results fourteen years later, it was reasonable to assume that the 
charge of one such particle equaled minus one fundamental charge, 
q = —e, and from the combination of Thomson’s and Millikan’s re- 
sults one could therefore determine the mass of a single cathode ray 
particle. 


= y 


The basic technique for determining m/q was simply to measure 
the angle through which the charged plates bent the beam. The 
electric force acting on a cathode ray particle while it was between 
the plates would be proportional to its charge, 


Flee = (known constant) - ¢ 


Application of Newton’s second law, a = F'/m, would allow m/q 
to be determined: 


m known constant 


qd a 


There was just one catch. Thomson needed to know the cathode 
ray particles’ velocity in order to figure out their acceleration. At 
that point, however, nobody had even an educated guess as to the 
speed of the cathode rays produced in a given vacuum tube. The 
beam appeared to leap across the vacuum tube practically instan- 
taneously, so it was no simple matter of timing it with a stopwatch! 


Thomson’s clever solution was to observe the effect of both elec- 
tric and magnetic forces on the beam. The magnetic force exerted 
by a particular magnet would depend on both the cathode ray’s 
charge and its velocity: 


Fmag = (known constant #2) - qv 
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Thomson played with the electric and magnetic forces until ei- 
ther one would produce an equal effect on the beam, allowing him 
to solve for the velocity, 


(known constant) 


(known constant #2) 


Knowing the velocity (which was on the order of 10% of the 
speed of light for his setup), he was able to find the acceleration 
and thus the mass-to-charge ratio m/q. Thomson’s techniques were 
relatively crude (or perhaps more charitably we could say that they 
stretched the state of the art of the time), so with various methods 
he came up with m/q values that ranged over about a factor of two, 
even for cathode rays extracted from a cathode made of a single 
material. The best modern value is m/q = 5.69 x 107!? kg/C, 
which is consistent with the low end of Thomson’s range. 


The cathode ray as a subatomic particle: the indexelectronelectron 


What was significant about Thomson’s experiment was not the 
actual numerical value of m/q, however, so much as the fact that, 
combined with Millikan’s value of the fundamental charge, it gave 
a mass for the cathode ray particles that was thousands of times 
smaller than the mass of even the lightest atoms. Even without 
Millikan’s results, which were 14 years in the future, Thomson rec- 
ognized that the cathode rays’ m/q was thousands of times smaller 
than the m/q ratios that had been measured for electrically charged 
atoms in chemical solutions. He correctly interpreted this as evi- 
dence that the cathode rays were smaller building blocks — he called 
them electrons — out of which atoms themselves were formed. This 
was an extremely radical claim, coming at a time when atoms had 
not yet been proven to exist! Even those who used the word “atom” 
often considered them no more than mathematical abstractions, not 
literal objects. The idea of searching for structure inside of “un- 
splittable” atoms was seen by some as lunacy, but within ten years 
Thomson’s ideas had been amply verified by many more detailed 
experiments. 


Discussion Questions 


A __ Thomson started to become convinced during his experiments that 
the “cathode rays” observed coming from the cathodes of vacuum tubes 
were building blocks of atoms — what we now call electrons. He then 
carried out observations with cathodes made of a variety of metals, and 
found that m/q was roughly the same in every case, considering his lim- 
ited accuracy. Given his suspicion, why did it make sense to try different 
metals? How would the consistent values of m/q serve to test his hypoth- 
esis? 


B My students have frequently asked whether the m/g that Thomson 
measured was the value for a single electron, or for the whole beam. Can 
you answer this question? 
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|/The raisin cookie model of 
the atom with four units of 
charge, which we now know to be 
beryllium. 


C Thomson found that the m/q of an electron was thousands of times 
smaller than that of charged atoms in chemical solutions. Would this imply 
that the electrons had more charge? Less mass? Would there be no way 
to tell? Explain. Remember that Millikan’s results were still many years in 
the future, so g was unknown. 


D Can you guess any practical reason why Thomson couldn't just 
let one electron fly across the gap before disconnecting the battery and 
turning off the beam, and then measure the amount of charge deposited 
on the anode, thus allowing him to measure the charge of a single electron 
directly? 

E Why is it not possible to determine m and q themselves, rather than 
just their ratio, by observing electrons’ motion in electric and magnetic 
fields? 


8.1.6 The raisin cookie model of the atom 


Based on his experiments, Thomson proposed a picture of the 
atom which became known as the raisin cookie model. In the neutral 
atom, |, there are four electrons with a total charge of —4e, sitting in 
a sphere (the “cookie” ) with a charge of +4e spread throughout it. 
It was known that chemical reactions could not change one element 
into another, so in Thomson’s scenario, each element’s cookie sphere 
had a permanently fixed radius, mass, and positive charge, different 
from those of other elements. The electrons, however, were not a 
permanent feature of the atom, and could be tacked on or pulled out 
to make charged ions. Although we now know, for instance, that a 
neutral atom with four electrons is the element beryllium, scientists 
at the time did not know how many electrons the various neutral 
atoms possessed. 


This model is clearly different from the one you’ve learned in 
grade school or through popular culture, where the positive charge 
is concentrated in a tiny nucleus at the atom’s center. An equally 
important change in ideas about the atom has been the realization 
that atoms and their constituent subatomic particles behave entirely 
differently from objects on the human scale. For instance, we’ll see 
later that an electron can be in more than one place at one time. 
The raisin cookie model was part of a long tradition of attempts 
to make mechanical models of phenomena, and Thomson and his 
contemporaries never questioned the appropriateness of building a 
mental model of an atom as a machine with little parts inside. To- 
day, mechanical models of atoms are still used (for instance the 
tinker-toy-style molecular modeling kits like the ones used by Wat- 
son and Crick to figure out the double helix structure of DNA), but 
scientists realize that the physical objects are only aids to help our 
brains’ symbolic and visual processes think about atoms. 


Although there was no clear-cut experimental evidence for many 
of the details of the raisin cookie model, physicists went ahead and 
started working out its implications. For instance, suppose you had 
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a four-electron atom. All four electrons would be repelling each 
other, but they would also all be attracted toward the center of the 
“cookie” sphere. The result should be some kind of stable, sym- 
metric arrangement in which all the forces canceled out. People 
sufficiently clever with math soon showed that the electrons in a 
four-electron atom should settle down at the vertices of a pyramid 
with one less side than the Egyptian kind, i.e., a regular tetrahe- 
dron. This deduction turns out to be wrong because it was based 
on incorrect features of the model, but the model also had many 
successes, a few of which we will now discuss. 


Flow of electrical charge in wires example 3 
One of my former students was the son of an electrician, and 
had become an electrician himself. He related to me how his 
father had remained refused to believe all his life that electrons 
really flowed through wires. If they had, he reasoned, the metal 
would have gradually become more and more damaged, eventu- 
ally crumbling to dust. 


His opinion is not at all unreasonable based on the fact that elec- 
trons are material particles, and that matter cannot normally pass 
through matter without making a hole through it. Nineteenth- 
century physicists would have shared his objection to a charged- 
particle model of the flow of electrical charge. In the raisin-cookie 
model, however, the electrons are very low in mass, and there- 
fore presumably very small in size as well. It is not surprising that 
they can slip between the atoms without damaging them. 


Flow of electrical charge across cell membranes example 4 
Your nervous system is based on signals carried by charge mov- 
ing from nerve cell to nerve cell. Your body is essentially all liquid, 
and atoms in a liquid are mobile. This means that, unlike the case 
of charge flowing in a solid wire, entire charged atoms can flow in 
your nervous system 


Emission of electrons in a cathode ray tube example 5 
Why do electrons detach themselves from the cathode of a vac- 
uum tube? Certainly they are encouraged to do so by the re- 
pulsion of the negative charge placed on the cathode and the 
attraction from the net positive charge of the anode, but these are 
not strong enough to rip electrons out of atoms by main force — 
if they were, then the entire apparatus would have been instantly 
vaporized as every atom was simultaneously ripped apart! 


The raisin cookie model leads to a simple explanation. We know 
that heat is the energy of random motion of atoms. The atoms in 
any object are therefore violently jostling each other all the time, 
and a few of these collisions are violent enough to knock electrons 
out of atoms. If this occurs near the surface of a solid object, the 
electron may can come loose. Ordinarily, however, this loss of 
electrons is a self-limiting process; the loss of electrons leaves 
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the object with a net positive charge, which attracts the lost sheep 
home to the fold. (For objects immersed in air rather than vacuum, 
there will also be a balanced exchange of electrons between the 
air and the object.) 


This interpretation explains the warm and friendly yellow glow of 
the vacuum tubes in an antique radio. To encourage the emission 
of electrons from the vacuum tubes’ cathodes, the cathodes are 
intentionally warmed up with little heater coils. 


Discussion Questions 


A Today many people would define an ion as an atom (or molecule) 
with missing electrons or extra electrons added on. How would people 
have defined the word “ion” before the discovery of the electron? 


B__ Since electrically neutral atoms were known to exist, there had to be 
positively charged subatomic stuff to cancel out the negatively charged 
electrons in an atom. Based on the state of knowledge immediately after 
the Millikan and Thomson experiments, was it possible that the positively 
charged stuff had an unquantized amount of charge? Could it be quan- 
tized in units of +e? In units of +2e? In units of +5/7e? 
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8.2 The Nucleus 


8.2.1 Radioactivity 


Becquerel’s discovery of radioactivity 


How did physicists figure out that the raisin cookie model was 
incorrect, and that the atom’s positive charge was concentrated in 
a tiny, central nucleus? The story begins with the discovery of ra- 
dioactivity by the French chemist Becquerel. Up until radioactivity 
was discovered, all the processes of nature were thought to be based 
on chemical reactions, which were rearrangements of combinations 
of atoms. Atoms exert forces on each other when they are close to- 
gether, so sticking or unsticking them would either release or store 
electrical energy. That energy could be converted to and from other 
forms, as when a plant uses the energy in sunlight to make sugars 
and carbohydrates, or when a child eats sugar, releasing the energy 
in the form of kinetic energy. 


Becquerel discovered a process that seemed to release energy 
from an unknown new source that was not chemical. Becquerel, 
whose father and grandfather had also been physicists, spent the 
first twenty years of his professional life as a successful civil engi- 
neer, teaching physics on a part-time basis. He was awarded the 
chair of physics at the Musée d’Histoire Naturelle in Paris after the 
death of his father, who had previously occupied it. Having now a 
significant amount of time to devote to physics, he began studying 


a/Marie and Pierre Curie were 
the first to purify radium in signifi- 
cant quantities. Radium’s intense 
radioactivity made possible the 
experiments that led to the mod- 
ern planetary model of the atom, 
in which electrons orbit a nucleus 
made of protons and neutrons. 
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of a Maltese cross that was 
placed between the plate and the 
uranium salts. 


the interaction of light and matter. He became interested in the phe- 
nomenon of phosphorescence, in which a substance absorbs energy 
from light, then releases the energy via a glow that only gradually 
goes away. One of the substances he investigated was a uranium 
compound, the salt UKSO5. One day in 1896, cloudy weather in- 
terfered with his plan to expose this substance to sunlight in order 
to observe its fluorescence. He stuck it in a drawer, coincidentally on 
top of a blank photographic plate — the old-fashioned glass-backed 
counterpart of the modern plastic roll of film. The plate had been 
carefully wrapped, but several days later when Becquerel checked it 
in the darkroom before using it, he found that it was ruined, as if it 
had been completely exposed to light. 


History provides many examples of scientific discoveries that 
happened this way: an alert and inquisitive mind decides to in- 
vestigate a phenomenon that most people would not have worried 
about explaining. Becquerel first determined by further experiments 
that the effect was produced by the uranium salt, despite a thick 
wrapping of paper around the plate that blocked out all light. He 
tried a variety of compounds, and found that it was the uranium 
that did it: the effect was produced by any uranium compound, but 
not by any compound that didn’t include uranium atoms. The effect 
could be at least partially blocked by a sufficient thickness of metal, 
and he was able to produce silhouettes of coins by interposing them 
between the uranium and the plate. This indicated that the effect 
traveled in a straight line., so that it must have been some kind of 
ray rather than, e.g., the seepage of chemicals through the paper. 
He used the word “radiations,” since the effect radiated out from 
the uranium salt. 


At this point Becquerel still believed that the uranium atoms 
were absorbing energy from light and then gradually releasing the 
energy in the form of the mysterious rays, and this was how he 
presented it in his first published lecture describing his experiments. 
Interesting, but not earth-shattering. But he then tried to determine 
how long it took for the uranium to use up all the energy that had 
supposedly been stored in it by light, and he found that it never 
seemed to become inactive, no matter how long he waited. Not only 
that, but a sample that had been exposed to intense sunlight for a 
whole afternoon was no more or less effective than a sample that 
had always been kept inside. Was this a violation of conservation 
of energy? If the energy didn’t come from exposure to light, where 
did it come from? 


Three kinds of “radiations” 


Unable to determine the source of the energy directly, turn-of- 
the-century physicists instead studied the behavior of the “radia- 
tions” once they had been emitted. Becquerel had already shown 
that the radioactivity could penetrate through cloth and paper, so 
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the first obvious thing to do was to investigate in more detail what 
thickness of material the radioactivity could get through. They soon 
learned that a certain fraction of the radioactivity’s intensity would 
be eliminated by even a few inches of air, but the remainder was 
not eliminated by passing through more air. Apparently, then, the 
radioactivity was a mixture of more than one type, of which one was 
blocked by air. They then found that of the part that could pene- 
trate air, a further fraction could be eliminated by a piece of paper 
or a very thin metal foil. What was left after that, however, was 
a third, extremely penetrating type, some of whose intensity would 
still remain even after passing through a brick wall. They decided 
that this showed there were three types of radioactivity, and with- 
out having the faintest idea of what they really were, they made up 
names for them. The least penetrating type was arbitrarily labeled 
a (alpha), the first letter of the Greek alphabet, and so on through 
G (beta) and finally ~ (gamma) for the most penetrating type. 


Radium: a more intense source of radioactivity 


The measuring devices used to detect radioactivity were crude: 
photographic plates or even human eyeballs (radioactivity makes 
flashes of light in the jelly-like fluid inside the eye, which can be 
seen by the eyeball’s owner if it is otherwise very dark). Because the 
ways of detecting radioactivity were so crude and insensitive, further 
progress was hindered by the fact that the amount of radioactivity 
emitted by uranium was not really very great. The vital contribu- 
tion of physicist /chemist Marie Curie and her husband Pierre was 
to discover the element radium, and to purify and isolate significant 
quantities it. Radium emits about a million times more radioactivity 
per unit mass than uranium, making it possible to do the experi- 
ments that were needed to learn the true nature of radioactivity. 
The dangers of radioactivity to human health were then unknown, 
and Marie died of leukemia thirty years later. (Pierre was run over 
and killed by a horsecart.) 


Tracking down the nature of alphas, betas, and gammas 


As radium was becoming available, an apprentice scientist named 
Ernest Rutherford arrived in England from his native New Zealand 
and began studying radioactivity at the Cavendish Laboratory. The 
young colonial’s first success was to measure the mass-to-charge ra- 
tio of beta rays. The technique was essentially the same as the one 
Thomson had used to measure the mass-to-charge ratio of cathode 
rays by measuring their deflections in electric and magnetic fields. 
The only difference was that instead of the cathode of a vacuum 
tube, a nugget of radium was used to supply the beta rays. Not 
only was the technique the same, but so was the result. Beta rays 
had the same m/q ratio as cathode rays, which suggested they were 
one and the same. Nowadays, it would make sense simply to use 
the term “electron,” and avoid the archaic “cathode ray” and “beta 
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showing that alpha particles were 
doubly ionized helium atoms. 


e/These pellets of uranium 
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and beta radiation, which the 
gloves are thick enough to stop. 


particle,” but the old labels are still widely used, and it is unfortu- 
nately necessary for physics students to memorize all three names 
for the same thing. 


At first, it seemed that neither alphas or gammas could be de- 
flected in electric or magnetic fields, making it appear that neither 
was electrically charged. But soon Rutherford obtained a much more 
powerful magnet, and was able to use it to deflect the alphas but 
not the gammas. The alphas had a much larger value of m/q than 
the betas (about 4000 times greater), which was why they had been 
so hard to deflect. Gammas are uncharged, and were later found to 
be a form of light. 


The m/q ratio of alpha particles turned out to be the same 
as those of two different types of ions, Hett (a helium atom with 
two missing electrons) and H} (two hydrogen atoms bonded into a 
molecule, with one electron missing) , so it seemed likely that they 
were one or the other of those. The diagram shows a simplified ver- 
sion of Rutherford’s ingenious experiment proving that they were 
He** ions. The gaseous element radon, an alpha emitter, was in- 
troduced into one half of a double glass chamber. The glass wall 
dividing the chamber was made extremely thin, so that some of the 
rapidly moving alpha particles were able to penetrate it. The other 
chamber, which was initially evacuated, gradually began to accu- 
mulate a population of alpha particles (which would quickly pick up 
electrons from their surroundings and become electrically neutral). 
Rutherford then determined that it was helium gas that had ap- 
peared in the second chamber. Thus alpha particles were proved to 
be Het* ions. The nucleus was yet to be discovered, but in modern 
terms, we would describe a Het* ion as the nucleus of a He atom. 


To summarize, here are the three types of radiation emitted by 
radioactive elements, and their descriptions in modern terms: 


a particle | stopped by a few inches of air | He nucleus 
6 particle | stopped by a piece of paper 
y ray penetrates thick shielding 


electron 
a type of light 


Discussion Questions 


A Most sources of radioactivity emit alphas, betas, and gammas, not 
just one of the three. In the radon experiment, how did Rutherford know 
that he was studying the alphas? 


8.2.2 The planetary model of the atom 


The stage was now set for the unexpected discovery that the 
positively charged part of the atom was a tiny, dense lump at the 
atom’s center rather than the “cookie dough” of the raisin cookie 
model. By 1909, Rutherford was an established professor, and had 
students working under him. For a raw undergraduate named Mars- 
den, he picked a research project he thought would be tedious but 
straightforward. 
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It was already known that although alpha particles would be 
stopped completely by a sheet of paper, they could pass through a 
sufficiently thin metal foil. Marsden was to work with a gold foil 
only 1000 atoms thick. (The foil was probably made by evaporating 
a little gold in a vacuum chamber so that a thin layer would be 
deposited on a glass microscope slide. The foil would then be lifted 
off the slide by submerging the slide in water.) 


Rutherford had already determined in his previous experiments 
the speed of the alpha particles emitted by radium, a fantastic 1.5 x 
10’ m/s. The experimenters in Rutherford’s group visualized them 
as very small, very fast cannonballs penetrating the “cookie dough” 
part of the big gold atoms. A piece of paper has a thickness of a 
hundred thousand atoms or so, which would be sufficient to stop 
them completely, but crashing through a thousand would only slow 
them a little and turn them slightly off of their original paths. 


Marsden’s supposedly ho-hum assignment was to use the appa- 
ratus shown in figure g to measure how often alpha particles were 
deflected at various angles. A tiny lump of radium in a box emit- 
ted alpha particles, and a thin beam was created by blocking all 
the alphas except those that happened to pass out through a tube. 
Typically deflected in the gold by only a small amount, they would 
reach a screen very much like the screen of a TV’s picture tube, 
which would make a flash of light when it was hit. Here is the first 
example we have encountered of an experiment in which a beam of 
particles is detected one at a time. This was possible because each 
alpha particle carried so much kinetic energy; they were moving at 
about the same speed as the electrons in the Thomson experiment, 
but had ten thousand times more mass. 


Marsden sat in a dark room, watching the apparatus hour after 
hour and recording the number of flashes with the screen moved to 
various angles. The rate of the flashes was highest when he set the 
screen at an angle close to the line of the alphas’ original path, but if 
he watched an area farther off to the side, he would also occasionally 
see an alpha that had been deflected through a larger angle. After 
seeing a few of these, he got the crazy idea of moving the screen to 
see if even larger angles ever occurred, perhaps even angles larger 
than 90 degrees. 


The crazy idea worked: a few alpha particles were deflected 
through angles of up to 180 degrees, and the routine experiment 
had become an epoch-making one. Rutherford said, “We have been 
able to get some of the alpha particles coming backwards. It was 
almost as incredible as if you fired a 15-inch shell at a piece of tissue 
paper and it came back and hit you.” Explanations were hard to 
come by in the raisin cookie model. What intense electrical forces 
could have caused some of the alpha particles, moving at such astro- 
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h/ Alpha particles being scat- 
tered by a gold nucleus. On this 
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nomical speeds, to change direction so drastically? Since each gold 
atom was electrically neutral, it would not exert much force on an 
alpha particle outside it. True, if the alpha particle was very near to 
or inside of a particular atom, then the forces would not necessarily 
cancel out perfectly; if the alpha particle happened to come very 
close to a particular electron, the 1/r? form of the Coulomb force 
law would make for a very strong force. But Marsden and Ruther- 
ford knew that an alpha particle was 8000 times more massive than 
an electron, and it is simply not possible for a more massive object 
to rebound backwards from a collision with a less massive object 
while conserving momentum and energy. It might be possible in 
principle for a particular alpha to follow a path that took it very 
close to one electron, and then very close to another electron, and so 
on, with the net result of a large deflection, but careful calculations 
showed that such multiple “close encounters” with electrons would 
be millions of times too rare to explain what was actually observed. 


At this point, Rutherford and Marsden dusted off an unpop- 
ular and neglected model of the atom, in which all the electrons 
orbited around a small, positively charged core or “nucleus,” just 
like the planets orbiting around the sun. All the positive charge 
and nearly all the mass of the atom would be concentrated in the 
nucleus, rather than spread throughout the atom as in the raisin 
cookie model. The positively charged alpha particles would be re- 
pelled by the gold atom’s nucleus, but most of the alphas would not 
come close enough to any nucleus to have their paths drastically 
altered. The few that did come close to a nucleus, however, could 
rebound backwards from a single such encounter, since the nucleus of 
a heavy gold atom would be fifty times more massive than an alpha 
particle. It turned out that it was not even too difficult to derive a 
formula giving the relative frequency of deflections through various 
angles, and this calculation agreed with the data well enough (to 
within 15%), considering the difficulty in getting good experimental 
statistics on the rare, very large angles. 


What had started out as a tedious exercise to get a student 
started in science had ended as a revolution in our understanding 
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of nature. Indeed, the whole thing may sound a little too much 
like a moralistic fable of the scientific method with overtones of 
the Horatio Alger genre. The skeptical reader may wonder why 
the planetary model was ignored so thoroughly until Marsden and 
Rutherford’s discovery. Is science really more of a sociological enter- 
prise, in which certain ideas become accepted by the establishment, 
and other, equally plausible explanations are arbitrarily discarded? 
Some social scientists are currently ruffling a lot of scientists’ feath- 
ers with critiques very much like this, but in this particular case, 
there were very sound reasons for rejecting the planetary model. As 
you'll learn in more detail later in this course, any charged particle 
that undergoes an acceleration dissipate energy in the form of light. 
In the planetary model, the electrons were orbiting the nucleus in 
circles or ellipses, which meant they were undergoing acceleration, 
just like the acceleration you feel in a car going around acurve. They 
should have dissipated energy as light, and eventually they should 
have lost all their energy. Atoms don’t spontaneously collapse like 
that, which was why the raisin cookie model, with its stationary 
electrons, was originally preferred. There were other problems as 
well. In the planetary model, the one-electron atom would have 
to be flat, which would be inconsistent with the success of molecu- 
lar modeling with spherical balls representing hydrogen and atoms. 
These molecular models also seemed to work best if specific sizes 
were used for different atoms, but there is no obvious reason in the 
planetary model why the radius of an electron’s orbit should be a 
fixed number. In view of the conclusive Marsden-Rutherford results, 
however, these became fresh puzzles in atomic physics, not reasons 
for disbelieving the planetary model. 


Some phenomena explained with the planetary model 


The planetary model may not be the ultimate, perfect model of 
the atom, but don’t underestimate its power. It already allows us 
to visualize correctly a great many phenomena. 


As an example, let’s consider the distinctions among nonmetals, 
metals that are magnetic, and metals that are nonmagnetic. As 
shown in figure j, a metal differs from a nonmetal because its outer- 
most electrons are free to wander rather than owing their allegiance 
to a particular atom. A metal that can be magnetized is one that 
is willing to line up the rotations of some of its electrons so that 
their axes are parallel. Recall that magnetic forces are forces made 
by moving charges; we have not yet discussed the mathematics and 
geometry of magnetic forces, but it is easy to see how random ori- 
entations of the atoms in the nonmagnetic substance would lead to 
cancellation of the forces. 


Even if the planetary model does not immediately answer such 
questions as why one element would be a metal and another a non- 
metal, these ideas would be difficult or impossible to conceptualize 
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j/The planetary model ap- 
plied to a nonmetal, 1, an 
unmagnetized metal, 2, and a 
magnetized metal, 3. Note that 
these figures are all simplified in 
several ways. For one thing, the 
electrons of an individual atom do 
not all revolve around the nucleus 
in the same plane. It is also very 
unusual for a metal to become so 
strongly magnetized that 100% 
of its atoms have their rotations 
aligned as shown in this figure. 
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k/A modern periodic table, 
labeled with atomic numbers. 
Mendeleev’s original table was 
upside-down compared to this 
one. 


in the raisin cookie model. 
Discussion Questions 


A In reality, charges of the same type repel one another and charges 
of different types are attracted. Suppose the rules were the other way 
around, giving repulsion between opposite charges and attraction be- 
tween similar ones. What would the universe be like? 


B_ Rutherford knew the velocity of the alpha particles emitted by radium, 
and guessed that the positively charged part of a gold atom had a charge 
of about +100e (we now know it is +79e). Considering the fact that some 
alpha particles were deflected by 180°, how could he then use conserva- 
tion of energy to derive an upper limit on the size of a gold nucleus? (For 
simplicity, assume the size of the alpha particle is negligible compared to 
that of the gold nucleus, and ignore the fact that the gold nucleus recoils 
a little from the collision, picking up a little kinetic energy.) 


8.2.3 Atomic number 

As alluded to in a discussion question in the previous section, 
scientists of this period had only a very approximate idea of how 
many units of charge resided in the nuclei of the various chemical 
elements. Although we now associate the number of units of nu- 
clear charge with the element’s position on the periodic table, and 
call it the atomic number, they had no idea that such a relation- 
ship existed. Mendeleev’s table just seemed like an organizational 
tool, not something with any necessary physical significance. And 
everything Mendeleev had done seemed equally valid if you turned 
the table upside-down or reversed its left and right sides, so even if 
you wanted to number the elements sequentially with integers, there 
was an ambiguity as to how to do it. Mendeleev’s original table was 
in fact upside-down compared to the modern one. 


1 2 
H He 
3 5 6 7 8 9 10 
Li | Be B;C|N/]O]|F |Ne 
1 12 13 14 15 16 17 18 
Na| Mg Al} Si} P| S | Cl) Ar 
79 | 20] 24 22] 23] 24] 25] 26 | 27] 28] 29) 30 | 31 | a2 | 33 | 34 | 35 | 36 
K |Ca| Sc} | Ti | V | Cr|Mn} Fe} Co} Ni | Cu] Zn| Ga] Ge] As | Se) Br | Kr 
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 
Rb| Sr| Y | | Zr | Nb| Mo} Tc} Ru} Rh| Pd| Ag} Cd) In| Sn| Sb} Te) | | Xe 
55 56 57 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 
Cs | Ba| La}*| Hf | Ta; W| Re} Os} Ir | Pt | Auj Hg} Tl) Pb] Bi | Po} At | Rn 
87 | 88 | 89 104 | 105 | 106 | 107 | 108 | 109 | 110 | 111 | 112 | 113 | 114 | 115 | 116 ] 117 | 118 
Fr | Ra} Ac #4 Rf | Ha 


58 61 62 63 64 | 65 66 | 67 | 68 69 70 71 
*|Ce}Pr |Nd|Pm/Sm| Eu |Gd| Tb | Dy} Ho} Er | Tm] Yb} Lu 

9 

1 


93 94 | 95 
U |Np | Pu | Am|Cm| Bk | Cf | Es | Fm) Md} Noj| Lr 


kk 


In the period immediately following the discovery of the nucleus, 
physicists only had rough estimates of the charges of the various 
nuclei. In the case of the very lightest nuclei, they simply found 
the maximum number of electrons they could strip off by various 


438 Chapter 8 Atoms and Electromagnetism 


methods: chemical reactions, electric sparks, ultraviolet light, and 
so on. For example they could easily strip of one or two electrons 
from helium, making Het or Het*, but nobody could make He**™*, 
presumably because the nuclear charge of helium was only +2e. 
Unfortunately only a few of the lightest elements could be stripped 
completely, because the more electrons were stripped off, the greater 
the positive net charge remaining, and the more strongly the rest of 
the negatively charged electrons would be held on. The heavy ele- 
ments’ atomic numbers could only be roughly extrapolated from the 
light elements, where the atomic number was about half the atom’s 
mass expressed in units of the mass of a hydrogen atom. Gold, for 
example, had a mass about 197 times that of hydrogen, so its atomic 
number was estimated to be about half that, or somewhere around 
100. We now know it to be 79. 


How did we finally find out? The riddle of the nuclear charges 
was at last successfully attacked using two different techniques, 
which gave consistent results. One set of experiments, involving 
x-rays, was performed by the young Henry Mosely, whose scientific 
brilliance was soon to be sacrificed in a battle between European im- 
perialists over who would own the Dardanelles, during that pointless 
conflict then known as the War to End All Wars, and now referred 
to as World War I. 


alpha particle 


e 
copper nucleus 


alpha particle 


gold nucleus 


Since Mosely’s analysis requires several concepts with which you 
are not yet familiar, we will instead describe the technique used 
by James Chadwick at around the same time. An added bonus of 
describing Chadwick’s experiments is that they presaged the impor- 
tant modern technique of studying collisions of subatomic particles. 
In grad school, I worked with a professor whose thesis adviser’s the- 
sis adviser was Chadwick, and he related some interesting stories 
about the man. Chadwick was apparently a little nutty and a com- 
plete fanatic about science, to the extent that when he was held in a 


1/ An alpha particle has to come 
much closer to the low-charged 
copper nucleus in order to be de- 


flected through the same angle. 
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m/An alpha particle must be 
headed for the ring on the front 
of the imaginary cylindrical pipe 
in order to produce scattering at 
an angle between 19 and 20 de- 
grees. The area of this ring 
is called the “cross-section” for 
scattering at 19-20°because it is 
the cross-sectional area of a cut 
through the pipe. 


German prison camp during World War II, he managed to cajole his 
captors into allowing him to scrounge up parts from broken radios 
so that he could attempt to do physics experiments. 


Chadwick’s experiment worked like this. Suppose you perform 
two Rutherford-type alpha scattering measurements, first one witha 
gold foil as a target as in Rutherford’s original experiment, and then 
one with a copper foil. It is possible to get large angles of deflection 
in both cases, but as shown in figure m, the alpha particle must 
be heading almost straight for the copper nucleus to get the same 
angle of deflection that would have occurred with an alpha that 
was much farther off the mark; the gold nucleus’ charge is so much 
greater than the copper’s that it exerts a strong force on the alpha 
particle even from far off. The situation is very much like that of a 
blindfolded person playing darts. Just as it is impossible to aim an 
alpha particle at an individual nucleus in the target, the blindfolded 
person cannot really aim the darts. Achieving a very close encounter 
with the copper atom would be akin to hitting an inner circle on the 
dartboard. It’s much more likely that one would have the luck to 
hit the outer circle, which covers a greater number of square inches. 
By analogy, if you measure the frequency with which alphas are 
scattered by copper at some particular angle, say between 19 and 
20 degrees, and then perform the same measurement at the same 
angle with gold, you get a much higher percentage for gold than for 
copper. 


nucleus 


In fact, the numerical ratio of the two nuclei’s charges can be 
derived from this same experimentally determined ratio. Using the 
standard notation Z for the atomic number (charge of the nucleus 
divided by e), the following equation can be proved (example 6): 


Zuid number of alphas scattered by gold at 19-20° 


Z2 ~~ number of alphas scattered by copper at 19-20° 


copper 


By making such measurements for targets constructed from all the 
elements, one can infer the ratios of all the atomic numbers, and 
since the atomic numbers of the light elements were already known, 


440 Chapter 8 Atoms and Electromagnetism 


atomic numbers could be assigned to the entire periodic table. Ac- 
cording to Mosely, the atomic numbers of copper, silver and plat- 
inum were 29, 47, and 78, which corresponded well with their posi- 
tions on the periodic table. Chadwick’s figures for the same elements 
were 29.3, 46.3, and 77.4, with error bars of about 1.5 times the fun- 
damental charge, so the two experiments were in good agreement. 


The point here is absolutely not that you should be ready to plug 
numbers into the above equation for a homework or exam question! 
My overall goal in this chapter is to explain how we know what we 
know about atoms. An added bonus of describing Chadwick’s ex- 
periment is that the approach is very similar to that used in modern 
particle physics experiments, and the ideas used in the analysis are 
closely related to the now-ubiquitous concept of a “cross-section.” 
In the dartboard analogy, the cross-section would be the area of the 
circular ring you have to hit. The reasoning behind the invention of 
the term “cross-section” can be visualized as shown in figure m. In 
this language, Rutherford’s invention of the planetary model came 
from his unexpected discovery that there was a nonzero cross-section 
for alpha scattering from gold at large angles, and Chadwick con- 
firmed Mosely’s determinations of the atomic numbers by measuring 
cross-sections for alpha scattering. 


Proof of the relationship between Z and scattering example 6 
The equation above can be derived by the following not very rigor- 
ous proof. To deflect the alpha particle by a certain angle requires 
that it acquire a certain momentum component in the direction 
perpendicular to its original momentum. Although the nucleus’s 
force on the alpha particle is not constant, we can pretend that 
it is approximately constant during the time when the alpha is 
within a distance equal to, say, 150% of its distance of closest 
approach, and that the force is zero before and after that part of 
the motion. (If we chose 120% or 200%, it shouldn’t make any 
difference in the final result, because the final result is a ratio, 
and the effects on the numerator and denominator should cancel 
each other.) In the approximation of constant force, the change 
in the alpha’s perpendicular momentum component is then equal 
to FAt. The Coulomb force law says the force is proportional to 
Z/r?. Although r does change somewhat during the time interval 
of interest, it’s good enough to treat it as a constant number, since 
we're only computing the ratio between the two experiments’ re- 
sults. Since we are approximating the force as acting over the 
time during which the distance is not too much greater than the 
distance of closest approach, the time interval At must be propor- 
tional to r, and the sideways momentum imparted to the alpha, 
FAt, is proportional to (Z/r)r, or Z/r. If we’re comparing alphas 
scattered at the same angle from gold and from copper, then Ap 
is the same in both cases, and the proportionality Ap « Z/r tells 
us that the ones scattered from copper at that angle had to be 
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headed in along a line closer to the central axis by a factor equal- 
INQ Zgo1a/Zcopper If you imagine a “dartboard ring” that the alphas 
have to hit, then the ring for the gold experiment has the same 
proportions as the one for copper, but it is enlarged by a factor 
equal to Zgo1id/Zoopper- That is, not only is the radius of the ring 
greater by that factor, but unlike the rings on a normal dartboard, 
the thickness of the outer ring is also greater in proportion to its 
radius. When you take a geometric shape and scale it up in size 
like a photographic enlargement, its area is increased in propor- 
tion to the square of the enlargement factor, so the area of the 
dartboard ring in the gold experiment is greater by a factor equal 
to (Zgoia/Zcopper)*. Since the alphas are aimed entirely randomly, 
the chances of an alpha hitting the ring are in proportion to the 
area of the ring, which proves the equation given above. 


As an example of the modern use of scattering experiments and 
cross-section measurements, you may have heard of the recent ex- 
perimental evidence for the existence of a particle called the top 
quark. Of the twelve subatomic particles currently believed to be the 
smallest constituents of matter, six form a family called the quarks, 
distinguished from the other six by the intense attractive forces that 
make the quarks stick to each other. (The other six consist of the 
electron plus five other, more exotic particles.) The only two types of 
quarks found in naturally occurring matter are the “up quark” and 
“down quark,” which are what protons and neutrons are made of, 
but four other types were theoretically predicted to exist, for a total 
of six. (The whimsical term “quark” comes from a line by James 
Joyce reading “Three quarks for master Mark.”) Until recently, only 
five types of quarks had been proven to exist via experiments, and 
the sixth, the top quark, was only theorized. There was no hope 
of ever detecting a top quark directly, since it is radioactive, and 
only exists for a zillionth of a second before evaporating. Instead, 
the researchers searching for it at the Fermi National Accelerator 
Laboratory near Chicago measured cross-sections for scattering of 
nuclei off of other nuclei. The experiment was much like those of 
Rutherford and Chadwick, except that the incoming nuclei had to 
be boosted to much higher speeds in a particle accelerator. The 
resulting encounter with a target nucleus was so violent that both 
nuclei were completely demolished, but, as Einstein proved, energy 
can be converted into matter, and the energy of the collision creates 
a spray of exotic, radioactive particles, like the deadly shower of 
wood fragments produced by a cannon ball in an old naval battle. 
Among those particles were some top quarks. The cross-sections 
being measured were the cross-sections for the production of certain 
combinations of these secondary particles. However different the 
details, the principle was the same as that employed at the turn of 
the century: you smash things together and look at the fragments 
that fly off to see what was inside them. The approach has been 
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compared to shooting a clock with a rifle and then studying the 
pieces that fly off to figure out how the clock worked. 


Discussion Questions 


A The diagram, showing alpha particles being deflected by a gold 
nucleus, was drawn with the assumption that alpha particles came in on 
lines at many different distances from the nucleus. Why wouldn’t they all 
come in along the same line, since they all came out through the same 
tube? 


B Why does it make sense that, as shown in the figure, the trajectories 
that result in 19° and 20° scattering cross each other? 


8.2.4 The structure of nuclei 
The proton 


The fact that the nuclear charges were all integer multiples of e 
suggested to many physicists that rather than being a pointlike ob- 
ject, the nucleus might contain smaller particles having individual 
charges of +e. Evidence in favor of this idea was not long in arriv- 
ing. Rutherford reasoned that if he bombarded the atoms of a very 
light element with alpha particles, the small charge of the target 
nuclei would give a very weak repulsion. Perhaps those few alpha 
particles that happened to arrive on head-on collision courses would 
get so close that they would physically crash into some of the target 
nuclei. An alpha particle is itself a nucleus, so this would be a col- 
lision between two nuclei, and a violent one due to the high speeds 
involved. Rutherford hit pay dirt in an experiment with alpha par- 
ticles striking a target containing nitrogen atoms. Charged particles 
were detected flying out of the target like parts flying off of cars in 
a high-speed crash. Measurements of the deflection of these parti- 
cles in electric and magnetic fields showed that they had the same 
charge-to-mass ratio as singly-ionized hydrogen atoms. Rutherford 
concluded that these were the conjectured singly-charged particles 
that held the charge of the nucleus, and they were later named 
protons. The hydrogen nucleus consists of a single proton, and in 
general, an element’s atomic number gives the number of protons 
contained in each of its nuclei. The mass of the proton is about 1800 
times greater than the mass of the electron. 


The neutron 


It would have been nice and simple if all the nuclei could have 
been built only from protons, but that couldn’t be the case. If you 
spend a little time looking at a periodic table, you will soon notice 
that although some of the atomic masses are very nearly integer 
multiples of hydrogen’s mass, many others are not. Even where the 
masses are close whole numbers, the masses of an element other 
than hydrogen is always greater than its atomic number, not equal 
to it. Helium, for instance, has two protons, but its mass is four 
times greater than that of hydrogen. 
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n/Examples of the construction 
of atoms: hydrogen (top) and 
helium (bottom). On this scale, 
the electrons’ orbits would be the 
size of a college campus. 


Chadwick cleared up the confusion by proving the existence of 
a new subatomic particle. Unlike the electron and proton, which 
are electrically charged, this particle is electrically neutral, and he 
named it the neutron. Chadwick’s experiment has been described 
in detail in chapter 3, but briefly the method was to expose a sam- 
ple of the light element beryllium to a stream of alpha particles 
from a lump of radium. Beryllium has only four protons, so an al- 
pha that happens to be aimed directly at a beryllium nucleus can 
actually hit it rather than being stopped short of a collision by elec- 
trical repulsion. Neutrons were observed as a new form of radiation 
emerging from the collisions, and Chadwick correctly inferred that 
they were previously unsuspected components of the nucleus that 
had been knocked out. As described in 3, Chadwick also determined 
the mass of the neutron; it is very nearly the same as that of the 
proton. 


To summarize, atoms are made of three types of particles: 


charge | mass in units of | location in atom 
the proton’s mass 


proton | +e 1 in nucleus 
neutron | 0 1.001 in nucleus 
electron | —e 1/1836 orbiting nucleus 


The existence of neutrons explained the mysterious masses of 
the elements. Helium, for instance, has a mass very close to four 
times greater than that of hydrogen. This is because it contains 
two neutrons in addition to its two protons. The mass of an atom is 
essentially determined by the total number of neutrons and protons. 
The total number of neutrons plus protons is therefore referred to 
as the atom’s mass number. 


Isotopes 


We now have a clear interpretation of the fact that helium is 
close to four times more massive than hydrogen, and similarly for 
all the atomic masses that are close to an integer multiple of the 
mass of hydrogen. But what about copper, for instance, which had 
an atomic mass 63.5 times that of hydrogen? It didn’t seem rea- 
sonable to think that it possessed an extra half of a neutron! The 
solution was found by measuring the mass-to-charge ratios of singly- 
ionized atoms (atoms with one electron removed). The technique 
is essentially that same as the one used by Thomson for cathode 
rays, except that whole atoms do not spontaneously leap out of the 
surface of an object as electrons sometimes do. Figure o shows an 
example of how the ions can be created and injected between the 
charged plates for acceleration. 


Injecting a stream of copper ions into the device, we find a sur- 
prise — the beam splits into two parts! Chemists had elevated to 
dogma the assumption that all the atoms of a given element were 
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identical, but we find that 69% of copper atoms have one mass, and 
31% have another. Not only that, but both masses are very nearly 
integer multiples of the mass of hydrogen (63 and 65, respectively). 
Copper gets its chemical identity from the number of protons in its 
nucleus, 29, since chemical reactions work by electric forces. But 
apparently some copper atoms have 63 — 29 = 34 neutrons while 
others have 65 — 29 = 36. The atomic mass of copper, 63.5, reflects 
the proportions of the mixture of the mass-63 and mass-65 varieties. 
The different mass varieties of a given element are called isotopesof 
that element. 


Isotopes can be named by giving the mass number as a subscript 
to the left of the chemical symbol, e.g., ®’Cu. Examples: 


protons | neutrons | mass number 
'H 1 0 0+1=1 
eHe. (2 2 DAD = A 
Re 6 6 6+6 = 12 
i 6 8 6+8 = 14 
62a | 105 157 105+157 = 262 
self-check D 


Why are the positive and negative charges of the accelerating plates 
reversed in the isotope-separating apparatus compared to the Thomson 
apparatus? > Answer, p. 870 


Chemical reactions are all about the exchange and sharing of 
electrons: the nuclei have to sit out this dance because the forces 
of electrical repulsion prevent them from ever getting close enough 
to make contact with each other. Although the protons do have a 
vitally important effect on chemical processes because of their elec- 
trical forces, the neutrons can have no effect on the atom’s chemical 
reactions. It is not possible, for instance, to separate ©*Cu from ©°Cu 
by chemical reactions. This is why chemists had never realized that 
different isotopes existed. (To be perfectly accurate, different iso- 
topes do behave slightly differently because the more massive atoms 
move more sluggishly and therefore react with a tiny bit less inten- 
sity. This tiny difference is used, for instance, to separate out the 
isotopes of uranium needed to build a nuclear bomb. The smallness 
of this effect makes the separation process a slow and difficult one, 
which is what we have to thank for the fact that nuclear weapons 
have not been built by every terrorist cabal on the planet.) 


Sizes and shapes of nuclei 


Matter is nearly all nuclei if you count by weight, but in terms 
of volume nuclei don’t amount to much. The radius of an individual 
neutron or proton is very close to 1 fm (1 fm=10~!° m), so even a big 
lead nucleus with a mass number of 208 still has a diameter of only 
about 13 fm, which is ten thousand times smaller than the diameter 
of a typical atom. Contrary to the usual imagery of the nucleus as a 
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extra charged plates 
or magnets for de- 
flecting the beam _ . 
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vacuum chamber 


0/A version of the Thomson 
apparatus modified for measuring 
the mass-to-charge ratios of 
ions rather than electrons. A 
small sample of the element in 
question, copper in our example, 
is boiled in the oven to create 
a thin vapor. (A vacuum pump 
is continuously sucking on the 
main chamber to keep it from 
accumulating enough gas to stop 
the beam of ions.) Some of the 
atoms of the vapor are ionized by 
a spark or by ultraviolet light. lons 
that wander out of the nozzle 
and into the region between 
the charged plates are then 
accelerated toward the top of the 
figure. As in the Thomson experi- 
ment, mass-to-charge ratios are 
inferred from the deflection of the 
beam. 
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p/A nuclear power plant at Cat- 
tenom, France. Unlike the coal 
and oil plants that supply most 
of the U.S’s electrical power, a 
nuclear power plant like this one 
releases no pollution or green- 
house gases into the Earth’s at- 
mosphere, and therefore doesn’t 
contribute to global warming. The 
white stuff puffing out of this 
plant is non-radioactive water va- 
por. Although nuclear power 
plants generate long-lived nuclear 
waste, this waste arguably poses 
much less of a threat to the bio- 
sphere than greenhouse gases 
would. 


strong nuclear 


T= 
electrical 


u force « 1/r2 


lfm distance 


q/The strong nuclear force 
cuts off very sharply at a range of 
about 1 fm. 


small sphere, it turns out that many nuclei are somewhat elongated, 
like an American football, and a few have exotic asymmetric shapes 
like pears or kiwi fruits. 


Discussion Questions 


A Suppose the entire universe was in a (very large) cereal box, and 
the nutritional labeling was supposed to tell a godlike consumer what per- 
centage of the contents was nuclei. Roughly what would the percentage 
be like if the labeling was according to mass? What if it was by volume? 


9? 


8.2.5 The strong nuclear force, alpha decay and fission 


Once physicists realized that nuclei consisted of positively charged 
protons and uncharged neutrons, they had a problem on their hands. 
The electrical forces among the protons are all repulsive, so the nu- 
cleus should simply fly apart! The reason all the nuclei in your 
body are not spontaneously exploding at this moment is that there 
is another force acting. This force, called the strong nuclear force, is 
always attractive, and acts between neutrons and neutrons, neutrons 
and protons, and protons and protons with roughly equal strength. 
The strong nuclear force does not have any effect on electrons, which 
is why it does not influence chemical reactions. 


Unlike electric forces, whose strengths are given by the simple 
Coulomb force law, there is no simple formula for how the strong 
nuclear force depends on distance. Roughly speaking, it is effec- 
tive over ranges of ~ 1 fm, but falls off extremely quickly at larger 
distances (much faster than 1/r?). Since the radius of a neutron or 
proton is about 1 fm, that means that when a bunch of neutrons and 
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protons are packed together to form a nucleus, the strong nuclear 
force is effective only between neighbors. 


Figure r illustrates how the strong nuclear force acts to keep 
ordinary nuclei together, but is not able to keep very heavy nuclei 
from breaking apart. In r/1, a proton in the middle of a carbon 
nucleus feels an attractive strong nuclear force (arrows) from each 
of its nearest neighbors. The forces are all in different directions, 
and tend to cancel out. The same is true for the repulsive electrical 
forces (not shown). In figure r/2, a proton at the edge of the nucleus 
has neighbors only on one side, and therefore all the strong nuclear 
forces acting on it are tending to pull it back in. Although all the 
electrical forces from the other five protons (dark arrows) are all 
pushing it out of the nucleus, they are not sufficient to overcome 
the strong nuclear forces. 


In a very heavy nucleus, r/3, a proton that finds itself near the 
edge has only a few neighbors close enough to attract it significantly 
via the strong nuclear force, but every other proton in the nucleus 
exerts a repulsive electrical force on it. If the nucleus is large enough, 
the total electrical repulsion may be sufficient to overcome the at- 
traction of the strong force, and the nucleus may spit out a proton. 
Proton emission is fairly rare, however; a more common type of ra- 
dioactive decay! in heavy nuclei is alpha decay, shown in r/4. The 


‘Alpha decay is more common for statistical reasons. An alpha particle hap- 


r/1. The forces on the proton 
cancel. 2. The forces don’t can- 
cel. 3. In a heavy nucleus, the 
large number of electrical repul- 
sions can add up to a force that is 
comparable to the strong nuclear 
attraction. 4. Alpha emission. 5. 
Fission. 
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imbalance of the forces is similar, but the chunk that is ejected is an 
alpha particle (two protons and two neutrons) rather than a single 
proton. 


It is also possible for the nucleus to split into two pieces of 
roughly equal size, r/5, a process known as fission. Note that in 
addition to the two large fragments, there is a spray of individual 
neutrons. In a nuclear fission bomb or a nuclear fission reactor, 
some of these neutrons fly off and hit other nuclei, causing them to 
undergo fission as well. The result is a chain reaction. 


When a nucleus is able to undergo one of these processes, it is 
said to be radioactive, and to undergo radioactive decay. Some of 
the naturally occurring nuclei on earth are radioactive. The term 
“radioactive” comes from Becquerel’s image of rays radiating out 
from something, not from radio waves, which are a whole differ- 
ent phenomenon. The term “decay” can also be a little misleading, 
since it implies that the nucleus turns to dust or simply disappears 
— actually it is splitting into two new nuclei with an the same total 
number of neutrons and protons, so the term “radioactive transfor- 
mation” would have been more appropriate. Although the original 
atom’s electrons are mere spectators in the process of weak radioac- 
tive decay, we often speak loosely of “radioactive atoms” rather than 
“radioactive nuclei.” 


Randomness in physics 


How does an atom decide when to decay? We might imagine 
that it is like a termite-infested house that gets weaker and weaker, 
until finally it reaches the day on which it is destined to fall apart. 
Experiments, however, have not succeeded in detecting such “tick- 
ing clock” hidden below the surface; the evidence is that all atoms 
of a given isotope are absolutely identical. Why, then, would one 
uranium atom decay today while another lives for another million 
years? The answer appears to be that it is entirely random. We 
can make general statements about the average time required for a 
certain isotope to decay, or how long it will take for half the atoms 
in a sample to decay (its half-life), but we can never predict the 
behavior of a particular atom. 


This is the first example we have encountered of an inescapable 
randomness in the laws of physics. If this kind of randomness makes 
you uneasy, you’re in good company. Einstein’s famous quote is 
“I am convinced that He [God] does not play dice.“ Einstein’s 
distaste for randomness, and his association of determinism with 
divinity, goes back to the Enlightenment conception of the universe 


pens to be a very stable arrangement, with an unusually low interaction energy 
due to the strong nuclear interactions within it. This leaves a lot of spare en- 
ergy for its center of mass motion, which gives it a greater selection of possible 
momentum vectors. The idea of a “greater selection” can be made more precise 
using the concept of phase space, discussed in section 5.4. 
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as a gigantic piece of clockwork that only had to be set in motion 
initially by the Builder. Physics had to be entirely rebuilt in the 20th 
century to incorporate the fundamental randomness of physics, and 
this modern revolution is the topic of chapter 13. In particular, 
we will delay the mathematical development of the half-life concept 
until then. 


8.2.6 The weak nuclear force; beta decay 


All the nuclear processes we’ve discussed so far have involved 
rearrangements of neutrons and protons, with no change in the total 
number of neutrons or the total number of protons. Now consider 
the proportions of neutrons and protons in your body and in the 
planet earth: neutrons and protons are roughly equally numerous 
in your body’s carbon and oxygen nuclei, and also in the nickel and 
iron that make up most of the earth. The proportions are about 
50-50. But the only chemical elements produced in any significant 
quantities by the big bang were hydrogen (about 90%) and helium 
(about 10%). If the early universe was almost nothing but hydrogen 
atoms, whose nuclei are protons, where did all those neutrons come 
from? 


The answer is that there is another nuclear force, the weak nu- 
clear force, that is capable of transforming neutrons into protons 
and vice-versa. Two possible reactions are 


nopte +7 [electron decay] 


and 
ponte’ +p ; [positron decay] 


(There is also a third type called electron capture, in which a proton 
grabs one of the atom’s electrons and they produce a neutron and 
a neutrino.) 


Whereas alpha decay and fission are just a redivision of the pre- 
viously existing particles, these reactions involve the destruction of 
one particle and the creation of three new particles that did not 
exist before. 


There are three new particles here that you have never previ- 
ously encountered. The symbol et stands for an antielectron, which 
is a particle just like the electron in every way, except that its elec- 
tric charge is positive rather than negative. Antielectrons are also 
known as positrons. Nobody knows why electrons are so common in 
the universe and antielectrons are scarce. When an antielectron en- 
counters an electron, they annihilate each other, producing gamma 
rays, and this is the fate of all the antielectrons that are produced 
by natural radioactivity on earth. Antielectrons are an example of 
antimatter. A complete atom of antimatter would consist of antipro- 
tons, antielectrons, and antineutrons. Although individual particles 
of antimatter occur commonly in nature due to natural radioactivity 
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and cosmic rays, only a few complete atoms of antihydrogen have 
ever been produced artificially. 


The notation v stands for a particle called a neutrino, and 7 
means an antineutrino. Neutrinos and antineutrinos have no electric 
charge (hence the name). 


We can now list all four of the known fundamental forces of 
physics: 


e gravity 
e electromagnetism 
e strong nuclear force 


e weak nuclear force 


The other forces we have learned about, such as friction and the 
normal force, all arise from electromagnetic interactions between 
atoms, and therefore are not considered to be fundamental forces of 
physics. 


Decay of ?!*Pb example 7 
As an example, consider the radioactive isotope of lead @'?Pb. It 
contains 82 protons and 130 neutrons. It decays by the process 
n—+ p+e +v. The newly created proton is held inside the 
nucleus by the strong nuclear force, so the new nucleus contains 
83 protons and 129 neutrons. Having 83 protons makes it the 
element bismuth, so it will be an atom of 2'2Bi. 


In a reaction like this one, the electron flies off at high speed 
(typically close to the speed of light), and the escaping electrons 
are the things that make large amounts of this type of radioactivity 
dangerous. The outgoing electron was the first thing that tipped 
off scientists in the early 1900s to the existence of this type of ra- 
dioactivity. Since they didn’t know that the outgoing particles were 
electrons, they called them beta particles, and this type of radioac- 
tive decay was therefore known as beta decay. A clearer but less 
common terminology is to call the two processes electron decay and 
positron decay. 


The neutrino or antineutrino emitted in such a reaction pretty 
much ignores all matter, because its lack of charge makes it immune 
to electrical forces, and it also remains aloof from strong nuclear 
interactions. Even if it happens to fly off going straight down, it 
is almost certain to make it through the entire earth without in- 
teracting with any atoms in any way. It ends up flying through 
outer space forever. The neutrino’s behavior makes it exceedingly 
difficult to detect, and when beta decay was first discovered nobody 
realized that neutrinos even existed. We now know that the neu- 
trino carries off some of the energy produced in the reaction, but at 
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the time it seemed that the total energy afterwards (not counting 
the unsuspected neutrino’s energy) was greater than the total en- 
ergy before the reaction, violating conservation of energy. Physicists 
were getting ready to throw conservation of energy out the window 
as a basic law of physics when indirect evidence led them to the 
conclusion that neutrinos existed. 


The solar neutrino problem 


What about these neutrinos? Why haven’t you heard of them 
before? It’s not because they’re rare — a billion neutrinos pass 
through your body every microsecond, but until recently almost 
nothing was known about them. Produced as a side-effect of the 
nuclear reactions that power our sun and other stars, these ghostlike 
bits of matter are believed to be the most numerous particles in the 
universe. But they interact so weakly with ordinary matter that 
nearly all the neutrinos that enter the earth on one side will emerge 
from the other side of our planet without even slowing down. 


Our first real peek at the properties of the elusive neutrino has 
come from a huge detector in a played-out Japanese zinc mine, s. An 
international team of physicists outfitted the mineshaft with wall- 
to-wall light sensors, and then filled the whole thing with water so 
pure that you can see through it for a hundred meters, compared to 
only a few meters for typical tap water. Neutrinos stream through 
the 50 million liters of water continually, just as they flood every- 
thing else around us, and the vast majority never interact with a 
water molecule. A very small percentage, however, do annihilate 
themselves in the water, and the tiny flashes of light they produce 
can be detected by the beachball-sized vacuum tubes that line the 
darkened mineshaft. Most of the neutrinos around us come from 
the sun, but for technical reasons this type of water-based detector 
is more sensitive to the less common but more energetic neutrinos 
produced when cosmic ray particles strike the earth’s atmosphere. 


Neutrinos were already known to come in three “flavors,” which 
can be distinguished from each other by the particles created when 
they collide with matter. An “electron-flavored neutrino” creates an 
ordinary electron when it is annihilated, while the two other types 
create more exotic particles called mu and tau particles. Think of 
the three types of neutrinos as chocolate, vanilla, and strawberry. 
When you buy a chocolate ice cream cone, you expect that it will 
keep being chocolate as you eat it. The unexpected finding from 
the Japanese experiment is that some of the neutrinos are changing 
flavor between the time when they are produced by a cosmic ray and 
the moment when they wink out of existence in the water. It’s as 
though your chocolate ice cream cone transformed itself magically 
into strawberry while your back was turned. 


How did the physicists figure out the change in flavor? The 
experiment detects some neutrinos originating in the atmosphere 


s/This neutrino detector 
in the process of being filled 
ultrapure water. 
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above Japan, and also many neutrinos coming from distant parts of 
the earth. A neutrino created above the Atlantic Ocean arrives in 
Japan from underneath, and the experiment can distinguish these 
upward-traveling neutrinos from the downward-moving local vari- 
ety. They found that the mixture of neutrinos coming from below 
was different from the mixture arriving from above, with some of 
the electron-flavored and tau-flavored neutrinos having apparently 
changed into mu-flavored neutrinos during their voyage through the 
earth. The ones coming from above didn’t have time to change 
flavors on their much shorter journey. 


This is interpreted as evidence that the neutrinos are constantly 
changing back and forth among the three flavors. On theoretical 
grounds, it is believed that such a vibration can only occur if neu- 
trinos have mass. Only a rough estimate of the mass is possible at 
this point: it appears that neutrinos have a mass somewhere in the 
neighborhood of one billionth of the mass of an electron, or about 
10? kee: 


If the neutrino’s mass is so tiny, does it even matter? It matters 
to astronomers. Neutrinos are the only particles that can be used 
to probe certain phenomena. For example, they are the only direct 
probes we have for testing our models of the core of our own sun, 
which is the source of energy for all life on earth. 


A In the reactions n > p+e”  +v and p > n+e*+v, verify that 
charge is conserved. In beta decay, when one of these reactions happens 
to a neutron or proton within a nucleus, one or more gamma rays may 
also be emitted. Does this affect conservation of charge? Would it be 
possible for some extra electrons to be released without violating charge 
conservation? 


B When an antielectron and an electron annihilate each other, they 
produce two gamma rays. Is charge conserved in this reaction? 


8.2.7 Fusion 


As we have seen, heavy nuclei tend to fly apart because each 
proton is being repelled by every other proton in the nucleus, but is 
only attracted by its nearest neighbors. The nucleus splits up into 
two parts, and as soon as those two parts are more than about 1 fm 
apart, the strong nuclear force no longer causes the two fragments to 
attract each other. The electrical repulsion then accelerates them, 
causing them to gain a large amount of kinetic energy. This release 
of kinetic energy is what powers nuclear reactors and fission bombs. 


It might seem, then, that the lightest nuclei would be the most 
stable, but that is not the case. Let’s compare an extremely light 
nucleus like He with a somewhat heavier one, !©O. A neutron or 
proton in He can be attracted by the three others, but in 1°O, it 
might have five or six neighbors attracting it. The '®O nucleus is 
therefore more stable. 
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It turns out that the most stable nuclei of all are those around 
nickel and iron, having about 30 protons and 30 neutrons. Just as a 
nucleus that is too heavy to be stable can release energy by splitting 
apart into pieces that are closer to the most stable size, light nuclei 
can release energy if you stick them together to make bigger nuclei 
that are closer to the most stable size. Fusing one nucleus with 
another is called nuclear fusion. Nuclear fusion is what powers our 
sun and other stars. 


8.2.8 Nuclear energy and binding energies 


In the same way that chemical reactions can be classified as 
exothermic (releasing energy) or endothermic (requiring energy to 
react), so nuclear reactions may either release or use up energy. The 
energies involved in nuclear reactions are greater by a huge factor. 
Thousands of tons of coal would have to be burned to produce as 
much energy as would be produced in a nuclear power plant by one 
kg of fuel. 


Although nuclear reactions that use up energy (endothermic 
reactions) can be initiated in accelerators, where one nucleus is 
rammed into another at high speed, they do not occur in nature, not 
even in the sun. The amount of kinetic energy required is simply 
not available. 


To find the amount of energy consumed or released in a nuclear 
reaction, you need to know how much nuclear interaction energy, 
Unuc, was stored or released. Experimentalists have determined the 
amount of nuclear energy stored in the nucleus of every stable el- 
ement, as well as many unstable elements. This is the amount of 
mechanical work that would be required to pull the nucleus apart 
into its individual neutrons and protons, and is known as the nuclear 
binding energy. 


A reaction occurring in the sun example 8 
The sun produces its energy through a series of nuclear fusion 
reactions. One of the reactions is 


1H 42H 33 He+y 


The excess energy is almost all carried off by the gamma ray (not 
by the kinetic energy of the helium-3 atom). The binding energies 
in units of pJ (picojoules) are: 


1H OJ 
2H 0.35593 pJ 
8He 1.23489 pJ 


The total initial nuclear energy is 0 pJ+0.35593 pJ, and the final 
nuclear energy is 1.23489 pJ, so by conservation of energy, the 
gamma ray must carry off 0.87896 pJ of energy. The gamma ray 
is then absorbed by the sun and converted to heat. 
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Our sun’s source of energy is nuclear fusion, so nuclear fusion is also the source of power for all 


life on earth, including, 2, this rain forest in Fatu-Hiva. 3. The first release of energy by nuclear fusion through 
human technology was the 1952 Ivy Mike test at the Enewetak Atoll. 4. This array of gamma-ray detectors is 
called GAMMASPHERE. During operation, the array is closed up, and a beam of ions produced by a particle 
accelerator strikes a target at its center, producing nuclear fusion reactions. The gamma rays can be studied for 
information about the structure of the fused nuclei, which are typically varieties not found in nature. 5. Nuclear 
fusion promises to be a clean, inexhaustible source of energy. However, the goal of commercially viable nuclear 
fusion power has remained elusive, due to the engineering difficulties involved in magnetically containing a 
plasma (ionized gas) at a sufficiently high temperature and density. This photo shows the experimental JET 
reactor, with the device opened up on the left, and in action on the right. 
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Why is the binding energy of 'H exactly equal to zero? > Answer, p. 
870 


By the way, if you add up the masses of the three particles 
produced in the reaction n > p+e” +, you will find that they 
do not equal the mass of the neutron, so mass is not conserved. An 
even more blatant example is the annihilation of an electron with 
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a positron, e~ + et + 4, in which the original mass is completely 
destroyed, since gamma rays have no mass. Nonconservation of mass 
is not just a property of nuclear reactions. It also occurs in chemical 
reactions, but the change in mass is too small to detect with ordinary 
laboratory balances. The reason why mass is not being conserved 
is that mass is being converted to energy, according to Einstein’s 
celebrated equation E = mc?. This was discussed briefly in section 
1.4, and in more depth in section 7.3. 
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Figure u is a compact way of showing the vast variety of the 
nuclei. Each box represents a particular number of neutrons and 
protons. The black boxes are nuclei that are stable, i.e., that would 
require an input of energy in order to change into another. The 
gray boxes show all the unstable nuclei that have been studied ex- 
perimentally. Some of these last for billions of years on the aver- 
age before decaying and are found in nature, but most have much 
shorter average lifetimes, and can only be created and studied in 
the laboratory. 


The curve along which the stable nuclei lie is called the line of 
stability. Nuclei along this line have the most stable proportion 
of neutrons to protons. For light nuclei the most stable mixture 
is about 50-50, but we can see that stable heavy nuclei have two 
or three times more neutrons than protons. This is because the 
electrical repulsions of all the protons in a heavy nucleus add up 
to a powerful force that would tend to tear it apart. The presence 
of a large number of neutrons increases the distances among the 
protons, and also increases the number of attractions due to the 
strong nuclear force. 


u/The known nuclei, repre- 
sented on a chart of proton 
number versus neutron number. 
Note the two nuclei in the bottom 
row with zero protons. One is 
simply a single neutron. The 
other is a cluster of four neu- 
trons. This “tetraneutron” was 
reported, unexpectedly, to be a 
bound system in results from a 
2002 experiment. The result is 
controversial. If correct, it implies 
the existence of a heretofore 
unsuspected type of matter, 
the neutron droplet, which we 
can think of as an atom with no 
protons or electrons. 
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v/A map showing levels of 
radiation near the site of the 


Chernobyl disaster. At. the 
boundary of the most highly 
contaminated (bright red) areas, 
people would be exposed to 
about 13,000 uSv per year, or 
about four times the natural back- 
ground level. In the pink areas, 
which are still densely populated, 
the exposure is comparable 
to the natural level found in a 
high-altitude city such as Denver. 


8.2.9 Biological effects of ionizing radiation 


As a science educator, I find it frustrating that nowhere in the 
massive amount of journalism devoted to the Chernobyl disaster 
does one ever find any numerical statements about the amount of 
radiation to which people have been exposed. Anyone mentally 
capable of understanding sports statistics or weather reports ought 
to be able to understand such measurements, as long as something 
like the following explanatory text was inserted somewhere in the 
article: 


Radiation exposure is measured in units of Sieverts (Sv). The 
average person is exposed to about 2000 ~wSv (microSieverts) each 
year from natural background sources. 


With this context, people would be able to come to informed 
conclusions based on statements such as, “Children in Finland re- 
ceived an average dose of ___________ Sv above natural background 
levels because of the Chernoby] disaster.” 


What is a Sievert? It measures the amount of energy per kilo- 
gram deposited in the body by ionizing radiation, multiplied by a 
“quality factor” to account for the different health hazards posed 
by alphas, betas, gammas, neutrons, and other types of radiation. 
Only ionizing radiation is counted, since nonionizing radiation sim- 
ply heats one’s body rather than killing cells or altering DNA. For 
instance, alpha particles are typically moving so fast that their ki- 
netic energy is sufficient to ionize thousands of atoms, but it is 
possible for an alpha particle to be moving so slowly that it would 
not have enough kinetic energy to ionize even one atom. 


Notwithstanding the pop culture images of the Incredible Hulk 
and Godzilla, it is not possible for a multicellular animal to become 
“mutated” as a whole. In most cases, a particle of ionizing radiation 
will not even hit the DNA, and even if it does, it will only affect 
the DNA of a single cell, not every cell in the animal’s body. Typ- 
ically, that cell is simply killed, because the DNA becomes unable 
to function properly. Once in a while, however, the DNA may be 
altered so as to make that cell cancerous. For instance, skin cancer 
can be caused by UV light hitting a single skin cell in the body of 
a sunbather. If that cell becomes cancerous and begins reproducing 
uncontrollably, she will end up with a tumor twenty years later. 


Other than cancer, the only other dramatic effect that can result 
from altering a single cell’s DNA is if that cell happens to be a 
sperm or ovum, which can result in nonviable or mutated offspring. 
Men are relatively immune to reproductive harm from radiation, 
because their sperm cells are replaced frequently. Women are more 
vulnerable because they keep the same set of ova as long as they 
live. 


A whole-body exposure of 5,000,000 Sv will kill a person within 
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a week or so. Luckily, only a small number of humans have ever been 
exposed to such levels: one scientist working on the Manhattan 
Project, some victims of the Nagasaki and Hiroshima explosions, 
and 31 workers at Chernobyl. Death occurs by massive killing of 
cells, especially in the blood-producing cells of the bone marrow. 


Lower levels, on the order of 1,000,000 pSv, were inflicted on 
some people at Nagasaki and Hiroshima. No acute symptoms result 
from this level of exposure, but certain types of cancer are signifi- 
cantly more common among these people. It was originally expected 
that the radiation would cause many mutations resulting in birth 
defects, but very few such inherited effects have been observed. 


A great deal of time has been spent debating the effects of very 
low levels of ionizing radiation. The following table gives some sam- 
ple figures. 


maximum beneficial dose per day ~ 10,000 pSv 
CT scan ~ 10,000 pSv 
natural background per year 2,000-7,000 pSv 
health guidelines for exposure to a fetus 1,000 pSv 
flying from New York to Tokyo 150 puSv 
chest x-ray 50 uSv 


Note that the largest number, on the first line of the table, is the 
maximum beneficial dose. The most useful evidence comes from 
experiments in animals, which can intentionally be exposed to sig- 
nificant and well measured doses of radiation under controlled con- 
ditions. Experiments show that low levels of radiation activate cel- 
lular damage control mechanisms, increasing the health of the or- 
ganism. For example, exposure to radiation up to a certain level 
makes mice grow faster; makes guinea pigs’ immune systems func- 
tion better against diptheria; increases fertility in trout and mice; 
improves fetal mice’s resistance to disease; increases the life-spans of 
flour beetles and mice; and reduces mortality from cancer in mice. 
This type of effect is called radiation hormesis. 


There is also some evidence that in humans, small doses of ra- 
diation increase fertility, reduce genetic abnormalities, and reduce 
mortality from cancer. The human data, however, tend to be very 
poor compared to the animal data. Due to ethical issues, one cannot 
do controlled experiments in humans. For example, one of the best 
sources of information has been from the survivors of the Hiroshima 
and Nagasaki bomb blasts, but these people were also exposed to 
high levels of carcinogenic chemicals in the smoke from their burning 
cities; for comparison, firefighters have a heightened risk of cancer, 
and there are also significant concerns about cancer from the 9/11 
attacks in New York. The direct empirical evidence about radiation 
hormesis in humans is therefore not good enough to tell us anything 
unambiguous,” and the most scientifically reasonable approach is to 


?For two opposing viewpoints, see Tubiana et al., “The Linear No-Threshold 


growth rate relative 
to control group 


0 40,000 80,000 
radiation dose (mSv/day) 


w/A_ typical example of radi- 
ation hormesis: the health of 
mice is improved by low levels 
of radiation. In this study, young 
mice were exposed to fairly high 
levels of x-rays, while a control 
group of mice was not exposed. 
The mice were weighed, and 
their rate of growth was taken 
as a measure of their health. At 
levels below about 50,000 uSv, 
the radiation had a_ beneficial 
effect on the health of the mice, 
presumably by activating cellular 
damage control mechanisms. 
The two highest data points 
are statistically significant at the 
99% level. The curve is a fit to 
a theoretical model. Redrawn 
from T.D. Luckey, Hormesis with 
lonizing Radiation, CRC Press, 
1980. 
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x/Wild  Przewalski’s horses 
prosper in the Chernobyl area. 


y/ Fossil fuels have done _in- 
comparably more damage to the 
environment than nuclear power 
ever has. Polar bears’ habitat is 
rapidly being destroyed by global 
warming. 


assume that the results in animals also hold for humans: small doses 
of radiation in humans are beneficial, rather than harmful. However, 
a variety of cultural and historical factors have led to a situation in 
which public health policy is based on the assumption, known as 
“linear no-threshold” (LNT), that even tiny doses of radiation are 
harmful, and that the risk they carry is proportional to the dose. 
In other words, law and policy are made based on the assumption 
that the effects of radiation on humans are dramatically different 
than its effects on mice and guinea pigs. Even with the unrealis- 
tic assumption of LNT, one can still evaluate risks by comparing 
with natural background radiation. For example, we can see that 
the effect of a chest x-ray is about a hundred times smaller than 
the effect of spending a year in Colorado, where the level of natural 
background radiation from cosmic rays is higher than average, due 
to the high altitude. Dropping the implausible LNT assumption, we 
can see that the impact on one’s health of spending a year in Col- 
orado is likely to be positive, because the excess radiation is below 
the maximum beneficial level. 


In the late twentieth century, antinuclear activists largely suc- 
ceeded in bringing construction of new nuclear power plants to a 
halt in the U.S. Ironically, we now know that the burning of fossil 
fuels, which leads to global warming, is a far more grave threat to 
the environment than even the Chernobyl disaster. A team of bi- 
ologists writes: “During recent visits to Chernobyl, we experienced 
numerous sightings of moose (Alces alces), roe deer (Capreol capre- 
olus), Russian wild boar (Sus scrofa), foxes (Vulpes vulpes), river 
otter (Lutra canadensis), and rabbits (Lepus europaeus) ... Diver- 
sity of flowers and other plants in the highly radioactive regions is 
impressive and equals that observed in protected habitats outside 
the zone ... The observation that typical human activity (industrial- 
ization, farming, cattle raising, collection of firewood, hunting, etc.) 
is more devastating to biodiversity and abundance of local flora and 
fauna than is the worst nuclear power plant disaster validates the 
negative impact the exponential growth of human populations has 
on wildlife.”*? Nuclear power is the only source of energy that is 
sufficient to replace any significant percentage of energy from fossil 
fuels on the rapid schedule demanded by the speed at which global 
warming is progressing. People worried about the downside of nu- 
clear energy might be better off putting their energy into issues 


Relationship Is Inconsistent with Radiation Biologic and Experimental Data,” 
Radiology, 251 (2009) 13 and Little et al., “ Risks Associated with Low Doses 
and Low Dose Rates of Ionizing Radiation: Why Linearity May Be (Almost) the 
Best We Can Do,” Radiology, 251 (2009) 6. 

’Baker and Chesser, Env. Toxicology and Chem. 19 (1231) 2000. Similar 
effects have been seen at the Bikini Atoll, the site of a 1954 hydrogen bomb test. 
Although some species have disappeared from the area, the coral reef is in many 
ways healthier than similar reefs elsewhere, because humans have tended to stay 
away for fear of radiation (Richards et al., Marine Pollution Bulletin 56 (2008) 
503). 
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related to nuclear weapons: the poor stewardship of the former So- 
viet Union’s warheads; nuclear proliferation in unstable states such 
as Pakistan; and the poor safety and environmental history of the 
superpowers’ nuclear weapons programs, including the loss of sev- 
eral warheads in plane crashes, and the environmental disaster at 
the Hanford, Washington, weapons plant. 


8.2.10 The creation of the elements 
Creation of hydrogen and helium in the Big Bang 


We have discussed in chapter 6 the evidence for the Big Bang 
theory of the origin of the universe. Did all the chemical elements 
we’re made of come into being in the Big Bang? Temperatures in 
the first microseconds after the Big Bang were so high that atoms 
and nuclei could not hold together at all. After things had cooled 
down enough for nuclei and atoms to exist, there was a period of 
about three minutes during which the temperature and density were 
high enough for fusion to occur, but not so high that atoms could 
hold together. We have a good, detailed understanding of the laws 
of physics that apply under these conditions, so theorists are able 
to say with confidence that the only element heavier than hydrogen 
that was created in significant quantities was helium. 


We are stardust 


In that case, where did all the other elements come from? As- 
tronomers came up with the answer. By studying the combinations 
of wavelengths of light, called spectra, emitted by various stars, 
they had been able to determine what kinds of atoms they con- 
tained. (We will have more to say about spectra at the end of this 
book.) They found that the stars fell into two groups. One type 
was nearly 100% hydrogen and helium, while the other contained 
99% hydrogen and helium and 1% other elements. They interpreted 
these as two generations of stars. The first generation had formed 
out of clouds of gas that came fresh from the big bang, and their 
composition reflected that of the early universe. The nuclear fusion 
reactions by which they shine have mainly just increased the pro- 
portion of helium relative to hydrogen, without making any heavier 
elements. 


The members of the first generation that we see today, however, 
are only those that lived a long time. Small stars are more miserly 
with their fuel than large stars, which have short lives. The large 
stars of the first generation have already finished their lives. Near 
the end of its lifetime, a star runs out of hydrogen fuel and undergoes 
a series of violent and spectacular reorganizations as it fuses heavier 
and heavier elements. Very large stars finish this sequence of events 
by undergoing supernova explosions, in which some of their material 
is flung off into the void while the rest collapses into an exotic object 
such as a black hole or neutron star. 


z/ The Crab Nebula is a remnant 
of a supernova explosion.  Al- 
most all the elements our planet 
is made of originated in such 
explosions. 
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aa/Construction of the UNI- 
LAC accelerator in Germany, one 
of whose uses is for experiments 
to create very heavy artificial 
elements. In such an experiment, 
fusion products recoil through a 
device called SHIP (not shown) 
that separates them based on 
their charge-to-mass ratios — 
it is essentially just a scaled-up 
version of Thomson’s appara- 
tus. A typical experiment runs 
for several months, and out of 
the billions of fusion reactions 
induced during this time, only 
one or two may result in the 
production of superheavy atoms. 
In all the rest, the fused nucleus 
breaks up immediately. SHIP 
is used to identify the small 
number of “good” reactions and 
separate them from this intense 
background. 


The second generation of stars, of which our own sun is an exam- 
ple, condensed out of clouds of gas that had been enriched in heavy 
elements due to supernova explosions. It is those heavy elements 
that make up our planet and our bodies. 


Artificial synthesis of heavy elements 


Elements up to uranium, atomic number 92, were created by 
these astronomical processes. Beyond that, the increasing electrical 
repulsion of the protons leads to shorter and shorter half-lives. Even 
if a supernova a billion years ago did create some quantity of an 
element such as Berkelium, number 97, there would be none left in 
the Earth’s crust today. The heaviest elements have all been created 
by artificial fusion reactions in accelerators. As of 2006, the heaviest 
element that has been created is 116 — two atoms of it!* 


Although the creation of a new element, i.e., an atom with a 
novel number of protons, has historically been considered a glam- 
orous accomplishment, to the nuclear physicist the creation of an 
atom with a hitherto unobserved number of neutrons is equally im- 
portant. The greatest neutron number reached so far is 179. One 
tantalizing goal of this type of research is the theoretical prediction 
that there might be an island of stability beyond the previously ex- 
plored tip of the chart of the nuclei shown in section 8.2.8. Just as 
certain numbers of electrons lead to the chemical stability of the no- 
ble gases (helium, neon, argon, ...), certain numbers of neutrons and 
protons lead to a particularly stable packing of orbits. Calculations 
dating back to the 1960’s have hinted that there might be relatively 
stable nuclei having approximately 114 protons and 184 neutrons. 
The isotopes of elements 114 and 116 that have been produced so 
far have had half-lives in the second or millosecond range. This 
may not seem like very long, but lifetimes in the microsecond range 
are more typical for the superheavy elements that have previously 
been discovered. There is even speculation that certain superheavy 
isotopes would be stable enough to be produced in quantities that 
could for instance be weighed and used in chemical reactions. 


This chapter is summarized on page 899. Notation and terminology 
are tabulated on pages 885-886. 


Discussion Questions 


A Should the quality factor for neutrinos be very small, because they 
mostly don’t interact with your body? 


B Would an alpha source be likely to cause different types of cancer 
depending on whether the source was external to the body or swallowed 
in contaminated food? What about a gamma source? 


“An earlier claim of the creation of element 116 by a group at Berkeley turned 
out to be a case of scientific fraud, but the element was later produced by a 
different group, at Dubna, Russia. 
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Problems 


The symbols Vv, “, etc. are explained on page 464. 


1 The figure shows a neuron, which is the type of cell your nerves 
are made of. Neurons serve to transmit sensory information to the 
brain, and commands from the brain to the muscles. All this data 
is transmitted electrically, but even when the cell is resting and not 
transmitting any information, there is a layer of negative electrical 
charge on the inside of the cell membrane, and a layer of positive 
charge just outside it. This charge is in the form of various ions 
dissolved in the interior and exterior fluids. Why would the negative 
charge remain plastered against the inside surface of the membrane, 
and likewise why doesn’t the positive charge wander away from the 
outside surface? 


2 The Earth and Moon are bound together by gravity. If, in- 
stead, the force of attraction were the result of each having a charge 
of the same magnitude but opposite in sign, find the quantity of 
charge that would have to be placed on each to produce the re- 
quired force. Vv 


3 A helium atom finds itself momentarily in this arrangement. 
Find the direction and magnitude of the force acting on the right- 
hand electron. The two protons in the nucleus are so close together 
(~ 1 fm) that you can consider them as being right on top of each 
other. 


Vv o 


4 234Py decays either by electron decay or by alpha decay. (A 
given 7°4Pu nucleus may do either one; it’s random.) What are the 
isotopes created as products of these two modes of decay? gE 


5 Suppose that a proton in a lead nucleus wanders out to the 
surface of the nucleus, and experiences a strong nuclear force of 
about 8 kN from the nearby neutrons and protons pulling it back 
in. Compare this numerically to the repulsive electrical force from 
the other protons, and verify that the net force is attractive. A 
lead nucleus is very nearly spherical, is about 6.5 fm in radius, and 
contains 82 protons, each with a charge of +e, where e = 1.60 x 
it aa © vo 


6 The nuclear process of beta decay by electron capture is de- 
scribed parenthetically on page 449. The reaction is p+e7 > n-+v. 
(a) Show that charge is conserved in this reaction. 

(b) Conversion between energy and mass is discussed in chapter 7. 
Based on these ideas, explain why electron capture doesn’t occur in 
hydrogen atoms. (If it did, matter wouldn’t exist!) E 


7 In the semifinals of an electrostatic croquet tournament, Jes- 
sica hits her positively charged ball, sending it across the playing 
field, rolling to the left along the x axis. It is repelled by two other 
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positive charges. These two charges are fixed on the y axis at the 
locations shown in the figure. The two fixed charges are equal to 
each other, but not to the charge of the ball. 

(a) Express the force on the ball in terms of the ball’s position, x. 
(b) At what value of x does the ball experience the greatest de- 
celeration? Express you answer in terms of b. (Assume the ball 
has enough energy to continue through, so you don’t have to worry 
about whether it actually gets as far as this particular value of 2.) 
(Based on a problem by Halliday and Resnick.) z 


8 Suppose that at some instant in time, a wire extending from 
x = 0 to x = o holds a charge density, in units of coulombs per 
meter, given by ae”. This type of charge density, dq/da, is typi- 
cally notated as \ (Greek letter lambda). Find the total charge on 
the wire. Vv Oo 


9 Use the nutritional information on some packaged food to 
make an order-of-magnitude estimate of the amount of chemical 
energy stored in one atom of food, in units of joules. Assume that 
a typical atom has a mass of 1077° kg. This constitutes a rough 
estimate of the amounts of energy there are on the atomic scale. 
[See chapter 0 for help on how to do order-of-magnitude estimates. 
Note that a nutritional “calorie” is really a kilocalorie; see page 
887.| vo 


10 = ‘The helium atom of problem 3 has some new experiences, goes 
through some life changes, and later on finds itself in the configura- 
tion shown here. What are the direction and magnitude of the force 
acting on the bottom electron? (Draw a sketch to make clear the 
definition you are using for the angle that gives direction.) V & 


11 A neon light consists of a long glass tube full of neon, with 
metal caps on the ends. Positive charge is placed on one end of the 
tube, and negative charge on the other. The electric forces generated 
can be strong enough to strip electrons off of a certain number of 
neon atoms. Assume for simplicity that only one electron is ever 
stripped off of any neon atom. When an electron is stripped off of 
an atom, both the electron and the neon atom (now an ion) have 
electric charge, and they are accelerated by the forces exerted by the 
charged ends of the tube. (They do not feel any significant forces 
from the other ions and electrons within the tube, because only 
a tiny minority of neon atoms ever gets ionized.) Light is finally 
produced when ions are reunited with electrons. Give a numerical 
comparison of the magnitudes and directions of the accelerations of 
the electrons and ions. [You may need some data from appendix 5.] 


~ 


12 The subatomic particles called muons behave exactly like 
electrons, except that a muon’s mass is greater by a factor of 206.77. 
Muons are continually bombarding the Earth as part of the stream 
of particles from space known as cosmic rays. When a muon strikes 
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an atom, it can displace one of its electrons. If the atom happens 
to be a hydrogen atom, then the muon takes up an orbit that is 
on the average 206.77 times closer to the proton than the orbit of 
the ejected electron. How many times greater is the electric force 


experienced by the muon than that previously felt by the electron? 
JV 


13 (a) Recall that the gravitational energy of two gravitationally 
interacting spheres is given by U, = —Gmj,mz2/r, where r is the 
center-to-center distance. What would be the analogous equation 
for two electrically interacting spheres? Justify your choice of a 
plus or minus sign on physical grounds, considering attraction and 
repulsion. Vv 
(b) Use this expression to estimate the energy required to pull apart 
a raisin-cookie atom of the one-electron type, assuming a radius of 
10? m: Vv 
(c) Compare this with the result of problem 9. 5 


14 If you put two hydrogen atoms near each other, they will feel 
an attractive force, and they will pull together to form a molecule. 
(Molecules consisting of two hydrogen atoms are the normal form 
of hydrogen gas.) Why do they feel a force if they are near each 
other, since each is electrically neutral? Shouldn’t the attractive 
and repulsive forces all cancel out exactly? Use the raisin cookie 
model. (Students who have taken chemistry often try to use fancier 
models to explain this, but if you can’t explain it using a simple 
model, you probably don’t understand the fancy model as well as 
you thought you did!) i 


15 The figure shows one layer of the three-dimensional structure 
of a salt crystal. The atoms extend much farther off in all directions, 
but only a six-by-six square is shown here. The larger circles are 
the chlorine ions, which have charges of —e. The smaller circles are 
sodium ions, with charges of +e. The distance between neighboring 
ions is about 0.3 nm. Real crystals are never perfect, and the crystal 
shown here has two defects: a missing atom at one location, and an 
extra lithium atom, shown as a grey circle, inserted in one of the 
small gaps. If the lithium atom has a charge of +e, what is the 
direction and magnitude of the total force on it? Assume there are 
no other defects nearby in the crystal besides the two shown here. 

> Hint, p. 865 Vie 


16 [See chapter ?? for help on how to do order-of-magnitude 
estimates.] Suppose you are holding your hands in front of you, 10 
cm apart. 


(a) Estimate the total number of electrons in each hand. Vv 
(b) Estimate the total repulsive force of all the electrons in one hand 
on all the electrons in the other. Vv 


(c) Why don’t you feel your hands repelling each other? 
(d) Estimate how much the charge of a proton could differ in mag- 
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nitude from the charge of an electron without creating a noticeable 
force between your hands. lee 


Key to symbols: 
easy © typical challenging Mdifficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Chapter 9 
Circuits 


Madam, what good is a baby? Michael Faraday, when asked by 
Queen Victoria what the electrical devices in his lab were good for 


A few years ago, my wife and I bought a house with Character, 
Character being a survival mechanism that houses have evolved in 
order to convince humans to agree to much larger mortgage pay- 
ments than they’d originally envisioned. Anyway, one of the fea- 
tures that gives our house Character is that it possesses, built into 
the wall of the family room, a set of three pachinko machines. These 
are Japanese gambling devices sort of like vertical pinball machines. 
(The legal papers we got from the sellers hastened to tell us that 
they were “for amusement purposes only.”) Unfortunately, only one 
of the three machines was working when we moved in, and it soon 
died on us. Having become a pachinko addict, I decided to fix it, 
but that was easier said than done. The inside is a veritable Rube 
Goldberg mechanism of levers, hooks, springs, and chutes. My hor- 
monal pride, combined with my Ph.D. in physics, made me certain 
of success, and rendered my eventual utter failure all the more de- 
moralizing. 


Contemplating my defeat, I realized how few complex mechan- 
ical devices I used from day to day. Apart from our cars and my 
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a/ Gymnotus carapo, a knifefish, 
uses electrical signals to sense 
its environment and to commu- 
nicate with others of its species. 
(Greg DeGreef) 


saxophone, every technological tool in our modern life-support sys- 
tem was electronic rather than mechanical. 


9.1 Current and Voltage 


9.1.1 Current 
Unity of all types of electricity 


We are surrounded by things we have been told are “electrical,” 
but it’s far from obvious what they have in common to justify being 
grouped together. What relationship is there between the way socks 
cling together and the way a battery lights a lightbulb? We have 
been told that both an electric eel and our own brains are somehow 
electrical in nature, but what do they have in common? 


British physicist Michael Faraday (1791-1867) set out to address 
this problem. He investigated electricity from a variety of sources 
— including electric eels! — to see whether they could all produce 
the same effects, such as shocks and sparks, attraction and repul- 
sion. “Heating” refers, for example, to the way a lightbulb filament 
gets hot enough to glow and emit light. Magnetic induction is an 
effect discovered by Faraday himself that connects electricity and 
magnetism. We will not study this effect, which is the basis for the 
electric generator, in detail until later in the book. 


attraction and 
shocks sparks repulsion heating 


rubbing J J af J 
battery J J J 


J 
animal J J (/) / 
magnetically induced  / J J / 


The table shows a summary of some of Faraday’s results. Check 
marks indicate that Faraday or his close contemporaries were able to 
verify that a particular source of electricity was capable of producing 
a certain effect. (They evidently failed to demonstrate attraction 
and repulsion between objects charged by electric eels, although 
modern workers have studied these species in detail and been able 
to understand all their electrical characteristics on the same footing 
as other forms of electricity.) 


Faraday’s results indicate that there is nothing fundamentally 
different about the types of electricity supplied by the various sources. 
They are all able to produce a wide variety of identical effects. Wrote 
Faraday, “The general conclusion which must be drawn from this 
collection of facts is that electricity, whatever may be its source, is 
identical in its nature.” 


If the types of electricity are the same thing, what thing is that? 
The answer is provided by the fact that all the sources of electricity 
can cause objects to repel or attract each other. We use the word 
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“charge” to describe the property of an object that allows it to 
participate in such electrical forces, and we have learned that charge 
is present in matter in the form of nuclei and electrons. Evidently 
all these electrical phenomena boil down to the motion of charged 
particles in matter. 


Electric current 


If the fundamental phenomenon is the motion of charged parti- 
cles, then how can we define a useful numerical measurement of it? 
We might describe the flow of a river simply by the velocity of the 
water, but velocity will not be appropriate for electrical purposes 
because we need to take into account how much charge the moving 
particles have, and in any case there are no practical devices sold 
at Radio Shack that can tell us the velocity of charged particles. 
Experiments show that the intensity of various electrical effects is 
related to a different quantity: the number of coulombs of charge 
that pass by a certain point per second. By analogy with the flow 
of water, this quantity is called the electric current , I. Its units of 
coulombs/second are more conveniently abbreviated as amperes, 1 
A=1 C/s. (In informal speech, one usually says “amps.” ) 


The main subtlety involved in this definition is how to account 
for the two types of charge. The stream of water coming from a 
hose is made of atoms containing charged particles, but it produces 
none of the effects we associate with electric currents. For example, 
you do not get an electrical shock when you are sprayed by a hose. 
This type of experiment shows that the effect created by the motion 
of one type of charged particle can be canceled out by the motion of 
the opposite type of charge in the same direction. In water, every 
oxygen atom with a charge of +8e is surrounded by eight electrons 
with charges of —e, and likewise for the hydrogen atoms. 


We therefore refine our definition of current as follows: 


When charged particles are exchanged between regions of space 
A and B, the electric current flowing from A to B is defined as 
_ dq 
dt : 
where dg is the change in region B’s total charge occurring over a 
period of time dt. 


I 


In the garden hose example, your body picks up equal amounts 
of positive and negative charge, resulting in no change in your total 
charge, so the electrical current flowing into you is zero. 
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b/ Example 1 
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lons moving across a cell membrane example 1 
> Figure b shows ions, labeled with their charges, moving in or 
out through the membranes of four cells. If the ions all cross 
the membranes during the same interval of time, how would the 
currents into the cells compare with each other? 


> We're just assuming the rate of flow is constant, so we can talk 
about Aq instead of dq. 


Cell A has positive current going into it because its charge is in- 
creased, i.e., has a positive value of Aq. 


Cell B has the same current as cell A, because by losing one unit 
of negative charge it also ends up increasing its own total charge 
by one unit. 


Cell C’s total charge is reduced by three units, so it has a large 
negative current going into it. 


Cell D loses one unit of charge, so it has a small negative current 
into it. 

Finding current given charge example 2 
> A charged balloon falls to the ground, and its charge begins 
leaking off to the Earth. Suppose that the charge on the balloon 


is given by gq = ae. Find the current as a function of time, and 
interpret the answer. 


> Taking the derivative, we have 


ga oe 
dt 
—abe—"! 
An exponential function approaches zero as the exponent gets 
more and more negative. This means that both the charge and 
the current are decreasing in magnitude with time. It makes sense 
that the charge approaches zero, since the balloon is losing its 
charge. It also makes sense that the current is decreasing in 
magnitude, since charge cannot flow at the same rate forever 
without overshooting zero. 


The reverse of differentiation is integration, so if we know the 


current as a function of time, we can find the charge by integrating. 
Example 8 on page 478 shows such a calculation. 


It may seem strange to say that a negatively charged particle 
going one way creates a current going the other way, but this is 
quite ordinary. As we will see, currents flow through metal wires 
via the motion of electrons, which are negatively charged, so the 
direction of motion of the electrons in a circuit is always opposite to 
the direction of the current. Of course it would have been convenient 
of Benjamin Franklin had defined the positive and negative signs of 
charge the opposite way, since so many electrical devices are based 
on metal wires. 


Number of electrons flowing through a lightbulb example 3 
> If a lightbulb has 1.0 A flowing through it, how many electrons 
will pass through the filament in 1.0 s? 


> We are only calculating the number of electrons that flow, so we 
can ignore the positive and negative signs. Also, since the rate of 
flow is constant, we don’t really need to think in terms of calculus; 
the derivative dq/dt that defines current is the same as Aqg/At in 
this situation. Solving for Aq = /At gives a charge of 1.0 C flowing 
in this time interval. The number of electrons is 


electrons 


coulomb 
coulombs 


electron 


number of electrons = coulombs x 


= coulombs/ 
=1.0C/e 
= 6.2 x 1078 


9.1.2 Circuits 


How can we put electric currents to work? The only method of 
controlling electric charge we have studied so far is to charge differ- 
ent substances, e.g., rubber and fur, by rubbing them against each 
other. Figure c/1 shows an attempt to use this technique to light 
a lightbulb. This method is unsatisfactory. True, current will flow 
through the bulb, since electrons can move through metal wires, and 
the excess electrons on the rubber rod will therefore come through 
the wires and bulb due to the attraction of the positively charged 
fur and the repulsion of the other electrons. The problem is that 
after a zillionth of a second of current, the rod and fur will both have 
run out of charge. No more current will flow, and the lightbulb will 
go out. 


Figure c/2 shows a setup that works. The battery pushes charge 
through the circuit, and recycles it over and over again. (We will 
have more to say later in this chapter about how batteries work.) 
This is called a complete circuit. Today, the electrical use of the 
word “circuit” is the only one that springs to mind for most people, 
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out quickly. 2. A practical circuit. 
3. An open circuit. 4. How an 
ammeter works. 5. Measuring 
the current with an ammeter. 
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but the original meaning was to travel around and make a round 
trip, as when a circuit court judge would ride around the boondocks, 
dispensing justice in each town on a certain date. 


Note that an example like c/3 does not work. The wire will 
quickly begin acquiring a net charge, because it has no way to get 
rid of the charge flowing into it. The repulsion of this charge will 
make it more and more difficult to send any more charge in, and 
soon the electrical forces exerted by the battery will be canceled 
out completely. The whole process would be over so quickly that 
the filament would not even have enough time to get hot and glow. 
This is known as an open circuit. Exactly the same thing would 
happen if the complete circuit of figure c/2 was cut somewhere with 
a pair of scissors, and in fact that is essentially how an ordinary 
light switch works: by opening up a gap in the circuit. 


The definition of electric current we have developed has the great 
virtue that it is easy to measure. In practical electrical work, one 
almost always measures current, not charge. The instrument used 
to measure current is called an ammeter. A simplified ammeter, c/4, 
simply consists of a coiled-wire magnet whose force twists an iron 
needle against the resistance of a spring. The greater the current, 
the greater the force. Although the construction of ammeters may 
differ, their use is always the same. We break into the path of the 
electric current and interpose the meter like a tollbooth on a road, 
c/5. There is still a complete circuit, and as far as the battery and 
bulb are concerned, the ammeter is just another segment of wire. 


Does it matter where in the circuit we place the ammeter? Could 
we, for instance, have put it in the left side of the circuit instead 
of the right? Conservation of charge tells us that this can make no 
difference. Charge is not destroyed or “used up” by the lightbulb, 
so we will get the same current reading on either side of it. What is 
“used up” is energy stored in the battery, which is being converted 
into heat and light energy. 


9.1.3 Voltage 
The volt unit 


Electrical circuits can be used for sending signals, storing infor- 
mation, or doing calculations, but their most common purpose by 
far is to manipulate energy, as in the battery-and-bulb example of 
the previous section. We know that lightbulbs are rated in units of 
watts, i.e., how many joules per second of energy they can convert 
into heat and light, but how would this relate to the flow of charge as 
measured in amperes? By way of analogy, suppose your friend, who 
didn’t take physics, can’t find any job better than pitching bales of 
hay. The number of calories he burns per hour will certainly depend 
on how many bales he pitches per minute, but it will also be pro- 
portional to how much mechanical work he has to do on each bale. 
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If his job is to toss them up into a hayloft, he will get tired a lot 
more quickly than someone who merely tips bales off a loading dock 
into trucks. In metric units, 


joules __ haybales joules 


= x 
second second haybale 


Similarly, the rate of energy transformation by a battery will not 
just depend on how many coulombs per second it pushes through a 
circuit but also on how much mechanical work it has to do on each 
coulomb of charge: 


joules — coulombs joules 


second second coulomb 


or 
power = current x work per unit charge 


Units of joules per coulomb are abbreviated as volts, 1 V=1 J/C, 
named after the Italian physicist Alessandro Volta. Everyone knows 
that batteries are rated in units of volts, but the voltage concept is 
more general than that; it turns out that voltage is a property of 
every point in space. To gain more insight, let’s think more carefully 
about what goes on in the battery and bulb circuit. 


The voltage concept in general 


To do work on a charged particle, the battery apparently must be 
exerting forces on it. How does it do this? Well, the only thing that 
can exert an electrical force on a charged particle is another charged 
particle. It’s as though the haybales were pushing and pulling each 
other into the hayloft! This is potentially a horribly complicated 
situation. Even if we knew how much excess positive or negative 
charge there was at every point in the circuit (which realistically we 
don’t) we would have to calculate zillions of forces using Coulomb’s 
law, perform all the vector additions, and finally calculate how much 
work was being done on the charges as they moved along. To make 
things even more scary, there is more than one type of charged 
particle that moves: electrons are what move in the wires and the 
bulb’s filament, but ions are the moving charge carriers inside the 
battery. Luckily, there are two ways in which we can simplify things: 


The situation is unchanging. Unlike the imaginary setup 
in which we attempted to light a bulb using a rubber rod and a 
piece of fur, this circuit maintains itself in a steady state (after 
perhaps a microsecond-long period of settling down after the 
circuit is first assembled). The current is steady, and as charge 
flows out of any area of the circuit it is replaced by the same 
amount of charge flowing in. The amount of excess positive 
or negative charge in any part of the circuit therefore stays 
constant. Similarly, when we watch a river flowing, the water 
goes by but the river doesn’t disappear. 
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Force depends only on position. Since the charge distri- 
bution is not changing, the total electrical force on a charged 
particle depends only on its own charge and on its location. 
If another charged particle of the same type visits the same 
location later on, it will feel exactly the same force. 


The second observation tells us that there is nothing all that 
different about the experience of one charged particle as compared 
to another’s. If we single out one particle to pay attention to, and 
figure out the amount of work done on it by electrical forces as it 
goes from point A to point B along a certain path, then this is 
the same amount of work that will be done on any other charged 
particles of the same type as it follows the same path. For the sake of 
visualization, let’s think about the path that starts at one terminal 
of the battery, goes through the light bulb’s filament, and ends at 
the other terminal. When an object experiences a force that depends 
only on its position (and when certain other, technical conditions 
are satisfied), we can define an electrical energy associated with the 
position of that object. The amount of work done on the particle 
by electrical forces as it moves from A to B equals the drop in 
electrical energy between A and B. This electrical energy is what is 
being converted into other forms of energy such as heat and light. 
We therefore define voltage in general as electrical energy per unit 
charge: 


The difference in voltage between two points in space is defined 
as 
AV = AU elec / 4 ’ 


where AUgiec is the change in the electrical energy of a particle with 
charge q as it moves from the initial point to the final point. 


The amount of power dissipated (i.e., rate at which energy is 
transformed by the flow of electricity) is then given by the equation 


P=fAV 


‘Energy stored in a battery example 4 

> The 1.2 V rechargeable battery in figure d is labeled 1800 milliamp- 
hours. What is the maximum amount of energy the battery can 
store? 


d/ Example 4. > An ampere-hour is a unit of current multiplied by a unit of time. 
Current is charge per unit time, so an ampere-hour is in fact a 
funny unit of charge: 


(1 A)(1 hour) = (1 C/s)(3600 s) 
= 3600 C 


1800 milliamp-hours is therefore 1800 x 10-° x 3600 C = 6.5 x 
10° C. That’s a huge number of charged particles, but the total 
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loss of electrical energy will just be their total charge multiplied by 
the voltage difference across which they move: 


AUglec = AV 
= (6.5 x 10° C)(1.2 V) 
= 7.8 kJ 
Units of volt-amps example 5 


> Doorbells are often rated in volt-amps. What does this combi- 
nation of units mean? 


> Current times voltage gives units of power, P = /AV, so volt- 
amps are really just a nonstandard way of writing watts. They are 
telling you how much power the doorbell requires. 


Power dissipated by a battery and bulb example 6 
> If a 9.0-volt battery causes 1.0 A to flow through a lightbulb, how 
much power is dissipated? 


> The voltage rating of a battery tells us what voltage difference 
AV it is designed to maintain between its terminals. 


The only nontrivial thing in this problem was dealing with the units. 
One quickly gets used to translating common combinations like 
A.-V into simpler terms. 
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Here are a few questions and answers about the voltage concept. 


Question: OK, so what is voltage, really? 

Answer: A device like a battery has positive and negative charges 
inside it that push other charges around the outside circuit. A 
higher-voltage battery has denser charges in it, which will do more 
work on each charged particle that moves through the outside cir- 
cuit. 


To use a gravitational analogy, we can put a paddlewheel at the 
bottom of either a tall waterfall or a short one, but a kg of water 
that falls through the greater gravitational energy difference will 
have more energy to give up to the paddlewheel at the bottom. 


Question: Why do we define voltage as electrical energy divided by 
charge, instead of just defining it as electrical energy? 

Answer: One answer is that it’s the only definition that makes the 
equation P = ITAV work. A more general answer is that we want 
to be able to define a voltage difference between any two points 
in space without having to know in advance how much charge the 
particles moving between them will have. If you put a nine-volt 
battery on your tongue, then the charged particles that move across 
your tongue and give you that tingly sensation are not electrons but 
ions, which may have charges of +e, —2e, or practically anything. 
The manufacturer probably expected the battery to be used mostly 
in circuits with metal wires, where the charged particles that flowed 
would be electrons with charges of —e. If the ones flowing across 
your tongue happen to have charges of —2e, the electrical energy 
difference for them will be twice as much, but dividing by their 
charge of —2e in the definition of voltage will still give a result of 9 
V. 


Question: Are there two separate roles for the charged particles in 
the circuit, a type that sits still and exerts the forces, and another 
that moves under the influence of those forces? 

Answer: No. Every charged particle simultaneously plays both 
roles. Newton’s third law says that any particle that has an electri- 
cal force acting on it must also be exerting an electrical force back on 
the other particle. There are no “designated movers” or “designated 
force-makers.” 


Question: Why does the definition of voltage only refer to voltage 
differences? 

Answer: It’s perfectly OK to define voltage as V = Uejec/q. But 
recall that it is only differences in interaction energy, U, that have 
direct physical meaning in physics. Similarly, voltage differences are 
really more useful than absolute voltages. A voltmeter measures 
voltage differences, not absolute voltages. 
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Discussion Questions 


A Aroller coaster is sort of like an electric circuit, but it uses gravitational 
forces on the cars instead of electric ones. What would a high-voltage 
roller coaster be like? What would a high-current roller coaster be like? 


B- Criticize the following statements: 


“He touched the wire, and 10000 volts went through him.” 
“That battery has a charge of 9 volts.” 


“You used up the charge of the battery.” 


Cc When you touch a 9-volt battery to your tongue, both positive and 
negative ions move through your saliva. Which ions go which way? 


D | once touched a piece of physics apparatus that had been wired 
incorrectly, and got a several-thousand-volt voltage difference across my 
hand. | was not injured. For what possible reason would the shock have 
had insufficient power to hurt me? 


9.1.4 Resistance 
Resistance 


So far we have simply presented it as an observed fact that a 
battery-and-bulb circuit quickly settles down to a steady flow, but 
why should it? Newton’s second law, a = F'/m, would seem to 
predict that the steady forces on the charged particles should make 
them whip around the circuit faster and faster. The answer is that as 
charged particles move through matter, there are always forces, anal- 
ogous to frictional forces, that resist the motion. These forces need 
to be included in Newton’s second law, which is really a = Fiotar/m, 
not a = F'/m. If, by analogy, you push a crate across the floor at 
constant speed, i.e., with zero acceleration, the total force on it must 
be zero. After you get the crate going, the floor’s frictional force is 
exactly canceling out your force. The chemical energy stored in 
your body is being transformed into heat in the crate and the floor, 
and no longer into an increase in the crate’s kinetic energy. Simi- 
larly, the battery’s internal chemical energy is converted into heat, 
not into perpetually increasing the charged particles’ kinetic energy. 
Changing energy into heat may be a nuisance in some circuits, such 
as a computer chip, but it is vital in a lightbulb, which must get hot 
enough to glow. Whether we like it or not, this kind of heating effect 
is going to occur any time charged particles move through matter. 


What determines the amount of heating? One flashlight bulb 
designed to work with a 9-volt battery might be labeled 1.0 watts, 
another 5.0. How does this work? Even without knowing the details 
of this type of friction at the atomic level, we can relate the heat 
dissipation to the amount of current that flows via the equation 
P = IAV. If the two flashlight bulbs can have two different values 
of P when used with a battery that maintains the same AV, it 
must be that the 5.0-watt bulb allows five times more current to 
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e/Georg Simon Ohm (1787- 
1854). 


4 


f/Four objects made of the 
same substance have different 
resistances. 


flow through it. 


For many substances, including the tungsten from which light- 
bulb filaments are made, experiments show that the amount of cur- 
rent that will flow through it is directly proportional to the voltage 
difference placed across it. For an object made of such a substance, 
we define its electrical resistance as follows: 


If an object inserted in a circuit displays a current flow which 
is proportional to the voltage difference across it, then we define its 
resistance as the constant ratio 


R=AV/I 


The units of resistance are volts/ampere, usually abbreviated as 
ohms, symbolized with the capital Greek letter omega, 2. 


‘Resistance of a lightbulb example 7 
> A flashlight bulb powered by a 9-volt battery has a resistance of 
10 O. How much current will it draw? 


> Solving the definition of resistance for /, we find 


[= AV/R 
=0.9V/Q 
= 0.9 V/(V/A) 
=0.9A 


Ohm’s law states that many substances, including many solids 
and some liquids, display this kind of behavior, at least for voltages 
that are not too large. The fact that Ohm’s law is called a “law” 
should not be taken to mean that all materials obey it, or that it has 
the same fundamental importance as Newton’s laws, for example. 
Materials are called ohmic or nonohmic, depending on whether they 
obey Ohm’s law. 


If objects of the same size and shape made from two different 
ohmic materials have different resistances, we can say that one ma- 
terial is more resistive than the other, or equivalently that it is less 
conductive. Materials, such as metals, that are very conductive are 
said to be good conductors. Those that are extremely poor conduc- 
tors, for example wood or rubber, are classified as insulators. There 
is no sharp distinction between the two classes of materials. Some, 
such as silicon, lie midway between the two extremes, and are called 
semiconductors. 


On an intuitive level, we can understand the idea of resistance 
by making the sounds “hhhhhh” and “ffffff.”. To make air flow out 
of your mouth, you use your diaphragm to compress the air in your 
chest. The pressure difference between your chest and the air outside 
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your mouth is analogous to a voltage difference. When you make the 
“h” sound, you form your mouth and throat in a way that allows air 
to flow easily. The large flow of air is like a large current. Dividing 
by a large current in the definition of resistance means that we get 
a small resistance. We say that the small resistance of your mouth 
and throat allows a large current to flow. When you make the “f” 
sound, you increase the resistance and cause a smaller current to 
flow. 


Note that although the resistance of an object depends on the 
substance it is made of, we cannot speak simply of the “resistance 
of gold” or the “resistance of wood.” Figure f shows four examples 
of objects that have had wires attached at the ends as electrical 
connections. If they were made of the same substance, they would 
all nevertheless have different resistances because of their different 
sizes and shapes. A more detailed discussion will be more natural 
in the context of the following chapter, but it should not be too 
surprising that the resistance of f/2 will be greater than that of 
f/1 — the image of water flowing through a pipe, however incorrect, 
gives us the right intuition. Object f/3 will have a smaller resistance 
than f/1 because the charged particles have less of it to get through. 


Superconductors 


All materials display some variation in resistance according to 
temperature (a fact that is used in thermostats to make a ther- 
mometer that can be easily interfaced to an electric circuit). More 
spectacularly, most metals have been found to exhibit a sudden 
change to zero resistance when cooled to a certain critical temper- 
ature. They are then said to be superconductors. Theoretically, 
superconductors should make a great many exciting devices possi- 
ble, for example coiled-wire magnets that could be used to levitate 
trains. In practice, the critical temperatures of all metals are very 
low, and the resulting need for extreme refrigeration has made their 
use uneconomical except for such specialized applications as particle 
accelerators for physics research. 


But scientists have recently made the surprising discovery that 
certain ceramics are superconductors at less extreme temperatures. 
The technological barrier is now in finding practical methods for 
making wire out of these brittle materials. Wall Street is currently 
investing billions of dollars in developing superconducting devices 
for cellular phone relay stations based on these materials. In 2001, 
the city of Copenhagen replaced a short section of its electrical power 
trunks with superconducing cables, and they are now in operation 
and supplying power to customers. 


There is currently no satisfactory theory of superconductivity in 
general, although superconductivity in metals is understood fairly 
well. Unfortunately I have yet to find a fundamental explanation of 
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g/A superconducting  seg- 
ment of the ATLAS accelerator 
at Argonne National Laboratory 


near Chicago. It is used to 
accelerate beams of ions to a few 
percent of the speed of light for 
nuclear physics research. The 
shiny silver-colored surfaces are 
made of the element niobium, 
which is a superconductor at 
relatively high temperatures com- 
pared to other metals — relatively 
high meaning the temperature of 
liquid helium! The beam of ions 
passes through the holes in the 
two small cylinders on the ends 
of the curved rods. Charge is 
shuffled back and forth between 
them at a frequency of 12 million 
cycles per second, so that they 
take turns being positive and 
negative. The positively charged 
beam consists of short spurts, 
each timed so that when it is 
in one of the segments it will 
be pulled forward by negative 
charge on the cylinder in front 
of it and pushed forward by the 
positively charged one behind. 
The huge currents involved would 
quickly melt any metal that was 
not superconducting, but in a 
superconductor they produce no 
heat at all. 
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superconductivity in metals that works at the introductory level. 


Finding charge given current example 8 
> In the segment of the ATLAS accelerator shown in figure g, the 
current flowing back and forth between the two cylinders is given 
by / = acos bt. What is the charge on one of the cylinders as a 
function of time? > We are given the current and want to find the 
charge, i.e., we are given the derivative and we want to find the 
original function that would give that derivative. This means we 


need to integrate: 
q= | tat 


= | acosbtat 


where q) is a constant of integration. 


We can interpret this in order to explain why a superconductor 
needs to be used. The constant b must be very large, since the 
current is supposed to oscillate back and forth millions of times a 
second. Looking at the final result, we see that if b is a very large 
number, and q is to be a significant amount of charge, then a must 
be a very large number as well. If a is numerically large, then the 
current must be very large, so it would heat the accelerator too 
much if it was flowing through an ordinary conductor. 


Constant voltage throughout a conductor 


The idea of a superconductor leads us to the question of how 
we should expect an object to behave if it is made of a very good 
conductor. Superconductors are an extreme case, but often a metal 
wire can be thought of as a perfect conductor, for example if the 
parts of the circuit other than the wire are made of much less con- 
ductive materials. What happens if R equals zero in the equation 
R=AV/I? The result of dividing two numbers can only be zero if 
the number on top equals zero. This tells us that if we pick any two 
points in a perfect conductor, the voltage difference between them 
must be zero. In other words, the entire conductor must be at the 
same voltage. 


Constant voltage means that no work would be done on a charge 
as it moved from one point in the conductor to another. If zero work 
was done only along a certain path between two specific points, it 
might mean that positive work was done along part of the path and 
negative work along the rest, resulting in a cancellation. But there is 
no way that the work could come out to be zero for all possible paths 
unless the electrical force on a charge was in fact zero at every point. 
Suppose, for example, that you build up a static charge by scuffing 
your feet on a carpet, and then you deposit some of that charge onto 
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a doorknob, which is a good conductor. How can all that charge be 
in the doorknob without creating any electrical force at any point 
inside it? The only possible answer is that the charge moves around 
until it has spread itself into just the right configuration so that the 
forces exerted by all the little bits of excess surface charge on any 
charged particle within the doorknob exactly canceled out. 


We can explain this behavior if we assume that the charge placed 
on the doorknob eventually settles down into a stable equilibrium. 
Since the doorknob is a conductor, the charge is free to move through 
it. If it was free to move and any part of it did experience a nonzero 
total force from the rest of the charge, then it would move, and we 
would not have an equilibrium. 


It also turns out that charge placed on a conductor, once it 
reaches its equilibrium configuration, is entirely on the surface, not 
on the interior. We will not prove this fact formally, but it is in- 
tuitively reasonable. Suppose, for instance, that the net charge on 
the conductor is negative, i.e., it has an excess of electrons. These 
electrons all repel each other, and this repulsion will tend to push 
them onto the surface, since being on the surface allows them to be 
as far apart as possible. 


Short circuits 


So far we have been assuming a perfect conductor. What if it 
is a good conductor, but not a perfect one? Then we can solve for 
AV =7R. An ordinary-sized current will make a very small result 
when we multiply it by the resistance of a good conductor such as 
a metal wire. The voltage throughout the wire will then be nearly 
constant. If, on the other hand, the current is extremely large, we 
can have a significant voltage difference. This is what happens in a 
short-circuit: a circuit in which a low-resistance pathway connects 
the two sides of a voltage source. Note that this is much more 
specific than the popular use of the term to indicate any electrical 
malfunction at all. If, for example, you short-circuit a 9-volt battery 
as shown in figure h, you will produce perhaps a thousand amperes 
of current, leading to a very large value of P = [AV. The wire gets 
hot! 


self-check A 
What would happen to the battery in this kind of short circuit? > 
Answer, p. 870 


At this stage, most students have a hard time understanding why 
resistors would be used inside a radio or a computer. We obviously 
want a lightbulb or an electric stove to have a circuit element that 
resists the flow of electricity and heats up, but heating is undesirable 
in radios and computers. Without going too far afield, let’s use a 
mechanical analogy to get a general idea of why a resistor would be 
used in a radio. 


h/Short-circuiting a __ battery. 
Warning: you can burn yourself 
this way or start a fire! If you 
want to try this, try making the 
connection only very briefly, use 
a low-voltage battery, and avoid 
touching the battery or the wire, 
both of which will get hot. 


color meaning 
black 
brown 
red 
orange 
yellow 
green 
blue 
violet 
gray 
white 
silver +10% 
gold +5% 

i / Color codes used on resistors. 
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| | 
blue 
brown 
red "| 


2 1 6 +10% 
21x10 Q 410% 


j/An example of a_ resistor 
with a color code. 


a) \ 


k/ The symbol used in schemat- 
ics to represent a_ resistor. 


The main parts of a radio receiver are an antenna, a tuner for 
selecting the frequency, and an amplifier to strengthen the signal suf- 
ficiently to drive a speaker. The tuner resonates at the selected fre- 
quency, just as in the examples of mechanical resonance discussed in 
3. The behavior of a mechanical resonator depends on three things: 
its inertia, its stiffness, and the amount of friction or damping. The 
first two parameters locate the peak of the resonance curve, while 
the damping determines the width of the resonance. In the radio 
tuner we have an electrically vibrating system that resonates at a 
particular frequency. Instead of a physical object moving back and 
forth, these vibrations consist of electrical currents that flow first 
in one direction and then in the other. In a mechanical system, 
damping means taking energy out of the vibration in the form of 
heat, and exactly the same idea applies to an electrical system: the 
resistor supplies the damping, and therefore controls the width of 
the resonance. If we set out to eliminate all resistance in the tuner 
circuit, by not building in a resistor and by somehow getting rid of 
all the inherent electrical resistance of the wires, we would have a 
useless radio. The tuner’s resonance would be so narrow that we 
could never get close enough to the right frequency to bring in the 
station. The roles of inertia and stiffness are played by other circuit 
elements we have not discusses (a capacitor and a coil). 


Resistors 


Inside any electronic gadget you will see quite a few little circuit 
elements like the one shown below. These resistors are simply a 
cylinder of ohmic material with wires attached to the end. 


Many electrical devices are based on electrical resistance and 
Ohm’s law, even if they do not have little components in them that 
look like the usual resistor. The following are some examples. 


Lightbulb 


There is nothing special about a lightbulb filament — you can 
easily make a lightbulb by cutting a narrow waist into a metallic 
gum wrapper and connecting the wrapper across the terminals of 
a 9-volt battery. The trouble is that it will instantly burn out. 
Edison solved this technical challenge by encasing the filament in 
an evacuated bulb, which prevented burning, since burning requires 
oxygen. 


Polygraph 


The polygraph, or “lie detector,” is really just a set of meters 
for recording physical measures of the subject’s psychological stress, 
such as sweating and quickened heartbeat. The real-time sweat mea- 
surement works on the principle that dry skin is a good insulator, 
but sweaty skin is a conductor. Of course a truthful subject may 
become nervous simply because of the situation, and a practiced 
liar may not even break a sweat. The method’s practitioners claim 
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that they can tell the difference, but you should think twice before 
allowing yourself to be polygraph tested. Most U.S. courts exclude 
all polygraph evidence, but some employers attempt to screen out 
dishonest employees by polygraph testing job applicants, an abuse 
that ranks with such pseudoscience as handwriting analysis. 


Fuse 


A fuse is a device inserted in a circuit tollbooth-style in the same 
manner as an ammeter. It is simply a piece of wire made of metals 
having a relatively low melting point. If too much current passes 
through the fuse, it melts, opening the circuit. The purpose is to 
make sure that the building’s wires do not carry so much current 
that they themselves will get hot enough to start a fire. Most modern 
houses use circuit breakers instead of fuses, although fuses are still 
common in cars and small devices. A circuit breaker is a switch 
operated by a coiled-wire magnet, which opens the circuit when 
enough current flows. The advantage is that once you turn off some 
of the appliances that were sucking up too much current, you can 
immediately flip the switch closed. In the days of fuses, one might 
get caught without a replacement fuse, or even be tempted to stuff 
aluminum foil in as a replacement, defeating the safety feature. 


Voltmeter 


A voltmeter is nothing more than an ammeter with an additional 
high-value resistor through which the current is also forced to flow. 
Ohm’s law relates the current through the resistor is related directly 
to the voltage difference across it, so the meter can be calibrated 
in units of volts based on the known value of the resistor. The 
voltmeter’s two probes are touched to the two locations in a circuit 
between which we wish to measure the voltage difference, 1/2. Note 
how cumbersome this type of drawing is, and how difficult it can 
be to tell what is connected to what. This is why electrical drawing 
are usually shown in schematic form. Figure 1/3 is a schematic 
representation of figure 1/2. 


The setups for measuring current and voltage are different. When 
we are measuring current, we are finding “how much stuff goes 
through,” so we place the ammeter where all the current is forced 
to go through it. Voltage, however, is not “stuff that goes through,” 
it is a measure of electrical energy. If an ammeter is like the meter 
that measures your water use, a voltmeter is like a measuring stick 
that tells you how high a waterfall is, so that you can determine how 
much energy will be released by each kilogram of falling water. We 
do not want to force the water to go through the measuring stick! 


voltmeter 


1/1. A simplified diagram of 
how a voltmeter works. 2. Mea- 
suring the voltage difference 
across a lightbulb. 3. The same 
setup drawn in schematic form. 4. 
The setup for measuring current 
is different. 
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The arrangement in figure 1/3 is a parallel circuit: one in there are 
“forks in the road” where some of the current will flow one way and 
some will flow the other. Figure 1/4 is said to be wired in series: 
all the current will visit all the circuit elements one after the other. 
We will deal with series and parallel circuits in more detail in the 
following chapter. 


If you inserted a voltmeter incorrectly, in series with the bulb and 
battery, its large internal resistance would cut the current down so 
low that the bulb would go out. You would have severely disturbed 
the behavior of the circuit by trying to measure something about it. 


Incorrectly placing an ammeter in parallel is likely to be even 
more disconcerting. The ammeter has nothing but wire inside it to 
provide resistance, so given the choice, most of the current will flow 
through it rather than through the bulb. So much current will flow 
through the ammeter, in fact, that there is a danger of burning out 
the battery or the meter or both! For this reason, most ammeters 
have fuses or circuit breakers inside. Some models will trip their 
circuit breakers and make an audible alarm in this situation, while 
others will simply blow a fuse and stop working until you replace it. 


Discussion Questions 


A __Infigure 1/1, would it make any difference in the voltage measurement 
if we touched the voltmeter’s probes to different points along the same 
segments of wire? 


B- Explain why it would be incorrect to define resistance as the amount 
of charge the resistor allows to flow. 


9.1.5 Current-conducting properties of materials 


Ohm’s law has a remarkable property, which is that current will 
flow even in response to a voltage difference that is as small as we 
care to make it. In the analogy of pushing a crate across a floor, it 
is as though even a flea could slide the crate across the floor, albeit 
at some very low speed. The flea cannot do this because of static 
friction, which we can think of as an effect arising from the tendency 
of the microscopic bumps and valleys in the crate and floor to lock 
together. The fact that Ohm’s law holds for nearly all solids has 
an interesting interpretation: at least some of the electrons are not 
“locked down” at all to any specific atom. 


More generally we can ask how charge actually flows in various 
solids, liquids, and gases. This will lead us to the explanations of 
many interesting phenomena, including lightning, the bluish crust 
that builds up on the terminals of car batteries, and the need for 
electrolytes in sports drinks. 


Solids 


In atomic terms, the defining characteristic of a solid is that its 
atoms are packed together, and the nuclei cannot move very far from 
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their equilibrium positions. It makes sense, then, that electrons, not 
ions, would be the charge carriers when currents flow in solids. This 
fact was established experimentally by Tolman and Stewart, in an 
experiment in which they spun a large coil of wire and then abruptly 
stopped it. They observed a current in the wire immediately after 
the coil was stopped, which indicated that charged particles that 
were not permanently locked to a specific atom had continued to 
move because of their own inertia, even after the material of the 
wire in general stopped. The direction of the current showed that 
it was negatively charged particles that kept moving. The current 
only lasted for an instant, however; as the negatively charged par- 
ticles collected at the downstream end of the wire, farther particles 
were prevented joining them due to their electrical repulsion, as well 
as the attraction from the upstream end, which was left with a net 
positive charge. Tolman and Stewart were even able to determine 
the mass-to-charge ratio of the particles. We need not go into the 
details of the analysis here, but particles with high mass would be 
difficult to decelerate, leading to a stronger and longer pulse of cur- 
rent, while particles with high charge would feel stronger electrical 
forces decelerating them, which would cause a weaker and shorter 
pulse. The mass-to-charge ratio thus determined was consistent with 
the m/q of the electron to within the accuracy of the experiment, 
which essentially established that the particles were electrons. 


The fact that only electrons carry current in solids, not ions, has 
many important implications. For one thing, it explains why wires 
don’t fray or turn to dust after carrying current for a long time. 
Electrons are very small (perhaps even pointlike), and it is easy to 
imagine them passing between the cracks among the atoms without 
creating holes or fractures in the atomic framework. For those who 
know a little chemistry, it also explains why all the best conductors 
are on the left side of the periodic table. The elements in that area 
are the ones that have only a very loose hold on their outermost 
electrons. 


Gases 


The molecules in a gas spend most of their time separated from 
each other by significant distances, so it is not possible for them to 
conduct electricity the way solids do, by handing off electrons from 
atom to atom. It is therefore not surprising that gases are good 
insulators. 


Gases are also usually nonohmic. As opposite charges build up 
on a stormcloud and the ground below, the voltage difference be- 
comes greater and greater. Zero current flows, however, until finally 
the voltage reaches a certain threshold and we have an impressive 
example of what is known as a spark or electrical discharge. If air 
was ohmic, the current between the cloud and the ground would 
simply increase steadily as the voltage difference increased, rather 
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than being zero until a threshold was reached. This behavior can be 
explained as follows. At some point, the electrical forces on the air 
electrons and nuclei of the air molecules become so strong that elec- 
trons are ripped right off of some of the molecules. The electrons 
then accelerate toward either the cloud or the ground, whichever 
is positively charged, and the positive ions accelerate the opposite 
way. As these charge carriers accelerate, they strike and ionize other 
molecules, which produces a rapidly growing cascade. 
Liquids 

Molecules in a liquid are able to slide past each other, so ions 
as well as electrons can carry currents. Pure water is a poor con- 
ductor because the water molecules tend to hold onto their electrons 
strongly, and there are therefore not many electrons or ions available 
to move. Water can become quite a good conductor, however, with 
the addition of even a small amount of certain substances called 
electrolytes, which are typically salts. For example, if we add table 
salt, NaCl, to water, the NaCl molecules dissolve into Na* and Cl 
ions, which can then move and create currents. This is why elec- 
tric currents can flow among the cells in our bodies: cellular fluid is 
quite salty. When we sweat, we lose not just water but electrolytes, 
so dehydration plays havoc with our cells’ electrical systems. It is 
for this reason that electrolytes are included in sports drinks and 
formulas for rehydrating infants who have diarrhea. 


Since current flow in liquids involves entire ions, it is not sur- 
prising that we can see physical evidence when it has occurred. For 
example, after a car battery has been in use for a while, the HySO4 
battery acid becomes depleted of hydrogen ions, which are the main 
charge carriers that complete the circuit on the inside of the bat- 
tery. The leftover SO, then forms a visible blue crust on the battery 
posts. 


Speed of currents and electrical signals 


When I talk on the phone to my mother in law two thousand 
miles away, I do not notice any delay while the signal makes its way 
back and forth. Electrical signals therefore must travel very quickly, 
but how fast exactly? The answer is rather subtle. For the sake 
of concreteness, let’s restrict ourselves to currents in metals, which 
consist of electrons. 


The electrons themselves are only moving at speeds of perhaps 
a few thousand miles per hour, and their motion is mostly random 
thermal motion. This shows that the electrons in my phone cannot 
possibly be zipping back and forth between California and New York 
fast enough to carry the signals. Even if their thousand-mile-an-hour 
motion was organized rather than random, it would still take them 
many minutes to get there. Realistically, it will take the average 
electron even longer than that to make the trip. The current in the 
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wire consists only of a slow overall drift, at a speed on the order 
of a few centimeters per second, superimposed on the more rapid 
random motion. We can compare this with the slow westward drift 
in the population of the U.S. If we could make a movie of the motion 
of all the people in the U.S. from outer space, and could watch it at 
high speed so that the people appeared to be scurrying around like 
ants, we would think that the motion was fairly random, and we 
would not immediately notice the westward drift. Only after many 
years would we realize that the number of people heading west over 
the Sierras had exceeded the number going east, so that California 
increased its share of the country’s population. 


So why are electrical signals so fast if the average drift speed of 
electrons is so slow? The answer is that a disturbance in an electrical 
system can move much more quickly than the charges themselves. 
It is as though we filled a pipe with golf balls and then inserted an 
extra ball at one end, causing a ball to fall out at the other end. 
The force propagated to the other end in a fraction of a second, but 
the balls themselves only traveled a few centimeters in that time. 


Because the reality of current conduction is so complex, we often 
describe things using mental shortcuts that are technically incorrect. 
This is OK as long as we know that they are just shortcuts. For 
example, suppose the presidents of France and Russia shake hands, 
and the French politician has inadvertently picked up a positive elec- 
trical charge, which shocks the Russian. We may say that the excess 
positively charged particles in the French leader’s body, which all 
repel each other, take the handshake as an opportunity to get far- 
ther apart by spreading out into two bodies rather than one. In 
reality, it would be a matter of minutes before the ions in one per- 
son’s body could actually drift deep into the other’s. What really 
happens is that throughout the body of the recipient of the shock 
there are already various positive and negative ions which are free 
to move. Even before the perpetrator’s charged hand touches the 
victim’s sweaty palm, the charges in the shocker’s body begin to 
repel the positive ions and attract the negative ions in the other 
person. The split-second sensation of shock is caused by the sudden 
jumping of the victim’s ions by distances of perhaps a micrometer, 
this effect occurring simultaneously throughout the whole body, al- 
though more violently in the hand and arm, which are closer to the 
other person. 
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a/1. Wrong: The shapes of the 
wires are irrelevant. 2. Wrong: 
Right angles should be used. 3. 
Wrong: A simple pattern is made 
to look unfamiliar and compli- 
cated. 4. Right. 


b/The two’ shaded _ areas 
shaped like the letter “E” are both 
regions of constant voltage. 


9.2 Parallel and Series Circuits 


In section 9.1, we limited ourselves to relatively simple circuits, es- 
sentially nothing more than a battery and a single lightbulb. The 
purpose of this chapter is to introduce you to more complex circuits, 
containing multiple resistors or voltage sources in series, in parallel, 
or both. 


9.2.1 Schematics 


I see a chess position; Kasparov sees an interesting Ruy Lopez 
variation. To the uninitiated a schematic may look as unintelligible 
as Mayan hieroglyphs, but even a little bit of eye training can go a 
long way toward making its meaning leap off the page. A schematic 
is a stylized and simplified drawing of a circuit. The purpose is to 
eliminate as many irrelevant features as possible, so that the relevant 
ones are easier to pick out. 


1 2 3 4 

An example of an irrelevant feature is the physical shape, length, 
and diameter of a wire. In nearly all circuits, it is a good approxi- 
mation to assume that the wires are perfect conductors, so that any 
piece of wire uninterrupted by other components has constant volt- 
age throughout it. Changing the length of the wire, for instance, 
does not change this fact. (Of course if we used miles and miles 
of wire, as in a telephone line, the wire’s resistance would start to 
add up, and its length would start to matter.) The shapes of the 
wires are likewise irrelevant, so we draw them with standardized, 
stylized shapes made only of vertical and horizontal lines with right- 
angle bends in them. This has the effect of making similar circuits 
look more alike and helping us to recognize familiar patterns, just 


as words in a newspaper are easier to recognize than handwritten 
ones. Figure a shows some examples of these concepts. 


The most important first step in learning to read schematics is 
to learn to recognize contiguous pieces of wire which must have con- 
stant voltage throughout. In figure b, for example, the two shaded 
E-shaped pieces of wire must each have constant voltage. This fo- 
cuses our attention on two of the main unknowns we’d like to be 
able to predict: the voltage of the left-hand E and the voltage of 
the one on the right. 
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9.2.2 Parallel resistances and the junction rule 


One of the simplest examples to analyze is the parallel resistance 
circuit, of which figure b was an example. In general we may have 
unequal resistances R; and Rg, as in c/1. Since there are only two 
constant-voltage areas in the circuit, c/2, all three components have 
the same voltage difference across them. A battery normally suc- 
ceeds in maintaining the voltage differences across itself for which it 
was designed, so the voltage drops AV, and AV) across the resistors 
must both equal the voltage of the battery: 


AV, a AV2 — AVbattery 


Each resistance thus feels the same voltage difference as if it was 
the only one in the circuit, and Ohm’s law tells us that the amount 
of current flowing through each one is also the same as it would 
have been in a one-resistor circuit. This is why household electrical 
circuits are wired in parallel. We want every appliance to work 
the same, regardless of whether other appliances are plugged in or 
unplugged, turned on or switched off. (The electric company doesn’t 
use batteries of course, but our analysis would be the same for any 
device that maintains a constant voltage.) 


uf R Fi 2 3 4 

vin E- fe} 

Of course the electric company can tell when we turn on every 
light in the house. How do they know? The answer is that we draw 
more current. Each resistance draws a certain amount of current, 
and the amount that has to be supplied is the sum of the two indi- 


vidual currents. The current is like a river that splits in half, ¢/3, 
and then reunites. The total current is 


Opa = qi + Ip 


This is an example of a general fact called the junction rule: 


In any circuit that is not storing or releasing charge, conservation 
of charge implies that the total current flowing out of any junction 
must be the same as the total flowing in. 


Coming back to the analysis of our circuit, we apply Ohm’s law 


c/1. Two resistors in parallel. 
2. There are two constant-voltage 
areas. 3. The current that comes 
out of the battery splits between 
the two resistors, and later re- 
unites. 4. The two resistors in 
parallel can be treated as a single 
resistor with a smaller resistance 
value. 
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to each resistance, resulting in 


Total = AV/R, + AV/R2 


As far as the electric company is concerned, your whole house 
is just one resistor with some resistance R, called the equivalent 
resistance. They would write Ohm’s law as 


Ltotal = AV/R , 


from which we can determine the equivalent resistance by compari- 
son with the previous expression: 


1 1 
ey eee oe 
/ Ri Rs 


1 rye 3 
R=(—+— 
(atm) 


[equivalent resistance of two resistors in parallel] 


Two resistors in parallel, c/4, are equivalent to a single resistor with 
a value given by the above equation. 


Two lamps on the same household circuit example 9 
> You turn on two lamps that are on the same household circuit. 
Each one has a resistance of 1 ohm. What is the equivalent re- 
sistance, and how does the power dissipation compare with the 
case of a single lamp? 


> The equivalent resistance of the two lamps in parallel is 
1s ee 
R=(—+— 
& : Fi 


(1 Ee ee 
“\ta*Ta 


The voltage difference across the whole circuit is always the 110 
V set by the electric company (it’s alternating current, but that’s 
irrelevant). The resistance of the whole circuit has been cut in 
half by turning on the second lamp, so a fixed amount of voltage 
will produce twice as much current. Twice the current flowing 
across the same voltage difference means twice as much power 
dissipation, which makes sense. 
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The cutting in half of the resistance surprises many students, 
since we are “adding more resistance” to the circuit by putting in 
the second lamp. Why does the equivalent resistance come out to be 
less than the resistance of a single lamp? This is a case where purely 
verbal reasoning can be misleading. A resistive circuit element, such 
as the filament of a lightbulb, is neither a perfect insulator nor 
a perfect conductor. Instead of analyzing this type of circuit in 
terms of “resistors,” i.e., partial insulators, we could have spoken of 
“conductors.” This example would then seem reasonable, since we 
“added more conductance,” but one would then have the incorrect 
expectation about the case of resistors in series, discussed in the 
following section. 


Perhaps a more productive way of thinking about it is to use 
mechanical intuition. By analogy, your nostrils resist the flow of 
air through them, but having two nostrils makes it twice as easy to 
breathe. 1 


Three resistors in parallel example 10 
> What happens if we have three or more resistors in parallel? ea 


> This is an important example, because the solution involves 
an important technique for understanding circuits: breaking them 
down into smaller parts and them simplifying those parts. In the 


circuit d/1, with three resistors in parallel, we can think of two of 
the resistors as forming a single big resistor, d/2, with equivalent R 


- 


emeeeeeny 


Se 


resistance 
12° Ri Re 


We can then simplify the circuit as shown in d/3, so that it con- 


tains only two resistances. The equivalent resistance of the whole d/Three resistors 


circuit is then given by 


tt... 4 
R23 = (az) 


Substituting for R;2 and simplifying, we find the result 


which you probably could have guessed. The interesting point 
here is the divide-and-conquer concept, not the mathematical re- 
sult. 


An arbitrary number of identical resistors in parallel example 11 
> What is the resistance of N identical resistors in parallel? 


> Generalizing the results for two and three resistors, we have 
fi. st a 
Ry={>—-+— +... 
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as 


e/Uniting four resistors in 
parallel is equivalent to making 
a single resistor with the same 
length but four times the cross- 
sectional area. The result is to 
make a resistor with one quarter 
the resistance. 


f/A fat pipe has less resistance 
than a skinny pipe. 


g/A voltmeter is really an 
ammeter with an internal resistor. 
When we measure the voltage 
difference across a resistor, 1, we 
are really constructing a parallel 
resistance circuit, 2. 
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“ooo” 


where “...” means that the sum includes all the resistors. If all the 
resistors are identical, this becomes 


m-(8) 


‘Dependence of resistance on cross-sectional area example 12 
We have alluded briefly to the fact that an object’s electrical re- 
sistance depends on its size and shape, but now we are ready 
to begin making more mathematical statements about it. As sug- 
gested by figure e, increasing a resistors’s cross-sectional area is 
equivalent to adding more resistors in parallel, which will lead to 
an overall decrease in resistance. Any real resistor with straight, 
parallel sides can be sliced up into a large number of pieces, each 
with cross-sectional area of, say, 1 um*. The number, N, of such 
slices is proportional to the total cross-sectional area of the resis- 
tor, and by application of the result of the previous example we 
therefore find that the resistance of an object is inversely propor- 
tional to its cross-sectional area. 


An analogous relationship holds for water pipes, which is why 
high-flow trunk lines have to have large cross-sectional areas. To 
make lots of water (current) flow through a skinny pipe, we’d need 
an impractically large pressure (voltage) difference. 


‘Incorrect readings from a voltmeter example 13 
A voltmeter is really just an ammeter with an internal resistor, and 
we use a voltmeter in parallel with the thing that we’re trying to 
measure the voltage difference across. This means that any time 
we measure the voltage drop across a resistor, we’re essentially 
putting two resistors in parallel. The ammeter inside the voltmeter 
can be ignored for the purpose of analyzing what how current 
flows in the circuit, since it is essentially just some coiled-up wire 
with a very low resistance. 


1 R, NowA4fwe are carrying out this measurement on a resistor that is 


larger circuit, we have changed the behavior of the cir- 
cyit through our act of measuring. It is as though we had modified 
the circult by replacing the resistance A with the smaller equiva- 


leviiePesistance of R and FR, in parallel. It is for this reason that 
voltmeters are built with the largest possible internal resistance. 
As a numerical example, if we use a voltmeter with an internal 
TMaNCe of 1 MO to measure the voltage drop across a one- 
$tor, the equivalent resistance is 0.999999 O, which is 
different enough to make any difference. But if we tried to use 
the same voltmeter to measure the voltage drop across a 2— MO 
3 AV, resister, we would be reducing the resistance of that part of the 
C_~ sircuit by a factor of three, which would produce a drastic change 

avior of the whole circuit. 


he reason why you can’t use a voltmeter to measure the 
\oltage/ difference between two different points in mid-air, or between 
AW@, ands of a piece of wood. This is by no means a stupid thing to 
want to do, since the world around us is not a constant-voltage 


a reuniting path for the positive and negative charges 


he—pe(yiVie 
[tia have fen separated — through the voltmeter itself, which is 


a etfod conductor compared to the air. This reduces to zero the 
voltage difference we were trying to measure. 


general, a voltmeter that has been set up with an open circuit 
ery large resistance) between its probes is said to be “float- 

ing.” An old-fashioned analog voltmeter of the type described here 
h/1. A bakery ddvesefy@hbn left floating, the same as when it was sitting 


through two resisters EhSAHl°s “Hoating digital voltmeter usually shows an error 
There are three constant-voltag 


regions. 3. e three voltae 


differences atgo (ePSéries'resistances 
the meter crab-walks around t 


circuit without ABeineiscincical layouts are parallel and series, so a pair of 
crossing its tegstottseinresultiagh/1, is another of the most basic circuits we can 
voltages havendus Randominuation of charge, all the current that flows through 
signs that make. thertaddrbetl@lso flow through the other (as well as through the 
eos battery): 


The only way the information about the two resistance values is 
going to be useful is if we can apply Ohm’s law, which will relate the 
resistance of each resistor to the current flowing through it and the 
voltage difference across it. Figure h/2 shows the three constant- 
voltage areas. Voltage differences are more physically significant 
than voltages, so we define symbols for the voltage differences across 
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the two resistors in figure h/3. 


We have three constant-voltage areas, with symbols for the dif- 
ference in voltage between every possible pair of them. These three 
voltage differences must be related to each other. It is as though I 
tell you that Fred is a foot taller than Ginger, Ginger is a foot taller 
than Sally, and Fred is two feet taller than Sally. The information 
is redundant, and you really only needed two of the three pieces of 
data to infer the third. In the case of our voltage differences, we 
have 

|AV, | ale |AV3| = |AVoattery| 


The absolute value signs are because of the ambiguity in how we 
define our voltage differences. If we reversed the two probes of the 
voltmeter, we would get a result with the opposite sign. Digital 
voltmeters will actually provide a minus sign on the screen if the 
wire connected to the “V” plug is lower in voltage than the one 
connected to the “COM” plug. Analog voltmeters pin the needle 
against a peg if you try to use them to measure negative voltages, 
so you have to fiddle to get the leads connected the right way, and 
then supply any necessary minus sign yourself. 


Figure h/4 shows a standard way of taking care of the ambiguity 
in signs. For each of the three voltage measurements around the 
loop, we keep the same probe (the darker one) on the clockwise 
side. It is as though the voltmeter was sidling around the circuit 
like a crab, without ever “crossing its legs.” With this convention, 
the relationship among the voltage drops becomes 


AV, + AV2 = —AVbattery ’ 
or, in more symmetrical form, 
AV, 7 AV2 + AVoattery =0 


More generally, this is known as the loop rule for analyzing circuits: 


Assuming the standard convention for plus and minus signs, the 
sum of the voltage drops around any closed loop in a circuit must 
be zero. 


Looking for an exception to the loop rule would be like asking 
for a hike that would be downhill all the way and that would come 
back to its starting point! 


For the circuit we set out to analyze, the equation 


AV, + AV2 oF AVoattery =0 
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can now be rewritten by applying Ohm’s law to each resistor: 
TR, + InR2 + AVyattery = 0 

The currents are the same, so we can factor them out: 
I(Ri + Ro) + AViattery = 0 , 


and this is the same result we would have gotten if we had been 
analyzing a one-resistor circuit with resistance R, + Ro. Thus the 
equivalent resistance of resistors in series equals the sum of their 
resistances. 


Two lightbulbs in series example 14 
> If two identical lightbulos are placed in series, how do their 
brightnesses compare with the brightness of a single bulb? 


> Taken as a whole, the pair of bulbs act like a doubled resistance, 
so they will draw half as much current from the wall. Each bulb 
will be dimmer than a single bulb would have been. 


The total power dissipated by the circuit is /AV. The voltage drop 
across the whole circuit is the same as before, but the current is 
halved, so the two-bulb circuit draws half as much total power as 
the one-bulb circuit. Each bulb draws one-quarter of the normal 
power. 


Roughly speaking, we might expect this to result in one quarter 
the light being produced by each bulb, but in reality lightbulbs 
waste quite a high percentage of their power in the form of heat 
and wavelengths of light that are not visible (infrared and ultravi- 
olet). Less light will be produced, but it’s hard to predict exactly 
how much less, since the efficiency of the bulbs will be changed 
by operating them under different conditions. 


More than two equal resistances in series example 15 


By straightforward application of the divide-and-conquer technique 


discussed in the previous section, we find that the equivalent re- 
sistance of N identical resistances RF in series will be NR. 


Dependence of resistance on length example 16 
In the previous section, we proved that resistance is inversely 
proportional to cross-sectional area. By equivalent reason about 
resistances in series, we find that resistance is proportional to 
length. Analogously, it is harder to blow through a long straw than 
through a short one. 


Putting the two arguments together, we find that the resistance 
of an object with straight, parallel sides is given by 


R= (constant) -L/A 


The proportionality constant is called the resistivity, and it depends 
only on the substance of which the object is made. A resistivity 
measurement could be used, for instance, to help identify a sample 
of an unknown substance. 


i/ Example 14. 


Fa 
oe 


j/Doubling the length of a 
resistor is like putting two resis- 
tors in series. The resistance is 
doubled. 
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Choice of high voltage for power lines example 17 
Thomas Edison got involved in a famous technological contro- 
versy over the voltage difference that should be used for electrical 
power lines. At this time, the public was unfamiliar with electricity, 
and easily scared by it. The president of the United States, for 
instance, refused to have electrical lighting in the White House 
when it first became commercially available because he consid- 
ered it unsafe, preferring the known fire hazard of oil lamps to 
the mysterious dangers of electricity. Mainly as a way to over- 
come public fear, Edison believed that power should be transmit- 
ted using small voltages, and he publicized his opinion by giving 
demonstrations at which a dog was lured into position to be killed 
by a large voltage difference between two sheets of metal on the 
ground. (Edison’s opponents also advocated alternating current 
rather than direct current, and AC is more dangerous than DC as 
well. As we will discuss later, AC can be easily stepped up and 
down to the desired voltage level using a device called a trans- 
former.) 


Now if we want to deliver a certain amount of power P; to a 
load such as an electric lightbulb, we are constrained only by the 
equation P; = /AV,. We can deliver any amount of power we 
wish, even with a low voltage, if we are willing to use large cur- 
rents. Modern electrical distribution networks, however, use dan- 
gerously high voltage differences of tens of thousands of volts. 
Why did Edison lose the debate? 


It boils down to money. The electric company must deliver the 
amount of power P; desired by the customer through a transmis- 
sion line whose resistance Fz is fixed by economics and geogra- 
phy. The same current flows through both the load and the trans- 
mission line, dissipating power usefully in the former and waste- 
fully in the latter. The efficiency of the system is 


power paid for by the customer 


fficiency = 
eecieney = power paid for by the utility 


Putting ourselves in the shoes of the electric company, we wish 
to get rid of the variable P7, since it is something we control only 
indirectly by our choice of AV; and /. Substituting P; = /AV;, we 
find 


efficiency = 
14 Avr 


PL 


We assume the transmission line (but not necessarily the load) is 


ohmic, so substituting AV; = /R7 gives 


efficiency = PR, 

P. 
This quantity can clearly be maximized by making / as small as 
possible, since we will then be dividing by the smallest possible 
quantity on the bottom of the fraction. A low-current circuit can 
only deliver significant amounts of power if it uses high voltages, 
which is why electrical transmission systems use dangerous high 
voltages. 


Getting killed by your ammeter example 18 
As with a voltmeter, an ammeter can give erroneous readings if it 
is used in such a way that it changes the behavior the circuit. An 
ammeter is used in series, so if it is used to measure the current 
through a resistor, the resistor’s value will effectively be changed 
to R+ R,, where R, is the resistance of the ammeter. Ammeters 
are designed with very low resistances in order to make it unlikely 
that R + Ra will be significantly different from R. 


In fact, the real hazard is death, not a wrong reading! Virtually 
the only circuits whose resistances are significantly less than that 
of an ammeter are those designed to carry huge currents. An 
ammeter inserted in such a circuit can easily melt. When | was 
working at a laboratory funded by the Department of Energy, we 
got periodic bulletins from the DOE safety office about serious ac- 
cidents at other sites, and they held a certain ghoulish fascination. 
One of these was about a DOE worker who was completely in- 
cinerated by the explosion created when he inserted an ordinary 
Radio Shack ammeter into a high-current circuit. Later estimates 
showed that the heat was probably so intense that the explosion 
was a ball of plasma — a gas so hot that its atoms have been 
ionized. 


k / Example 19. 
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A complicated circuit example 19 
> All seven resistors in the left-hand panel of figure k are identi- 
cal. Initially, the switch S is open as shown in the figure, and the 
current through resistor A is |. The switch is then closed. Find 
the current through resistor B, after the switch is closed, in terms 
of Io. 


> The second panel shows the circuit redrawn for simplicity, in the 
initial condition with the switch open. When the switch is open, no 
current can flow through the central resistor, so we may as well 
ignore it. I’ve also redrawn the junctions, without changing what’s 
connected to what. This is the kind of mental rearranging that 
you'll eventually learn to do automatically from experience with 
analyzing circuits. The redrawn version makes it easier to see 
what's happening with the current. Charge is conserved, so any 
charge that flows past point 1 in the circuit must also flow past 
points 2 and 3. This would have been harder to reason about by 
applying the junction rule to the original version, which appears 
to have nine separate junctions. 


In the new version, it’s also clear that the circuit has a great deal 
of symmetry. We could flip over each parallel pair of identical re- 
sistors without changing what’s connected to what, so that makes 
it clear that the voltage drops and currents must be equal for the 
members of each pair. We can also prove this by using the loop 
rule. The loop rule says that the two voltage drops in loop 4 must 
be equal, and similarly for loops 5 and 6. Since the resistors obey 
Ohm’s law, equal voltage drops across them also imply equal cur- 
rents. That means that when the current at point 1 comes to the 
top junction, exactly half of it goes through each resistor. Then 
the current reunites at 2, splits between the next pair, and so on. 
We conclude that each of the six resistors in the circuit experi- 
ences the same voltage drop and the same current. Applying the 
loop rule to loop 7, we find that the sum of the three voltage drops 
across the three left-hand resistors equals the battery’s voltage, 
V, so each resistor in the circuit experiences a voltage drop V/3. 
Letting R stand for the resistance of one of the resistors, we find 
that the current through resistor B, which is the same as the cur- 
rents through all the others, is given by lb) = V/3R. 


We now pass to the case where the switch is closed, as shown 
in the third panel. The battery’s voltage is the same as before, 
and each resistor’s resistance is the same, so we can still use the 
same symbols V and A for them. It is no longer true, however, 
that each resistor feels a voltage drop V/3. The equivalent resis- 
tance of the whole circuit is R/2+ R/3+ R/2 = 4R/3, so the total 
current drawn from the battery is 3V/4R. In the middle group 
of resistors, this current is split three ways, so the new current 
through B is (1/3)(3V/4R) = V/4R = 31,/4. 


Interpreting this result, we see that it comes from two effects that 
partially cancel. Closing the switch reduces the equivalent re- 
sistance of the circuit by giving charge another way to flow, and 
increases the amount of current drawn from the battery. Resis- 
tor B, however, only gets a 1/3 share of this greater current, not 
1/2. The second effect turns out to be bigger than the first, and 
therefore the current through resistor B is lessened over all. 


Discussion Question 


A We have stated the loop rule in a symmetric form where a series 
of voltage drops adds up to zero. To do this, we had to define a standard 
way of connecting the voltmeter to the circuit so that the plus and minus 
signs would come out right. Suppose we wish to restate the junction rule 
ina similar symmetric way, so that instead of equating the current coming 
in to the current going out, it simply states that a certain sum of currents at 
a junction adds up to zero. What standard way of inserting the ammeter 
would we have to use to make this work? 


This chapter is summarized on page 900. Notation and terminology 
are tabulated on pages 885-886. 
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Problems 
The symbols Vv, “, etc. are explained on page 505. 


1 In a wire carrying a current of 1.0 pA, how long do you have 
to wait, on the average, for the next electron to pass a given point? 
Express your answer in units of microseconds. 

> Solution, p. 881 © 


2 Referring back to our old friend the neuron from problem 1 
on page 461, let’s now consider what happens when the nerve is 
stimulated to transmit information. When the blob at the top (the 
cell body) is stimulated, it causes Na* ions to rush into the top of 
the tail (axon). This electrical pulse will then travel down the axon, 
like a flame burning down from the end of a fuse, with the Na* ions 
at each point first going out and then coming back in. If 10!° Nat 
ions cross the cell membrane in 0.5 ms, what amount of current is 
created? 


Vv Oo 


3 If a typical light bulb draws about 900 mA from a 110-V 
household circuit, what is its resistance? (Don’t worry about the 


fact that it’s alternating current.) Vv 
4 (a) Express the power dissipated by a resistor in terms of R 
and AV only, eliminating I. Vv 


(b) Electrical receptacles in your home are mostly 110 V, but cir- 
cuits for electric stoves, air conditioners, and washers and driers are 
usually 220 V. The two types of circuits have differently shaped re- 
ceptacles. Suppose you rewire the plug of a drier so that it can be 
plugged in to a 110 V receptacle. The resistor that forms the heat- 
ing element of the drier would normally draw 200 W. How much 
power does it actually draw now? Vv oo 


5 Lightning discharges a cloud during an electrical storm. Sup- 
pose that the current in the lightning bolt varies with time as I = bt, 
where 0 is a constant. Find the cloud’s charge as a function of time. 


J 


6 A resistor has a voltage difference AV across it, causing a 
current I to flow. 

(a) Find an equation for the power it dissipates as heat in terms of 
the variables J and R only, eliminating AV. Vv 
(b) If an electrical line coming to your house is to carry a given 
amount of current, interpret your equation from part a to explain 
whether the wire’s resistance should be small, or large. a 


7 In AM (amplitude-modulated) radio, an audio signal f(t) is 
multiplied by a sine wave sinwt in the megahertz frequency range. 
For simplicity, let’s imagine that the transmitting antenna is a whip, 
and that charge goes back and forth between the top and bottom. 
Suppose that, during a certain time interval, the audio signal varies 
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linearly with time, giving a charge q = (a + bt) sinwt at the top of 
the whip and —q at the bottom. Find the current as a function of 
time. Vv Oo 


8 As discussed in the text, when a conductor reaches an equi- 
librium where its charge is at rest, there is always zero electric force 
on a charge in its interior, and any excess charge concentrates itself 
on the surface. The surface layer of charge arranges itself so as to 
produce zero total force at any point in the interior. (Otherwise the 
free charge in the interior could not be at rest.) Suppose you have 
a teardrop-shaped conductor like the one shown in the figure. Since 
the teardrop is a conductor, there are free charges everywhere inside 
it, but consider a free charged particle at the location shown with 
a white circle. Explain why, in order to produce zero force on this 
particle, the surface layer of charge must be denser in the pointed 
part of the teardrop. 

Remark: Similar reasoning shows why Benjamin Franklin used a sharp tip when 
he invented the lightning rod. The charged stormclouds induce positive and 
negative charges to move to opposite ends of the rod. At the pointed upper end 
of the rod, the charge tends to concentrate at the point, and this charge attracts 
the lightning. The same effect can sometimes be seen when a scrap of aluminum 
foil is inadvertently put in a microwave oven. Modern experiments (Moore et 
al., Journal of Applied Meteorology 39 (1999) 593) show that although a sharp 
tip is best at starting a spark, a more moderate curve, like the right-hand tip of 


the teardrop in this problem, is better at successfully sustaining the spark for 
long enough to connect a discharge to the clouds. ~ 


9 Use the result of problem ?? on page ?? to find an equation for 
the voltage at a point in space at a distance r from a point charge 
Q. (Take your V = 0 distance to be anywhere you like.) Vv 


10 Today, even a big luxury car like a Cadillac can have an 
electrical system that is relatively low in power, since it doesn’t 
need to do much more than run headlights, power windows, etc. 
In the near future, however, manufacturers plan to start making 
cars with electrical systems about five times more powerful. This 
will allow certain energy-wasting parts like the water pump to be 
run on electrical motors and turned off when they’re not needed 
— currently they’re run directly on shafts from the motor, so they 
can’t be shut off. It may even be possible to make an engine that 
can shut off at a stoplight and then turn back on again without 
cranking, since the valves can be electrically powered. Current cars’ 
electrical systems have 12-volt batteries (with 14-volt chargers), but 
the new systems will have 36-volt batteries (with 42-volt chargers). 
(a) Suppose the battery in a new car is used to run a device that 
requires the same amount of power as the corresponding device in 
the old car. Based on the sample figures above, how would the 
currents handled by the wires in one of the new cars compare with 
the currents in the old ones? 

(b) The real purpose of the greater voltage is to handle devices that 
need more power. Can you guess why they decided to change to 36- 
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Problems 13 and 14. 


positive , negative 


Problem 16. 


volt batteries rather than increasing the power without increasing 
the voltage? Vv = 


11 We have referred to resistors dissipating heat, i.e., we have 
assumed that P = IAV is always greater than zero. Could [AV 
come out to be negative for a resistor? If so, could one make a 
refrigerator by hooking up a resistor in such a way that it absorbed 
heat instead of dissipating it? & 


12 What resistance values can be created by combining a 1 kQ 
resistor and a 10 kO resistor? > Solution, p. 881 


13 The figure shows a circuit containing five lightbulbs con- 
nected to a battery. Suppose you’re going to connect one probe of a 
voltmeter to the circuit at the point marked with a dot. How many 
unique, nonzero voltage differences could you measure by connecting 
the other probe to other wires in the circuit? s 


14 The lightbulbs in the figure are all identical. If you were 
inserting an ammeter at various places in the circuit, how many 
unique currents could you measure? If you know that the current 
measurement will give the same number in more than one place, 
only count that as one unique current. & 


15 (a) You take an LP record out of its sleeve, and it acquires a 
static charge of 1 nC. You play it at the normal speed of 334 r.p.m., 
and the charge moving in a circle creates an electric current. What 
is the current, in amperes? Vv 
(b) Although the planetary model of the atom can be made to work 
with any value for the radius of the electrons’ orbits, more advanced 
models that we will study later in this course predict definite radii. 
If the electron is imagined as circling around the proton at a speed 
of 2.2 x 10° m/s, in an orbit with a radius of 0.05 nm, what electric 
current is created? vo 


16 The figure shows a simplified diagram of a device called a 
tandem accelerator, used for accelerating beams of ions up to speeds 
on the order of 1% of the speed of light. The nuclei of these ions 
collide with the nuclei of atoms in a target, producing nuclear re- 
actions for experiments studying the structure of nuclei. The outer 
shell of the accelerator is a conductor at zero voltage (i.e., the same 
voltage as the Earth). The electrode at the center, known as the 
“terminal,” is at a high positive voltage, perhaps millions of volts. 
Negative ions with a charge of —1 unit (i-e., atoms with one extra 
electron) are produced offstage on the right, typically by chemical 
reactions with cesium, which is a chemical element that has a strong 
tendency to give away electrons. Relatively weak electric and mag- 
netic forces are used to transport these —1 ions into the accelerator, 
where they are attracted to the terminal. Although the center of the 
terminal has a hole in it to let the ions pass through, there is a very 
thin carbon foil there that they must physically penetrate. Passing 
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through the foil strips off some number of electrons, changing the 
atom into a positive ion, with a charge of +n times the fundamental 
charge. Now that the atom is positive, it is repelled by the terminal, 
and accelerates some more on its way out of the accelerator. 

(a) Find the velocity, v, of the emerging beam of positive ions, in 
terms of n, their mass m, the terminal voltage V, and fundamental 
constants. Neglect the small change in mass caused by the loss of 
electrons in the stripper foil. 

(b) To fuse protons with protons, a minimum beam velocity of about 
11% of the speed of light is required. What terminal voltage would 
be needed in this case? Vv 
(c) In the setup described in part b, we need a target containing 
atoms whose nuclei are single protons, i.e., a target made of hydro- 
gen. Since hydrogen is a gas, and we want a foil for our target, we 
have to use a hydrogen compound, such as a plastic. Discuss what 
effect this would have on the experiment. Vv Oo 


17 ~+Wire is sold in a series of standard diameters, called “gauges.” 
The difference in diameter between one gauge and the next in the 
series is about 20%. How would the resistance of a given length of 
wire compare with the resistance of the same length of wire in the 
next gauge in the series? vo 


18 In the figure, the battery is 9 V. 


(a) What are the voltage differences across each light bulb? Vv 
(b) What current flows through each of the three components of the 
circuit? J 


(c) If a new wire is added to connect points A and B, how will the 
appearances of the bulbs change? What will be the new voltages 
and currents? Vv 
(d) Suppose no wire is connected from A to B, but the two bulbs 
are switched. How will the results compare with the results from 
the original setup as drawn? v Oo 


19 A student in a biology lab is given the following instruc- 
tions: “Connect the cerebral eraser (C.E.) and the neural depo- 
larizer (N.D.) in parallel with the power supply (P.S.). (Under no 
circumstances should you ever allow the cerebral eraser to come 
within 20 cm of your head.) Connect a voltmeter to measure the 
voltage across the cerebral eraser, and also insert an ammeter in 
the circuit so that you can make sure you don’t put more than 100 
mA through the neural depolarizer.” The diagrams show two lab 
groups’ attempts to follow the instructions. 

(a) Translate diagram 1 into a standard-style schematic. What is 
correct and incorrect about this group’s setup? 

(b) Do the same for diagram 2. z 


20 Referring back to problem 15 on page 463 about the sodium 
chloride crystal, suppose the lithium ion is going to jump from the 
gap it is occupying to one of the four closest neighboring gaps. 
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Which one will it jump to, and if it starts from rest, how fast will 
it be going by the time it gets there? (It will keep on moving and 
accelerating after that, but that does not concern us.) [Hint: The 
approach is similar to the one used for the other problem, but you 
want to work with voltage and electrical energy rather than force.| 


vo 


21 A 1.0 2 toaster and a 2.0 2 lamp are connected in parallel 
with the 110-V supply of your house. (Ignore the fact that the 
voltage is AC rather than DC.) 

(a) Draw a schematic of the circuit. 

(b) For each of the three components in the circuit, find the current 


passing through it and the voltage drop across it. v 
(c) Suppose they were instead hooked up in series. Draw a schematic 
and calculate the same things. Vv Oo 


22 The heating element of an electric stove is connected in series 
with a switch that opens and closes many times per second. When 
you turn the knob up for more power, the fraction of the time that 
the switch is closed increases. Suppose someone suggests a simpler 
alternative for controlling the power by putting the heating element 
in series with a variable resistor controlled by the knob. (With the 
knob turned all the way clockwise, the variable resistor’s resistance is 
nearly zero, and when it’s all the way counterclockwise, its resistance 
is essentially infinite.) (a) Draw schematics. (b) Why would the 
simpler design be undesirable? g 


23 You have a circuit consisting of two unknown resistors in 
series, and a second circuit consisting of two unknown resistors in 
parallel. 

(a) What, if anything, would you learn about the resistors in the 
series circuit by finding that the currents through them were equal? 
(b) What if you found out the voltage differences across the resistors 
in the series circuit were equal? 

(c) What would you learn about the resistors in the parallel circuit 
from knowing that the currents were equal? 

(d) What if the voltages in the parallel circuit were equal? fe 


24 How many different resistance values can be created by com- 
bining three unequal resistors? (Don’t count possibilities in which 
not all the resistors are used, i.e., ones in which there is zero current 
in one or more of them.) a 


25 Suppose six identical resistors, each with resistance R, are 
connected so that they form the edges of a tetrahedron (a pyramid 
with three sides in addition to the base, i.e., one less side than an 
Egyptian pyramid). What resistance value or values can be obtained 
by making connections onto any two points on this arrangement? 

> Solution, p. 881 & 


26 A person in a rural area who has no electricity runs an 
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extremely long extension cord to a friend’s house down the road so 
she can run an electric light. The cord is so long that its resistance, 
x, is not negligible. Show that the lamp’s brightness is greatest if 
its resistance, y, is equal to x. Explain physically why the lamp is 


dim for values of y that are too small or too large. = 
27 All three resistors have the same resistance, R. Find the 
three unknown currents in terms of Vj, V2, and R. vo 


Problem 27. 
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Problem 31. 


28 You are given a battery, a flashlight bulb, and a single piece 
of wire. Draw at least two configurations of these items that would 
result in lighting up the bulb, and at least two that would not light 
it. (Don’t draw schematics.) If you’re not sure what’s going on, 
borrow the materials from your instructor and try it. Note that the 
bulb has two electrical contacts: one is the threaded metal jacket, 
and the other is the tip (at the bottom in the figure). [Problem by 
Arnold Arons.| Eg 


29 The figure shows a simplified diagram of an electron gun such 
as the one that creates the electron beam in a TV tube. Electrons 
that spontaneously emerge from the negative electrode (cathode) 
are then accelerated to the positive electrode, which has a hole in it. 
(Once they emerge through the hole, they will slow down. However, 
if the two electrodes are fairly close together, this slowing down is a 
small effect, because the attractive and repulsive forces experienced 
by the electron tend to cancel.) 

(a) If the voltage difference between the electrodes is AV, what is 
the velocity of an electron as it emerges at B? Assume that its initial 
velocity, at A, is negligible, and that the velocity is nonrelativistic. 
(If you haven’t read ch. 7 yet, don’t worry about the remark about 
relativity.) Vv 
(b) Evaluate your expression numerically for the case where AV=10 
kV, and compare to the speed of light. If you’ve read ch. 7 already, 
comment on whether the assumption of nonrelativistic motion was 
justified. > Solution, p. 881 V o 


30 (a) Many battery-operated devices take more than one bat- 
tery. If you look closely in the battery compartment, you will see 
that the batteries are wired in series. Consider a flashlight circuit. 
What does the loop rule tell you about the effect of putting several 
batteries in series in this way? 

(b) The cells of an electric eel’s nervous system are not that differ- 
ent from ours — each cell can develop a voltage difference across 
it of somewhere on the order of one volt. How, then, do you think 
an electric eel can create voltages of thousands of volts between 
different parts of its body? E 


31 The figure shows two possible ways of wiring a flashlight with 
a switch. Both will serve to turn the bulb on and off, although the 
switch functions in the opposite sense. Why is method (1) prefer- 
able? a 
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32 You have to do different things with a circuit to measure 
current than to measure a voltage difference. Which would be more 
practical for a printed circuit board, in which the wires are actually 
strips of metal embedded inside the board? 

> Solution, p. 881 © 


33 ~The bulbs are all identical. Which one doesn’t light up? i& 


34 Each bulb has a resistance of one ohm. How much power is 
drawn from the one-volt battery? Vv 


35 The bulbs all have unequal resistances. Given the three 
currents shown in the figure, find the currents through bulbs A, B, 
C, and D. z 


36 A silk thread is uniformly charged by rubbing it with llama 
fur. The thread is then dangled vertically above a metal plate and 
released. As each part of the thread makes contact with the conduct- 
ing plate, its charge is deposited onto the plate. Since the thread is 
accelerating due to gravity, the rate of charge deposition increases 
with time, and by time t the cumulative amount of charge is q = ct?, 
where c is aconstant. (a) Find the current flowing onto the plate. V 
(b) Suppose that the charge is immediately carried away through a 
resistance R. Find the power dissipated as heat. Vv 


~ 


Key to symbols: 
easy . typical challenging Mdifficult © very difficult 
Vv An answer check is available at www.lightandmatter.com. 


A printed circuit board, like 
the kind referred to in problem 
32. 
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Exercises 
Exercise 9A: Voltage and Current 


1. How many different currents could you measure in this circuit? Make a prediction, and then 
try it. 


What do you notice? How does this make sense in terms of the roller coaster metaphor intro- 
duced in discussion question 9.1.3A on page 475? 


What is being used up in the resistor? 


2. By connecting probes to these points, how many ways could you measure a voltage? How 
many of them would be different numbers? Make a prediction, and then do it. 
A B 


Cc D 


What do you notice? Interpret this using the roller coaster metaphor, and color in parts of the 
circuit that represent constant voltages. 


3. The resistors are unequal. How many different voltages and currents can you measure? Make 
a prediction, and then try it. 


What do you notice? Interpret this using the roller coaster metaphor, and color in parts of the 
circuit that represent constant voltages. 
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Exercise 9B: The Loop and Junction Rules 
Apparatus: 

DC power supply 

multimeter 

resistors 
1. The junction rule 


Construct a circuit like this one, using the power supply as your voltage source. To make things 
more interesting, don’t use equal resistors. Use nice big resistors (say 100 kQ to 1 MQ) — 
this will ensure that you don’t burn up the resistors, and that the multimeter’s small internal 
resistance when used as an ammeter is negligible in comparison. 


Ry 


Insert your multimeter in the circuit to measure all three currents that you need in order to test 
the junction rule. 


2. The loop rule 


Now come up with a circuit to test the loop rule. Since the loop rule is always supposed to be 
true, it’s hard to go wrong here! Make sure you have at least three resistors in a loop, and make 
sure you hook in the power supply in a way that creates non-zero voltage differences across all 
the resistors. Measure the voltage differences you need to measure to test the loop rule. Here 
it is best to use fairly small resistances, so that the multimeter’s large internal resistance when 
used in parallel as a voltmeter will not significantly reduce the resistance of the circuit. Do not 
use resistances of less than about 100 Q, however, or you may blow a fuse or burn up a resistor. 
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Exercise 9C: Reasoning About Circuits 


The questions in this exercise can all be solved using some combination of the following ap- 
proaches: 


a) There is constant voltage throughout any conductor. 
b) Ohm’s law can be applied to any part of a circuit. 
c) Apply the loop rule. 

d) Apply the junction rule. 


In each case, discuss the question, decide what you think is the right answer, and then try the 
experiment. 


1. A wire is added in parallel with one bulb. 


2.4V 


Which reasoning is correct? 


e Each bulb still has 1.2 V across it, so both bulbs are still lit up. 


e All parts of a wire are at the same voltage, and there is now a wire connection from one 
side of the right-hand bulb to the other. The right-hand bulb has no voltage difference 
across it, so it goes out. 


2. The series circuit is changed as shown. 


L 


2.4V 2.4V 


Which reasoning is correct? 


e Each bulb now has its sides connected to the two terminals of the battery, so each now has 
2.4 V across it instead of 1.2 V. They get brighter. 


e Just as in the original circuit, the current goes through one bulb, then the other. It’s just 
that now the current goes in a figure-8 pattern. The bulbs glow the same as before. 
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3. A wire is added as shown to the original circuit. 


What is wrong with the following reasoning? 


The top right bulb will go out, because its two sides are now connected with wire, so there will 
be no voltage difference across it. The other three bulbs will not be affected. 


4. A wire is added as shown to the original circuit. 


What is wrong with the following reasoning? 


The current flows out of the right side of the battery. When it hits the first junction, some of 
it will go left and some will keep going up The part that goes up lights the top right bulb. The 
part that turns left then follows the path of least resistance, going through the new wire instead 
of the bottom bulb. The top bulb stays lit, the bottom one goes out, and others stay the same. 


5. What happens when one bulb is unscrewed, leaving an air gap? 
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Chapter 10 
Fields 


“Okay. Your duties are as follows: Get Breen. | don’t care how you get 
him, but get him soon. That faker! He posed for twenty years as a scientist 
without ever being apprehended. Well, I’m going to do some apprehend- 
ing that'll make all previous apprehending look like no apprehension at all. 
You with me?” 


“Yes,” said Battle, very much confused. “What's that thing you have?” 


“Piggy-back heat-ray. You transpose the air in its path into an unstable 
isotope which tends to carry all energy as heat. Then you shoot your juice 
light, or whatever along the isotopic path and you burn whatever's on the 
receiving end. You want a few?” 


“No,” said Battle. “I have my gats. What else have you got for offense 
and defense?” Underbottam opened a cabinet and proudly waved an 
arm. “Everything,” he said. 


“Disintegraters, heat-rays, bombs of every type. And impenetrable 
shields of energy, massive and portable. What more do | need?” 


From THE REVERSIBLE REVOLUTIONS by Cecil Corwin, Cosmic 
Stories, March 1941. Art by Morey, Bok, Kyle, Hunt, Forte. Copyright 
expired. 


10.1 Fields of Force 


Cutting-edge science readily infiltrates popular culture, though some- 
times in garbled form. The Newtonian imagination populated the 
universe mostly with that nice solid stuff called matter, which was 
made of little hard balls called atoms. In the early twentieth cen- 
tury, consumers of pulp fiction and popularized science began to 
hear of a new image of the universe, full of x-rays, N-rays, and 
Hertzian waves. What they were beginning to soak up through 
their skins was a drastic revision of Newton’s concept of a universe 
made of chunks of matter which happened to interact via forces. In 
the newly emerging picture, the universe was made of force, or, to 
be more technically accurate, of ripples in universal fields of force. 
Unlike the average reader of Cosmic Stories in 1941, you now pos- 
sess enough technical background to understand what a “force field” 
really is. 
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a/A bar magnet’s atoms are 
(partially) aligned. 


b/A bar magnet _ interacts 
with our magnetic planet. 


N S 
S N 
c / Magnets aligned north-south. 


d/The second magnet is re- 
versed. 


LOT: 
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e / Both magnets are reversed. 


10.1.1 Why fields? 
Time delays in forces exerted at a distance 


What convinced physicists that they needed this new concept of 
a field of force? Although we have been dealing mostly with elec- 
trical forces, let’s start with a magnetic example. (In fact the main 
reason I’ve delayed a detailed discussion of magnetism for so long 
is that mathematical calculations of magnetic effects are handled 
much more easily with the concept of a field of force.) First a little 
background leading up to our example. A bar magnet, a, has an axis 
about which many of the electrons’ orbits are oriented. The earth 
itself is also a magnet, although not a bar-shaped one. The interac- 
tion between the earth-magnet and the bar magnet, b, makes them 
want to line up their axes in opposing directions (in other words 
such that their electrons rotate in parallel planes, but with one set 
rotating clockwise and the other counterclockwise as seen looking 
along the axes). On a smaller scale, any two bar magnets placed 
near each other will try to align themselves head-to-tail, c. 


Now we get to the relevant example. It is clear that two people 
separated by a paper-thin wall could use a pair of bar magnets to 
signal to each other. Each person would feel her own magnet trying 
to twist around in response to any rotation performed by the other 
person’s magnet. The practical range of communication would be 
very short for this setup, but a sensitive electrical apparatus could 
pick up magnetic signals from much farther away. In fact, this is 
not so different from what a radio does: the electrons racing up 
and down the transmitting antenna create forces on the electrons 
in the distant receiving antenna. (Both magnetic and electric forces 
are involved in real radio signals, but we don’t need to worry about 
that yet.) 


A question now naturally arises as to whether there is any time 
delay in this kind of communication via magnetic (and electric) 
forces. Newton would have thought not, since he conceived of 
physics in terms of instantaneous action at a distance. We now 
know, however, that there is such a time delay. If you make a 
long-distance phone call that is routed through a communications 
satellite, you should easily be able to detect a delay of about half a 
second over the signal’s round trip of 50,000 miles. Modern measure- 
ments have shown that electric, magnetic, and gravitational forces 
all travel at the speed of light, 3 x 10° m/s. (In fact, we will soon 
discuss how light itself is made of electricity and magnetism. ) 


If it takes some time for forces to be transmitted through space, 
then apparently there is some thing that travels through space. The 
fact that the phenomenon travels outward at the same speed in all 
directions strongly evokes wave metaphors such as ripples on a pond. 
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More evidence that fields of force are real: they carry energy. 


The smoking-gun argument for this strange notion of traveling 
force ripples comes from the fact that they carry energy. 


First suppose that the person holding the bar magnet on the 
right decides to reverse hers, resulting in configuration d. She had 
to do mechanical work to twist it, and if she releases the magnet, 
energy will be released as it flips back to c. She has apparently stored 
energy by going from c to d. So far everything is easily explained 
without the concept of a field of force. 


But now imagine that the two people start in position c and 
then simultaneously flip their magnets extremely quickly to position 
e, keeping them lined up with each other the whole time. Imagine, 
for the sake of argument, that they can do this so quickly that 
each magnet is reversed while the force signal from the other is 
still in transit. (For a more realistic example, we’d have to have 
two radio antennas, not two magnets, but the magnets are easier 
to visualize.) During the flipping, each magnet is still feeling the 
forces arising from the way the other magnet used to be oriented. 
Even though the two magnets stay aligned during the flip, the time 
delay causes each person to feel resistance as she twists her magnet 
around. How can this be? Both of them are apparently doing 
mechanical work, so they must be storing magnetic energy somehow. 
But in the traditional Newtonian conception of matter interacting 
via instantaneous forces at a distance, interaction energy arises from 
the relative positions of objects that are interacting via forces. If 
the magnets never changed their orientations relative to each other, 
how can any magnetic energy have been stored? 


The only possible answer is that the energy must have gone 
into the magnetic force ripples crisscrossing the space between the 
magnets. Fields of force apparently carry energy across space, which 
is strong evidence that they are real things. 


This is perhaps not as radical an idea to us as it was to our 
ancestors. We are used to the idea that a radio transmitting antenna 
consumes a great deal of power, and somehow spews it out into the 
universe. A person working around such an antenna needs to be 
careful not to get too close to it, since all that energy can easily 
cook flesh (a painful phenomenon known as an “RF burn”). 


10.1.2 The gravitational field 


Given that fields of force are real, how do we define, measure, 
and calculate them? A fruitful metaphor will be the wind patterns 
experienced by a sailing ship. Wherever the ship goes, it will feel a 
certain amount of force from the wind, and that force will be in a 
certain direction. The weather is ever-changing, of course, but for 
now let’s just imagine steady wind patterns. Definitions in physics 
are operational, i.e., they describe how to measure the thing being 
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certain area of the ocean could 
be charted in a “sea of arrows” 


representation like this. 


Each 


arrow represents both the wind’s 
strength and its direction at a 
certain location. 
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defined. The ship’s captain can measure the wind’s “field of force” 
by going to the location of interest and determining both the direc- 
tion of the wind and the strength with which it is blowing. Charting 
all these measurements on a map leads to a depiction of the field of 
wind force like the one shown in the figure. This is known as the 
“sea of arrows” method of visualizing a field. 


Now let’s see how these concepts are applied to the fundamental 
force fields of the universe. We’ll start with the gravitational field, 
which is the easiest to understand. As with the wind patterns, 
we’ll start by imagining gravity as a static field, even though the 
existence of the tides proves that there are continual changes in the 
gravity field in our region of space. When the gravitational field was 
introduced in chapter 2, I avoided discussing its direction explicitly, 
but defining it is easy enough: we simply go to the location of 
interest and measure the direction of the gravitational force on an 
object, such as a weight tied to the end of a string. 


In chapter 2, I defined the gravitational field in terms of the en- 
ergy required to raise a unit mass through a unit distance. However, 
I’m going to give a different definition now, using an approach that 
will be more easily adapted to electric and magnetic fields. This 
approach is based on force rather than energy. We couldn’t carry 
out the energy-based definition without dividing by the mass of the 
object involved, and the same is true for the force-based definition. 
For example, gravitational forces are weaker on the moon than on 
the earth, but we cannot specify the strength of gravity simply by 
giving a certain number of newtons. The number of newtons of 
gravitational force depends not just on the strength of the local 
gravitational field but also on the mass of the object on which we’re 
testing gravity, our “test mass.” A boulder on the moon feels a 
stronger gravitational force than a pebble on the earth. We can get 
around this problem by defining the strength of the gravitational 
field as the force acting on an object, divided by the object’s mass: 


The gravitational field vector, g, at any location in space is found 
by placing a test mass m; at that point. The field vector is then 
given by g = F/m, where F is the gravitational force on the test 
mass. 


We now have three ways of representing a gravitational field. 
The magnitude of the gravitational field near the surface of the 
earth, for instance, could be written as 9.8 N/kg, 9.8 J/kg -m, or 
9.8 m/s?. If we already had two names for it, why invent a third? 
The main reason is that it prepares us with the right approach for 
defining other fields. 


The most subtle point about all this is that the gravitational 
field tells us about what forces would be exerted on a test mass by 
the earth, sun, moon, and the rest of the universe, if we inserted a 
test mass at the point in question. The field still exists at all the 
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places where we didn’t measure it. 


Gravitational field of the earth example 1 
> What is the magnitude of the earth’s gravitational field, in terms 
of its mass, M, and the distance r from its center? 


> Substituting |F| = GMm;/r? into the definition of the gravitational 
field, we find |g) = GM/r?. This expression could be used for 
the field of any spherically symmetric mass distribution, since the 
equation we assumed for the gravitational force would apply in 
any such case. 


Sources and sinks 


If we make a sea-of-arrows picture of the gravitational fields 
surrounding the earth, g, the result is evocative of water going down 
a drain. For this reason, anything that creates an inward-pointing 
field around itself is called a sink. The earth is a gravitational sink. 
The term “source” can refer specifically to things that make outward 
fields, or it can be used as a more general term for both “outies” 
and “innies.” However confusing the terminology, we know that 
gravitational fields are only attractive, so we will never find a region 
of space with an outward-pointing field pattern. 


Knowledge of the field is interchangeable with knowledge of its 
sources (at least in the case of a static, unchanging field). If aliens 
saw the earth’s gravitational field pattern they could immediately 
infer the existence of the planet, and conversely if they knew the 
mass of the earth they could predict its influence on the surrounding 
gravitational field. 


Superposition of indexsuperposition of fieldsindexfields!superposition 
offields 


A very important fact about all fields of force is that when there 
is more than one source (or sink), the fields add according to the 
rules of vector addition. The gravitational field certainly will have 
this property, since it is defined in terms of the force on a test 
mass, and forces add like vectors. Superposition is an important 
characteristics of waves, so the superposition property of fields is 
consistent with the idea that disturbances can propagate outward 
as waves in a field. 
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g/The gravitational field sur- 
rounding a clump of mass such 
as the earth. 
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h/The gravitational fields of 
the earth and moon superpose. 
Note how the fields cancel at one 
point, and how there is no bound- 
ary between the interpenetrating 
fields surrounding the two bodies. 
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‘Reduction in gravity on lo due to Jupiter's gravity example 2 
> The average gravitational field on Jupiter’s moon lo is 1.81 N/kg. 
By how much is this reduced when Jupiter is directly overhead? 
lo’s orbit has a radius of 4.22 x 10® m, and Jupiter's mass is 
1.899 x 10°” kg. 


> By the shell theorem, we can treat the Jupiter as if its mass was 
all concentrated at its center, and likewise for lo. If we visit lo and 
land at the point where Jupiter is overhead, we are on the same 
line as these two centers, so the whole problem can be treated 
one-dimensionally, and vector addition is just like scalar addition. 
Let’s use positive numbers for downward fields (toward the center 
of lo) and negative for upward ones. Plugging the appropriate 
data into the expression derived in example 1, we find that the 
Jupiter’s contribution to the field is —0.71 N/kg. Superposition 
says that we can find the actual gravitational field by adding up 
the fields created by lo and Jupiter: 1.81 — 0.71 N/kg = 1.1 N/kg. 
You might think that this reduction would create some spectacular 
effects, and make lo an exciting tourist destination. Actually you 
would not detect any difference if you flew from one side of lo 
to the other. This is because your body and lo both experience 
Jupiter's gravity, so you follow the same orbital curve through the 
space around Jupiter. 


i/The part of the LIGO grav- 
ity wave detector at Hanford Nu- 
clear Reservation, near Richland, 
Washington. The other half of the 
detector is in Louisiana. 


Gravitational waves 


A source that sits still will create a static field pattern, like a steel 
ball sitting peacefully on a sheet of rubber. A moving source will cre- 
ate a spreading wave pattern in the field, like a bug thrashing on the 
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surface of a pond. Although we have started with the gravitational 
field as the simplest example of a static field, stars and planets do 
more stately gliding than thrashing, so gravitational waves are not 
easy to detect. Newton’s theory of gravity does not describe gravi- 
tational waves, but they are predicted by Einstein’s general theory 
of relativity. J.H. Taylor and R.A. Hulse were awarded the Nobel 
Prize in 1993 for giving indirect evidence that Einstein’s waves actu- 
ally exist. They discovered a pair of exotic, ultra-dense stars called 
neutron stars orbiting one another very closely, and showed that 
they were losing orbital energy at the rate predicted by Einstein’s 
theory. 


A Caltech-MIT collaboration has built a pair of gravitational 
wave detectors called LIGO to search for more direct evidence of 
gravitational waves. Since they are essentially the most sensitive 
vibration detectors ever made, they are located in quiet rural areas, 
and signals will be compared between them to make sure that they 
were not due to passing trucks. The project began operating at full 
sensitivity in 2005, and is now able to detect a vibration that causes 
a change of 107!° m in the distance between the mirrors at the ends 
of the 4-km vacuum tunnels. This is a thousand times less than the 
size of an atomic nucleus! There is only enough funding to keep the 
detectors operating for a few more years, so the physicists can only 
hope that during that time, somewhere in the universe, a sufficiently 
violent cataclysm will occur to make a detectable gravitational wave. 
(More accurately, they want the wave to arrive in our solar system 
during that time, although it will have been produced millions of 
years before.) 


10.1.3 The electric field 
Definition 


The definition of the electric field is directly analogous to, and 
has the same motivation as, the definition of the gravitational field: 


The electric field vector, E, at any location in space is found by 
placing a test charge q at that point. The electric field vector is 
then given by E = F/q, where F is the electric force on the test 
charge. 


Charges are what create electric fields. Unlike gravity, which is 
always attractive, electricity displays both attraction and repulsion. 
A positive charge is a source of electric fields, and a negative one is 
a sink. 


The most difficult point about the definition of the electric field 
is that the force on a negative charge is in the opposite direction 
compared to the field. This follows from the definition, since di- 
viding a vector by a negative number reverses its direction. It’s as 
though we had some objects that fell upward instead of down. 
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self-check A 
Find an equation for the magnitude of the field of a single point charge 


ee Alias, Q. > Answer, p. 870 
‘Superposition of electric fields example 3 
> Charges q and —q are at a distance b from each other, as 

eq shown in the figure. What is the electric field at the point P, which 
lies at a third corner of the square? 

j/ Example 3. > The field at P is the vector sum of the fields that would have 
been created by the two charges independently. Let positive x be 
to the right and let positive y be up. 

y ¥ ¥ Negative charges have fields that point at them, so the charge 
\ \ | Z —q makes a field that points to the right, i.e., has a positive x 
~ .\ fU “ component. Using the answer to the self-check, we have 
eo Ye © ka 
E_gx = any 
a7 aT O — «~_ « ’ b2 
a 7 ecw Nox Egy =0 
; / ; [ | \ \ \ Note that if we had blindly ignored the absolute value signs and 
ft \ 4 plugged in —q to the equation, we would have incorrectly con- 
+4 i 1 4 cluded that the field went to the left. 
\ f 
‘ x | [ At By the Pythagorean theorem, the positive charge is at a distance 
N \ i 7 V2b from P, so the magnitude of its contribution to the field is 
‘ Z E = kq/2b*. Positive charges have fields that point away from 
SK walls Z 2 | 
PS a them, so the field vector is at an angle of 135°counterclockwise 
=> ae C) . oe oe from the x axis. 
ana Al adie 
al a ~ Egy = <1 cos 135° 
es :% <: ax = ppp OS 
kq 
oo =~ pang 

k/A dipole field. Electric fields kq_. - 

diverge from a positive charge Eqy = ape 135 

and converge on a_ negative kq 

charge. a 33/2 52 
The total field is 

kq 
a3/2 
= (1 =o ) B 
kq 
Ey 93/2 h2 


Dipoles 


The simplest set of sources that can occur with electricity but 
not with gravity is the dipole, consisting of a positive charge and a 


|/ A water molecule is a dipole. 
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negative charge with equal magnitudes. More generally, an electric 
dipole can be any object with an imbalance of positive charge on 
one side and negative on the other. A water molecule, 1, is a dipole 
because the electrons tend to shift away from the hydrogen atoms 
and onto the oxygen atom. 


Your microwave oven acts on water molecules with electric fields. 
Let us imagine what happens if we start with a uniform electric field, 
m/1, made by some external charges, and then insert a dipole, m/2, 
consisting of two charges connected by a rigid rod. The dipole dis- 
turbs the field pattern, but more important for our present purposes 
is that it experiences a torque. In this example, the positive charge 
feels an upward force, but the negative charge is pulled down. The 
result is that the dipole wants to align itself with the field, m/3. The 
microwave oven heats food with electrical (and magnetic) waves. 
The alternation of the torque causes the molecules to wiggle and in- 
crease the amount of random motion. The slightly vague definition 
of a dipole given above can be improved by saying that a dipole is 
any object that experiences a torque in an electric field. 


What determines the torque on a dipole placed in an externally 
created field? Torque depends on the force, the distance from the 
axis at which the force is applied, and the angle between the force 
and the line from the axis to the point of application. Let a dipole 
consisting of charges +q and —q separated by a distance @ be placed 
in an external field of magnitude |E]|, at an angle @ with respect to 
the field. The total torque on the dipole is 


t e 
= gE sin @ + 5flEl sin 0 
= q|B| sind 


(Note that even though the two forces are in opposite directions, 
the torques do not cancel, because they are both trying to twist 
the dipole in the same direction.) The quantity is called the dipole 
moment, notated D. (More complex dipoles can also be assigned 


m/1. <A uniform electric field created by some charges “off-stage.” 
2. A dipole is placed in the field. 3. The dipole aligns with the field. 
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a dipole moment — they are defined as having the same dipole 
moment as the two-charge dipole that would experience the same 
torque.) 


Employing a little more mathematical elegance, we can define a 
dipole moment. vector, 


D= Soar ) 


where r; is the position vector of the charge labeled by the index 
i. We can then write the torque in terms of a vector cross product 
(page 279), 


T=DxE 


No matter how we notate it, the definition of the dipole moment 
requires that we choose point from which we measure all the posi- 
tion vectors of the charges. However, in the commonly encountered 
special case where the total charge of the object is zero, the dipole 
moment is the same regardless of this choice. 


Dipole moment of a molecule of NaCl gas example 4 
> In a molecule of NaCl gas, the center-to-center distance be- 
tween the two atoms is about 0.6 nm. Assuming that the chlo- 
rine completely steals one of the sodium’s electrons, compute the 
magnitude of this molecule’s dipole moment. 


> The total charge is zero, so it doesn’t matter where we choose 
the origin of our coordinate system. For convenience, let’s choose 
it to be at one of the atoms, so that the charge on that atom 
doesn’t contribute to the dipole moment. The magnitude of the 
dipole moment is then 


D=(6 x 10~'° m)(e) 
= (6 x 10719 m)(1.6 x 10719 C) 
=1x 10° C-m 


Dipole moments as vectors example 5 
> The horizontal and vertical spacing between the charges in the 
figure is b. Find the dipole moment. 


> Let the origin of the coordinate system be at the leftmost charge. 


D=S_ air 
= (q)(0) + (—q)(bX) + (q)(bX + by) + (—q)(2bX) 
= —2bqX + bay 
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Alternative definition of the electric field 


The behavior of a dipole in an externally created field leads us 
to an alternative definition of the electric field: 


The electric field vector, FE, at any location in space is defined 
by observing the torque exerted on a test dipole D; placed there. 
The direction of the field is the direction in which the field tends 
to align a dipole (from — to +), and the field’s magnitude is |E| = 
T/D,sin 6. In other words, the field vector is the vector that satisfies 
the equation 7 = D; x E for any test dipole D; placed at that point 
in space. 


The main reason for introducing a second definition for the same 
concept is that the magnetic field is most easily defined using a 
similar approach. 


Discussion Questions 


A Inthe definition of the electric field, does the test charge need to be 
1 coulomb? Does it need to be positive? 


B Does a charged particle such as an electron or proton feel a force 
from its own electric field? 


Cc Is there an electric field surrounding a wall socket that has nothing 
plugged into it, or a battery that is just sitting on a table? 


D Ina flashlight powered by a battery, which way do the electric fields 
point? What would the fields be like inside the wires? Inside the filament 
of the bulb? 


E  Criticize the following statement: “An electric field can be represented 
by a sea of arrows showing how current is flowing.” 


F The field of a point charge, |E| = kQ/r?, was derived in a self-check. 
How would the field pattern of a uniformly charged sphere compare with 
the field of a point charge? 


G The interior of a perfect electrical conductor in equilibrium must 
have zero electric field, since otherwise the free charges within it would 
be drifting in response to the field, and it would not be in equilibrium. What 
about the field right at the surface of a perfect conductor? Consider the 
possibility of a field perpendicular to the surface or parallel to it. 


H Compare the dipole moments of the molecules and molecular ions 
shown in the figure. 


o / Discussion question H. 


I Small pieces of paper that have not been electrically prepared in 
any way can be picked up with a charged object such as a charged piece 
of tape. In our new terminology, we could describe the tape’s charge as 
inducing a dipole moment in the paper. Can a similar technique be used 
to induce not just a dipole moment but a charge? 


10.2 Voltage Related To Field 
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10.2.1 One dimension 

Voltage is electrical energy per unit charge, and electric field is 
force per unit charge. For a particle moving in one dimension, along 
the x axis, we can therefore relate voltage and field if we start from 
the relationship between interaction energy and force, 


dU = —F,.dx ; 
and divide by charge, 
du — —_"dr F 
q qd 
giving 
dV = —-E,dz : 
or 
dV 
dz 7 


The interpretation is that a strong electric field occurs in a region 
of space where the voltage is rapidly changing. By analogy, a steep 
hillside is a place on the map where the altitude is rapidly changing. 


Field generated by an electric eel example 6 
> Suppose an electric eel is 1 m long, and generates a voltage 
difference of 1000 volts between its head and tail. What is the 
electric field in the water around it? 


> We are only calculating the amount of field, not its direction, so 
we ignore positive and negative signs. Subject to the possibly in- 
accurate assumption of a constant field parallel to the eel’s body, 
we have 


dV 
IE|= > 


AV 
Bae [assumption of constant field] 


= 1000 V/m 


Relating the units of electric field and voltage example 7 
From our original definition of the electric field, we expect it to 
have units of newtons per coulomb, N/C. The example above, 
however, came out in volts per meter, V/m. Are these inconsis- 
tent? Let’s reassure ourselves that this all works. In this kind 
of situation, the best strategy is usually to simplify the more com- 
plex units so that they involve only mks units and coulombs. Since 
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voltage is defined as electrical energy per unit charge, it has units 
of J/C: 


Vv wc 
m om 
J 
C-m 


To connect joules to newtons, we recall that work equals force 
times distance, so J = N- m, so 


As with other such difficulties with electrical units, one quickly 
begins to recognize frequently occurring combinations. 


Voltage associated with a point charge example 8 
> What is the voltage associated with a point charge? 


> As derived previously in self-check A on page 517, the field is 


The difference in voltage between two points on the same radius 
line is 


AV= 


In the general discussion above, x was just a generic name for 
distance traveled along the line from one point to the other, so in 
this case x really means r. 


The standard convention is to use 4, = co as a reference point, so 
that the voltage at any distance r from the charge is 


_kQ 
ae 


V 
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The interpretation is that if you bring a positive test charge closer 
to a positive charge, its electrical energy is increased; if it was 
released, it would spring away, releasing this as kinetic energy. 


self-check B 
Show that you can recover the expression for the field of a point charge 
by evaluating the derivative E, = —dV/dx. > Answer, p. 870 


10.2.2 Two or three dimensions 


The topographical map in figure a suggests a good way to visu- 
alize the relationship between field and voltage in two dimensions. 
Each contour on the map is a line of constant height; some of these 
are labeled with their elevations in units of feet. Height is related 
to gravitational energy, so in a gravitational analogy, we can think 
of height as representing voltage. Where the contour lines are far 
apart, as in the town, the slope is gentle. Lines close together indi- 
cate a steep slope. 


If we walk along a straight line, say straight east from the town, 
then height (voltage) is a function of the east-west coordinate x. 
Using the usual mathematical definition of the slope, and writing 
V for the height in order to remind us of the electrical analogy, the 
slope along such a line is dV/dz (the rise over the run). 


What if everything isn’t confined to a straight line? Water flows 
downhill. Notice how the streams on the map cut perpendicularly 
through the lines of constant height. 


It is possible to map voltages in the same way, as shown in 
figure b. The electric field is strongest where the constant-voltage 
curves are closest together, and the electric field vectors always point 
perpendicular to the constant-voltage curves. 


The one-dimensional relationship F = —dV/dx generalizes to 
three dimensions as follows: 


ES 


E, =-—— 


This can be notated as a gradient (page 213), 
E=VW , 


and if we know the field and want to find the voltage, we can use a 


line integral, 
AV = | E-dr : 
Cc 


where the quantity inside the integral is a vector dot product. 


self-check C 

Imagine that figure a represents voltage rather than height. (a) Con- 
sider the stream the starts near the center of the map. Determine the 
positive and negative signs of dV/dx and dV/dy, and relate these to 
the direction of the force that is pushing the current forward against the 
resistance of friction. (bo) If you wanted to find a lot of electric charge on 
this map, where would you look? > Answer, p. 870 


a/A_ topographical map of 
Shelburne Falls, Mass. (USGS) 


b/The constant-voltage curves 
surrounding a point charge. Near 
the charge, the curves are so 
closely spaced that they blend 
together on this drawing due to 
the finite width with which they 
were drawn. Some electric fields 
are shown as arrows. 
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Figure c shows some examples of ways to visualize field and 
voltage patterns. 
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c/ Two-dimensional field and voltage patterns. Top: A uniformly 
charged rod. Bottom: A dipole. In each case, the diagram on the left 
shows the field vectors and constant-voltage curves, while the one on 
the right shows the voltage (up-down coordinate) as a function of x and 
y. Interpreting the field diagrams: Each arrow represents the field at the 
point where its tail has been positioned. For clarity, some of the arrows 
in regions of very strong field strength are not shown — they would be 
too long to show. Interpreting the constant-voltage curves: In regions 
of very strong fields, the curves are not shown because they would 
merge together to make solid black regions. Interpreting the perspective 
plots: Keep in mind that even though we’re visualizing things in three 
dimensions, these are really two-dimensional voltage patterns being 
represented. The third (up-down) dimension represents voltage, not 
position. 


526 Chapter 10 Fields 


10.3 Fields by Superposition 
10.3.1 Electric field of a continuous charge distribution 


Charge really comes in discrete chunks, but often it is mathe- 
matically convenient to treat a set of charges as if they were like a 
continuous fluid spread throughout a region of space. For example, 
a charged metal ball will have charge spread nearly uniformly all 
over its surface, and for most purposes it will make sense to ignore 
the fact that this uniformity is broken at the atomic level. The 
electric field made by such a continuous charge distribution is the 
sum of the fields created by every part of it. If we let the “parts” 
become infinitesimally small, we have a sum of an infinitely many 
infinitesimal numbers: an integral. If it was a discrete sum, as in 
example 3 on page 518, we would have a total electric field in the x 
direction that was the sum of all the x components of the individual 
fields, and similarly we’d have sums for the y and z components. In 
the continuous case, we have three integrals. Let’s keep it simple by 
starting with a one-dimensional example. 


Field of a uniformly charged rod example 9 
> A rod of length L has charge Q spread uniformly along it. Find 
the electric field at a point a distance d from the center of the rod, 
along the rod’s axis. 


> This is a one-dimensional situation, so we really only need to 
do a single integral representing the total field along the axis. We 
imagine breaking the rod down into short pieces of length dz, 
each with charge dq. Since charge is uniformly spread along the 
rod, we have dq = Adz, where A = Q/L (Greek lambda) is the 
charge per unit length, in units of coulombs per meter. Since 
the pieces are infinitesimally short, we can treat them as point 
charges and use the expression kdq/r? for their contributions to 
the field, where r = d — z is the distance from the charge at z to 
the point in which we are interested. 


E,= ays 


ne kvAdz 
—L/2 re 


+L/2 
= kn | a 
aye: (QZ) 


The integral can be looked up in a table, or reduced to an ele- 
mentary form by substituting a new variable for d — z. The result 


IS 
{ +L/2 
d—z -L/2 


kot A 1 
~ bE \G=2 d41)2 


< 
[oT ; 
< > 
a/ Example 9. 
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For large values of d, this expression gets smaller for two rea- 
sons: (1) the denominators of the fractions become large, and 
(2) the two fractions become nearly the same, and tend to can- 
cel out. This makes sense, since the field should get weaker as 
we get farther away from the charge. In fact, the field at large 
distances must approach kQ/d? (homework problem 2). 


It’s also interesting to note that the field becomes infinite at the 
ends of the rod, but is not infinite on the interior of the rod. Can 
you explain physically why this happens? 


Example 9 was one-dimensional. In the general three-dimensional 


case, we might have to integrate all three components of the field. 
However, there is a trick that lets us avoid this much complication. 
The voltage is a scalar, so we can find the voltage by doing just a 
single integral, then use the voltage to find the field. 
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Voltage, then field example 10 
> A rod of length L is uniformly charged with charge Q. Find the 
field at a point lying in the midplane of the rod at a distance R. 


> By symmetry, the field has only a radial component, Er, pointing 
directly away from the rod (or toward it for Q < 0). The brute-force 
approach, then, would be to evaluate the integral E = | |dE|cos 0, 
where dE is the contribution to the field from a charge dq at some 
point along the rod, and 0 is the angle dE makes with the radial 
line. 


It's easier, however, to find the voltage first, and then find the 
field from the voltage. Since the voltage is a scalar, we simply 
integrate the contribution dV from each charge dq, without even 
worrying about angles and directions. Let z be the coordinate 
that measures distance up and down along the rod, with z = 0 at 
the center of the rod. Then the distance between a point z on the 
rod and the point of interest is r = Vz? + R2, and we have 


+L/2 dz 
= kn i ee 
-L/2 Vz? + Re 


The integral can be looked up in a table, or evaluated using com- 


puter software: 


V=kAln (2+ 2+ Fe) Oe 


—L/2 
sip L/2+./L?/4+4+ R? 
7 —L/2+ ./12/4+ Re 


The expression inside the parentheses can be simplified a little. 
Leaving out some tedious algebra, the result is 


L L? 
v= 20am (3a ape) 
This can readily be differentiated to find the field: 


Joe 
dR 


= (—2ka) 


Ep = 


—L/2R? + (1/2)(1 + L?2/4R?)-1/2(—L? /2R8) 
L/2R + (14 L2/4R2)1/2 


or, after some simplification, 
kAL 


2/1 + L2/4R? 


For large values of A, the square root approaches one, and we 
have simply Eg ~ kAL/R® = kQ/R?. In other words, the field 
very far away is the same regardless of whether the charge is a 
point charge or some other shape like a rod. This is intuitively 
appealing, and doing this kind of check also helps to reassure 
one that the final result is correct. 


Ep = 


The preceding example, although it involved some messy alge- 
bra, required only straightforward calculus, and no vector operations 
at all, because we only had to integrate a scalar function to find the 
voltage. The next example is one in which we can integrate either 
the field or the voltage without too much complication. 


On-axis field of a ring of charge example 11 
> Find the voltage and field along the axis of a uniformly charged 
ring. 


> Integrating the voltage is straightforward. 
rae i HOG 
r 
Lf / dq 
Vb? + 2@ 


k 
= cease | 09 c/ Example 11. 
kQ 
Vb? + 2° 
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where Q is the total charge of the ring. This result could have 
been derived without calculus, since the distance r is the same 
for every point around the ring, i.e., the integrand is a constant. 
It would also be straightforward to find the field by differentiating 
this expression with respect to z (homework problem 10). 


Instead, let's see how to find the field by direct integration. By 
symmetry, the field at the point of interest can have only a com- 
ponent along the axis of symmetry, the z axis: 


E, =0 
Ey =0 


To find the field in the z direction, we integrate the z components 
contributed to the field by each infinitesimal part of the ring. 


| oe cos 8 ‘ 


Everything inside the integral is a constant, so we have 
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In all the examples presented so far, the charge has been con- 


fined to a one-dimensional line or curve. Although it is possible, for 
example, to put charge on a piece of wire, it is more common to 
encounter practical devices in which the charge is distributed over a 
two-dimensional surface, as in the flat metal plates used in Thom- 


son’s experiments. Mathematically, we can approach this type of 
calculation with the divide-and-conquer technique: slice the surface 
into lines or curves whose fields we know how to calculate, and then 
add up the contributions to the field from all these slices. In the 
limit where the slices are imagined to be infinitesimally thin, we 
have an integral. 
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Field of a uniformly charged disk example 12 
> A circular disk is uniformly charged. (The disk must be an in- 
sulator; if it was a conductor, then the repulsion of all the charge 
would cause it to collect more densely near the edge.) Find the 
field at a point on the axis, at a distance z from the plane of the 
disk. 


> We're given that every part of the disk has the same charge per 
unit area, so rather than working with Q, the total charge, it will 
be easier to use the charge per unit area, conventionally notated 
o (Greek sigma), o = Q/mb*. 


Since we already know the field due to a ring of charge, we can 
solve the problem by slicing the disk into rings, with each ring 
extending from r to r+dr. The area of such a ring equals its 
circumference multiplied by its width, i.e., 27rdr, so its charge is 
dq = 2nordr, and from the result of example 11, its contribution 
to the field is 


kzdq 
(r2 e ze)? 
anokzrdr 


(r2 + ze)? 


dE, = 


The total field is 


E, = [ aE; 


b 
rdr 
= 2nokz | ae 
0 (r2 +2?) 
r=b 


Vre +z? 


= 2710kz 


r=0 
Zz 
= 270k | 1 — ———— 
( ee) 


The result of example 12 has some interesting properties. First, 
we note that it was derived on the unspoken assumption of z > 0. 
By symmetry, the field on the other side of the disk must be equally 
strong, but in the opposite direction, as shown in figures e and g. 
Thus there is a discontinuity in the field at z = 0. In reality, the 
disk will have some finite thickness, and the switching over of the 
field will be rapid, but not discontinuous. 


At large values of z, i.e., z >> b, the field rapidly approaches the 
1/r? variation that we expect when we are so far from the disk that 
the disk’s size and shape cannot matter (homework problem 2). 


d/Example 12: geometry. 


e/Example 12: the field on 
both sides (for o > 0). 


E=0 E=0 


f/A capacitor consisting of 
two disks with opposite charges. 
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g / Example 12: variation of the field (o > 0). 


A practical application is the case of a capacitor, f, having two 
parallel circular plates very close together. In normal operation, the 
charges on the plates are opposite, so one plate has fields pointing 
into it and the other one has fields pointing out. In a real capacitor, 
the plates are a metal conductor, not an insulator, so the charge 
will tend to arrange itself more densely near the edges, rather than 
spreading itself uniformly on each plate. Furthermore, we have only 
calculated the on-axis field in example 12; in the off-axis region, 
each disk’s contribution to the field will be weaker, and it will also 
point away from the axis a little. But if we are willing to ignore 
these complications for the sake of a rough analysis, then the fields 
superimpose as shown in figure f: the fields cancel the outside of 
the capacitor, but between the plates its value is double that con- 
tributed by a single plate. This cancellation on the outside is a very 
useful property for a practical capacitor. For instance, if you look at 
the printed circuit board in a typical piece of consumer electronics, 
there are many capacitors, often placed fairly close together. If their 
exterior fields didn’t cancel out nicely, then each capacitor would in- 
teract with its neighbors in a complicated way, and the behavior of 
the circuit would depend on the exact physical layout, since the 
interaction would be stronger or weaker depending on distance. In 
reality, a capacitor does create weak external electric fields, but their 
effects are often negligible, and we can then use the lumped-circuit 
approximation, which states that each component’s behavior de- 
pends only on the currents that flow in and out of it, not on the 
interaction of its fields with the other components. 
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10.3.2 The field near a charged surface 


From a theoretical point of view, there is something even more 
intriguing about example 12: the magnitude of the field for small 
values of z (z < b) is E = 2mko, which doesn’t depend on 6 at 
all for a fixed value of o. If we made a disk with twice the radius, 
and covered it with the same number of coulombs per square meter 
(resulting in a total charge four times as great), the field close to 
the disk would be unchanged! That is, a flea living near the center 
of the disk, h, would have no way of determining the size of her flat 
“planet” by measuring the local field and charge density. (Only by 
leaping off the surface into outer space would she be able to measure 
fields that were dependent on b. If she traveled very far, to z > 8, 
she would be in the region where the field is well approximated by 
|E|  kQ/2? = krb?a/2z*, which she could solve for b.) 


What is the reason for this surprisingly simple behavior of the 
field? Is it a piece of mathematical trivia, true only in this particular 
case? What if the shape was a square rather than a circle? In other 
words, the flea gets no information about the size of the disk from 
measuring FE, since E = 2mko, independent of b, but what if she 
didn’t know the shape, either? If the result for a square had some 
other geometrical factor in front instead of 27, then she could tell 
which shape it was by measuring F. The surprising mathematical 
fact, however, is that the result for a square, indeed for any shape 
whatsoever, is FE = 2ack. It doesn’t even matter whether the sur- 
face is flat or warped, or whether the density of charge is different 
at parts of the surface which are far away compared to the flea’s 
distance above the surface. 


This universal FE, = 2ako field perpendicular to a charged 
surface can be proved mathematically based on Gauss’s law! (sec- 
tion 10.6), but we can understand what’s happening on qualitative 
grounds. Suppose on night, while the flea is asleep, someone adds 
more surface area, also positively charged, around the outside edge 
of her disk-shaped world, doubling its radius. The added charge, 
however, has very little effect on the field in her environment, as 
long as she stays at low altitudes above the surface. As shown in 
figure i, the new charge to her west contributes a field, T, that is 
almost purely “horizontal” (i.e., parallel to the surface) and to the 
east. It has a negligible upward component, since the angle is so 
shallow. This new eastward contribution to the field is exactly can- 
celed out by the westward field, S, created by the new charge to 
her east. There is likewise almost perfect cancellation between any 
other pair of opposite compass directions. 


A similar argument can be made as to the shape-independence of 
the result, as long as the shape is symmetric. For example, suppose 
that the next night, the tricky real estate developers decide to add 


‘rhymes with “mouse” 


h/Close to the surface, the 
relationship between E and o 
is a fixed one, regardless of 
the geometry. The flea can't 
determine the size or shape of 
her world by comparing E and o. 


i/ Fields contributed by nearby 
parts of the surface, P, Q, and R, 
contribute to E,. Fields due to 
distant charges, S, and T, have 
very small contributions to E, 
because of their shallow angles. 
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corners to the disk and transform it into a square. Each corner’s 
contribution to the field measured at the center is canceled by the 
field due to the corner diagonally across from it. 


What if the flea goes on a trip away from the center of the disk? 
The perfect cancellation of the “horizontal” fields contributed by 
distant charges will no longer occur, but the “vertical” field (i.e., the 
field perpendicular to the surface) will still be E, = 2rko, where a is 
the local charge density, since the distant charges can’t contribute to 
the vertical field. The same result applies if the shape of the surface 
is asymmetric, and doesn’t even have any well-defined geometric 
center: the component perpendicular to the surface is FE, = 2rko, 
but we may have E) #4 0. All of the above arguments can be made 
more rigorous by discussing mathematical limits rather than using 
words like “very small.” There is not much point in giving a rigorous 
proof here, however, since we will be able to demonstrate this fact 
as a corollary of Gauss’ Law in section 10.6. The result is as follows: 


At a point lying a distance z from a charged surface, the com- 
ponent of the electric field perpendicular to the surface obeys 


lim EF, = 2rko ; 
z—0 


where o is the charge per unit area. This is true regardless of the 
shape or size of the surface. 


The field near a point, line, or surface charge example 13 
> Compare the variation of the electric field with distance, d, for 
small values of d in the case of a point charge, an infinite line of 
charge, and an infinite charged surface. 


> For a point charge, we have already found E « d~® for the 
magnitude of the field, where we are now using d for the quantity 
we would ordinarily notate as r. This is true for all values of d, 
not just for small d — it has to be that way, because the point 
charge has no size, so if E behaved differently for small and large 
d, there would be no way to decide what d had to be small or 
large relative to. 


For a line of charge, the result of example 10 is 
= kKAL 
d2,/1 + L?/4d2 


In the limit of d < L, the quantity inside the square root is domi- 
nated by the second term, and we have E « d~!. 


Finally, in the case of a charged surface, the result is simply E = 
2nok, or E x a. 


Notice the lovely simplicity of the pattern, as shown in figure j. A 
point is zero-dimensional: it has no length, width, or breadth. A 
line is one-dimensional, and a surface is two-dimensional. As the 
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dimensionality of the charged object changes from 0 to 1, and 
then to 2, the exponent in the near-field expression goes from 2 
to 1 to 0. 
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a/Two- oppositely charged 
capacitor plates are pulled apart. 


10.4 Energy In Fields 
10.4.1 Electric field energy 


Fields possess energy, as argued on page 513, but how much 
energy? The answer can be found using the following elegant ap- 
proach. We assume that the electric energy contained in an infinites- 
imal volume of space dv is given by dU. = f(E)dv, where f is some 
function, which we wish to determine, of the field E. It might seem 
that we would have no easy way to determine the function f, but 
many of the functions we could cook up would violate the symme- 
try of space. For instance, we could imagine f(E) = aE,, where a 
is some constant with the appropriate units. However, this would 
violate the symmetry of space, because it would give the y axis a 
different status from x and z. As discussed on page 210, if we wish 
to calculate a scalar based on some vectors, the dot product is the 
only way to do it that has the correct symmetry properties. If all 
we have is one vector, E, then the only scalar we can form is E - E, 
which is the square of the magnitude of the electric field vector. 


In principle, the energy function we are seeking could be pro- 
portional to E- E, or to any function computed from it, such as 
VE-E or (E- E)’. On physical grounds, however, the only possi- 
bility that works is E- E. Suppose, for instance, that we pull apart 
two oppositely charged capacitor plates, as shown in figure a. We 
are doing work by pulling them apart against the force of their elec- 
trical attraction, and this quantity of mechanical work equals the 
increase in electrical energy, U-. Using our previous approach to 
energy, we would have thought of U. as a quantity which depended 
on the distance of the positive and negative charges from each other, 
but now we’re going to imagine U, as being stored within the elec- 
tric field that exists in the space between and around the charges. 
When the plates are touching, their fields cancel everywhere, and 
there is zero electrical energy. When they are separated, there is still 
approximately zero field on the outside, but the field between the 
plates is nonzero, and holds some energy. Now suppose we carry 
out the whole process, but with the plates carrying double their 
previous charges. Since Coulomb’s law involves the product qiq2 of 
two charges, we have quadrupled the force between any given pair 
of charged particles, and the total attractive force is therefore also 
four times greater than before. This means that the work done in 
separating the plates is four times greater, and so is the energy U, 
stored in the field. The field, however, has merely been doubled at 
any given location: the electric field E, due to the positively charged 
plate is doubled, and similarly for the contribution E_ from the neg- 
ative one, so the total electric field EK, + E_ is also doubled. Thus 
doubling the field results in an electrical energy which is four times 
greater, i.e., the energy density must be proportional to the square 
of the field, dU. «x (E- E)dv. For ease of notation, we write this 
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as dU, «x E*dv, or dU. = aE?dv, where a is a constant of propor- 
tionality. Note that we never really made use of any of the details 
of the geometry of figure a, so the reasoning is of general validity. 
In other words, not only is dU. = aE?dv the function that works 
in this particular case, but there is every reason to believe that it 
would work in other cases as well. 


It now remains only to find a. Since the constant must be the 
same in all situations, we only need to find one example in which 
we can compute the field and the energy, and then we can deter- 
mine a. The situation shown in figure a is just about the easiest 
example to analyze. We let the square capacitor plates be uni- 
formly covered with charge densities +o and —o, and we write b 
for the lengths of their sides. Let h be the gap between the plates 
after they have been separated. We choose h < 8, so that the 
field experienced by the negative plate due to the positive plate 
is E, = 2rko. The charge of the negative plate is —cb?, so the 
magnitude of the force attracting it back toward the positive plate 
is (force) = (charge)(field) = 27ko?b?. The amount of work done 
in separating the plates is (work) = (force)(distance) = 27ka2b7h. 
This is the amount of energy that has been stored in the field be- 
tween the two plates, Ue = 2ako?b?h = 2nko7v, where v is the 
volume of the region between the plates. 


We want to equate this to U. = aE*v. (We can write U. and 
v rather than dU; and dv, since the field is constant in the region 
between the plates.) The field between the plates has contributions 
from both plates, EF = Ey + E_ = 4rko. (We only used half this 
value in the computation of the work done on the moving plate, 
since the moving plate can’t make a force on itself. Mathematically, 
each plate is in a region where its own field is reversing directions, 
so we can think of its own contribution to the field as being zero 
within itself.) We then have aB?v = a- 16m?k?o? - v, and setting 
this equal to Ue = 2rko?v from the result of the work computation, 
we find a = 1/87k. Our final result is as follows: 


The electric energy possessed by an electric field E occupying an 
infinitesimal volume of space dv is given by 


dU, = —~E?dv 


where E? = E-E is the square of the magnitude of the electric field. 


This is reminiscent of how waves behave: the energy content of 
a wave is typically proportional to the square of its amplitude. 
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self-check D 

We can think of the quantity dU,/dv as the energy density due to the 
electric field, i.e., the number of joules per cubic meter needed in order 
to create that field. (a) How does this quantity depend on the compo- 
nents of the field vector, E,, Ey, and E,? (b) Suppose we have a field 
with Ey, #0, Ey=0, and E,=0. What would happen to the energy density 
if we reversed the sign of E,? > Answer, p. 870 


A numerical example example 14 
> A capacitor has plates whose areas are 10~* m?, separated by 
a gap of 10-° m. A 1.5-volt battery is connected across it. How 
much energy is sucked out of the battery and stored in the elec- 
tric field between the plates? (A real capacitor typically has an 
insulating material between the plates whose molecules interact 
electrically with the charge in the plates. For this example, we'll 
assume that there is just a vacuum in between the plates. The 
plates are also typically rolled up rather than flat.) 


> To connect this with our previous calculations, we need to find 
the charge density on the plates in terms of the voltage we were 
given. Our previous examples were based on the assumption that 
the gap between the plates was small compared to the size of the 
plates. Is this valid here? Well, if the plates were square, then 
the area of 10~* m? would imply that their sides were 10-2 m in 
length. This is indeed very large compared to the gap of 10-° 
m, so this assumption appears to be valid (unless, perhaps, the 
plates have some very strange, long and skinny shape). 

Based on this assumption, the field is relatively uniform in the 
whole volume between the plates, so we can use a single symbol, 
E, to represent its magnitude, and the relation E = dV/dx is 
equivalent to E = AV/Ax = (1.5 V)/(gap) = 1.5 x 10° V/m. 
Since the field is uniform, we can dispense with the calculus, and 
replace dU, = (1/87k)E*dv with Us, = (1/87k)E*v. The volume 
equals the area multiplied by the gap, so we have 


Ue = (1/87k)E? (area)(gap) 
1 
~ 8x9 x 109 N-m2/C? 


(1.5 x 10° V/m)?(10~4 m?)(107° m) 


=1x107"J 
self-check E 
Show that the units in the preceding example really do work out to be 
joules. > Answer, p. 870 
Why k is on the bottom example 15 


It may also seem strange that the constant k is in the denomina- 
tor of the equation dU, = (1 /87tk)E@dv. The Coulomb constant k 
tells us how strong electric forces are, so shouldn’t it be on top? 


No. Consider, for instance, an alternative universe in which elec- 
tric forces are twice as strong as in ours. The numerical value of 
k is doubled. Because k is doubled, all the electric field strengths 
are doubled as well, which quadruples the quantity E*. In the ex- 
pression E*/87tk, we’ve quadrupled something on top and dou- 
bled something on the bottom, which makes the energy twice as 
big. That makes perfect sense. 


Potential energy of a pair of opposite charges example 16 
Imagine taking two opposite charges, b, that were initially far 
apart and allowing them to come together under the influence 
of their electrical attraction. 


According to our old approach, electrical energy is lost because 
the electric force did positive work as it brought the charges to- 
gether. (This makes sense because as they come together and 
accelerate it is their electrical energy that is being lost and con- 
verted to kinetic energy.) 


By the new method, we must ask how the energy stored in the 
electric field has changed. In the region indicated approximately 
by the shading in the figure, the superposing fields of the two 
charges undergo partial cancellation because they are in oppos- 
ing directions. The energy in the shaded region is reduced by 
this effect. In the unshaded region, the fields reinforce, and the 
energy is increased. 


It would be quite a project to do an actual numerical calculation of 
the energy gained and lost in the two regions (this is a case where 
the old method of finding energy gives greater ease of computa- 
tion), but it is fairly easy to convince oneself that the energy is 
less when the charges are closer. This is because bringing the 
charges together shrinks the high-energy unshaded region and 
enlarges the low-energy shaded region. 


A spherical capacitor example 17 
> A spherical capacitor, c, consists of two concentric spheres of 
radii a and b. Find the energy required to charge up the capacitor 
so that the plates hold charges +q and —q. 


> On page 102, | proved that for gravitational forces, the inter- 
action of a spherical shell of mass with other masses outside it 
is the same as if the shell’s mass was concentrated at its cen- 
ter. On the interior of such a shell, the forces cancel out exactly. 
Since gravity and the electric force both vary as 1/r?, the same 
proof carries over immediately to electrical forces. The magnitude 
of the outward electric field contributed by the charge +q of the 
central sphere is therefore 
IE,| = { 0, r<a 
- kq/r*?, r>a 

where r is the distance from the center. Similarly, the magnitude 
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b/ Example 16. 


-q 


c/Example B. Part of the 
outside sphere has been drawn 
as if it is transparent, in order to 
show the inside sphere. 
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of the inward field contributed by the outside sphere is 


(ae 0, r<b 
oe kg ir PSD 


In the region outside the whole capacitor, the two fields are equal 
in magnitude, but opposite in direction, so they cancel. We then 
have for the total field 


0, r<a 
[E\=< kq/r?, a<r<b 
0, r>b 


so to calculate the energy, we only need to worry about the region 
a<r<_b. The energy density in this region is 


dU, - ST 2 
dv 8nk 

kq@ 4 
Sn! 


This expression only depends on r, so the energy density is con- 
stant across any sphere of radius r. We can slice the region 
a <r < b into concentric spherical layers, like an onion, and 
the energy within one such layer, extending from r to r+dr is 


dU, 
dv 
dU. 
dv 
kq? 
Z ae 

kq? 
ee 


dU, = dv 


= (area of shell)(thickness of shell) 
r—*)(4nr?)(dr) 


r-?dr 


Integrating over all the layers to find the total energy, we have 


=-—-r 


Discussion Questions 


A_ The figure shows a positive charge in the gap between two capacitor 
plates. Compare the energy of the electric fields in the two cases. Does 
this agree with what you would have expected based on your knowledge 
of electrical forces? 


B_ The figure shows a spherical capacitor. In the text, the energy stored 
in its electric field is shown to be 


kq@® /1 1 
as ( = 5) 
What happens if the difference between b and ais very small? Does this 
make sense in terms of the mechanical work needed in order to separate 


the charges? Does it make sense in terms of the energy stored in the 
electric field? Should these two energies be added together? 


Similarly, discuss the cases of b + oo and a-> 0. 


C_Criticize the following statement: “A solenoid makes a charge in the 
space surrounding it, which dissipates when you release the energy.” 


D In example 16 on page 539, | argued that for the charges shown 
in the figure, the fields contain less energy when the charges are closer 
together, because the region of cancellation expanded, while the region 
of reinforcing fields shrank. Perhaps a simpler approach is to consider 
the two extreme possibilities: the case where the charges are infinitely 
far apart, and the one in which they are at zero distance from each other, 
i.e., right on top of each other. Carry out this reasoning for the case of 
(1) a positive charge and a negative charge of equal magnitude, (2) two 
positive charges of equal magnitude, (3) the gravitational energy of two 
equal masses. 


10.4.2 Gravitational field energy 


Example B depended on the close analogy between electric and 
gravitational forces. In fact, every argument, proof, and example 
discussed so far in this section is equally valid as a gravitational 
example, provided we take into account one fact: only positive mass 
exists, and the gravitational force between two masses is attractive. 
This is the opposite of what happens with electrical forces, which 
are repulsive in the case of two positive charges. As a consequence of 
this, we need to assign a negative energy density to the gravitational 
field! For a gravitational field, we have 


1 


2 
— gd 
ao : 


dU, = — 


where g? = g- g is the square of the magnitude of the gravitational 
field. 


10.4.3 Magnetic field energy 


So far we’ve only touched in passing on the topic of magnetic 
fields, which will deal with in detail in chapter 11. Magnetism is 
an interaction between moving charge and moving charge, i.e., be- 
tween currents and currents. Since a current has a direction in 
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space,” while charge doesn’t, we can anticipate that the mathemat- 
ical rule connecting a magnetic field to its source-currents will have 
to be completely different from the one relating the electric field 
to its source-charges. However, if you look carefully at the argu- 
ment leading to the relation dU./dv = E?/8rk, you'll see that. these 
mathematical details were only necessary to the part of the argu- 
ment in which we fixed the constant of proportionality. To establish 
dU./dv « E?, we only had to use three simple facts: 


e The field is proportional to the source. 
e Forces are proportional to fields. 


e Field contributed by multiple sources add like vectors. 


All three of these statements are true for the magnetic field as well, 
so without knowing anything more specific about magnetic fields — 
not even what units are used to measure them! — we can state 
with certainty that the energy density in the magnetic field is pro- 
portional to the square of the magnitude of the magnetic field. 


?Current is a scalar, since the definition J = dq/dt is the derivative of a scalar. 
However, there is a closely related quantity called the current density, J, which 
is a vector, and J is in fact the more fundamentally important quantity. 
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10.5 LRC Circuits 


The long road leading from the light bulb to the computer started 
with one very important step: the introduction of feedback into 
electronic circuits. Although the principle of feedback has been un- 
derstood and and applied to mechanical systems for centuries, and 
to electrical ones since the early twentieth century, for most of us 
the word evokes an image of Jimi Hendrix (or some more recent 
guitar hero) intentionally creating earsplitting screeches, or of the 
school principal doing the same inadvertently in the auditorium. In 
the guitar example, the musician stands in front of the amp and 
turns it up so high that the sound waves coming from the speaker 
come back to the guitar string and make it shake harder. This is 
an example of positive feedback: the harder the string vibrates, the 
stronger the sound waves, and the stronger the sound waves, the 
harder the string vibrates. The only limit is the power-handling 
ability of the amplifier. 


Negative feedback is equally important. Your thermostat, for 
example, provides negative feedback by kicking the heater off when 
the house gets warm enough, and by firing it up again when it 
gets too cold. This causes the house’s temperature to oscillate back 
and forth within a certain range. Just as out-of-control exponential 
freak-outs are a characteristic behavior of positive-feedback systems, 
oscillation is typical in cases of negative feedback. You have already 
studied negative feedback extensively in section 3.3 in the case of a 
mechanical system, although we didn’t call it that. 


10.5.1 Capacitance and inductance 


In a mechanical oscillation, energy is exchanged repetitively be- 
tween potential and kinetic forms, and may also be siphoned off in 
the form of heat dissipated by friction. In an electrical circuit, re- 
sistors are the circuit elements that dissipate heat. What are the 
electrical analogs of storing and releasing the potential and kinetic 
energy of a vibrating object? When you think of energy storage in 
an electrical circuit, you are likely to imagine a battery, but even 
rechargeable batteries can only go through 10 or 100 cycles before 
they wear out. In addition, batteries are not able to exchange en- 
ergy on a short enough time scale for most applications. The circuit 
in a musical synthesizer may be called upon to oscillate thousands 
of times a second, and your microwave oven operates at gigahertz 
frequencies. Instead of batteries, we generally use capacitors and 
inductors to store energy in oscillating circuits. Capacitors, which 
you’ve already encountered, store energy in electric fields. An in- 
ductor does the same with magnetic fields. 


Capacitors 


A capacitor’s energy exists in its surrounding electric fields. It is 
proportional to the square of the field strength, which is proportional 


Section 10.5 


LRC Circuits 


543 


4H 


a/The symbol for a _ capaci- 
tor. 


eerie 


b / Some capacitors. 


c/ Two 


common 
for inductors. 
shape on the left is called a 
solenoid. 


geometries 
The cylindrical 
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tor. 


e/ Some inductors. 


to the charges on the plates. If we assume the plates carry charges 
that are the same in magnitude, +q and —q, then the energy stored 
in the capacitor must be proportional to q?. For historical reasons, 
we write the constant of proportionality as 1/2C, 


1 4 
= 30° 


The constant C' is a geometrical property of the capacitor, called its 
capacitance. 


Uc 


Based on this definition, the units of capacitance must be coulombs 
squared per joule, and this combination is more conveniently abbre- 
viated as the farad, 1 F = 1 C?/J. “Condenser” is a less formal 
term for a capacitor. Note that the labels printed on capacitors 
often use MF to mean uF, even though MF should really be the 
symbol for megafarads, not microfarads. Confusion doesn’t result 
from this nonstandard notation, since picofarad and microfarad val- 
ues are the most common, and it wasn’t until the 1990’s that even 
millifarad and farad values became available in practical physical 
sizes. Figure a shows the symbol used in schematics to represent a 
capacitor. 


‘A parallel-plate capacitor example 18 
> Suppose a capacitor consists of two parallel metal plates with 
area A, and the gap between them is h. The gap is small com- 
pared to the dimensions of the plates. What is the capacitance? 


> Since the plates are metal, the charges on each plate are free 
to move, and will tend to cluster themselves more densely near 
the edges due to the mutual repulsion of the other charges in the 
same plate. However, it turns out that if the gap is small, this is 
a small effect, so we can get away with assuming uniform charge 
density on each plate. The result of example 14 then applies, and 
for the region between the plates, we have E = 4ko = 41kq/A 
and U, = (1/87k)E*Ah. Substituting the first expression into the 
second, we find Up = 27kq*h/A. Comparing this to the definition 
of capacitance, we end up with C = A/47tkh. 


Inductors 


Any current will create a magnetic field, so in fact every current- 
carrying wire in a circuit acts as an inductor! However, this type 
of “stray” inductance is typically negligible, just as we can usually 
ignore the stray resistance of our wires and only take into account 
the actual resistors. To store any appreciable amount of magnetic 
energy, one usually uses a coil of wire designed specifically to be 
an inductor. All the loops’ contribution to the magnetic field add 
together to make a stronger field. Unlike capacitors and resistors, 
practical inductors are easy to make by hand. One can for instance 
spool some wire around a short wooden dowel. An inductor like 
this, in the form cylindrical coil of wire, is called a solenoid, c, and 
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a stylized solenoid, d, is the symbol used to represent an inductor 
in a circuit regardless of its actual geometry. 


How much energy does an inductor store? The energy density is 
proportional to the square of the magnetic field strength, which is 
in turn proportional to the current flowing through the coiled wire, 
so the energy stored in the inductor must be proportional to J?. We 
write L/2 for the constant of proportionality, giving 


Lio 

Ur, = 3f 
As in the definition of capacitance, we have a factor of 1/2, 
which is purely a matter of definition. The quantity L is called the 
inductance of the inductor, and we see that its units must be joules 
per ampere squared. This clumsy combination of units is more 
commonly abbreviated as the henry, 1 henry = 1 J/A?. Rather 
than memorizing this definition, it makes more sense to derive it 
when needed from the definition of inductance. Many people know 
inductors simply as “coils,” or “chokes,” and will not understand 
you if you refer to an “inductor,” but they will still refer to D as the 

“inductance,” not the “coilance” or “chokeance!” 


There is a lumped circuit approximation for inductors, just like 
the one for capacitors (p. 532). For a capacitor, this means assuming 
that the electric fields are completely internal, so that components 
only interact via currents that flow through wires, not due to the 
physical overlapping of their fields in space. Similarly for an induc- 
tor, the lumped circuit approximation is the assumption that the 
magnetic fields are completely internal. 


Identical inductances in series example 19 
If two inductors are placed in series, any current that passes 
through the combined double inductor must pass through both 
its parts. If we assume the lumped circuit approximation, the 
two inductors’ fields don’t interfere with each other, so the energy 
is doubled for a given current. Thus by the definition of induc- 
tance, the inductance is doubled as well. In general, inductances 
in series add, just like resistances. The same kind of reason- 
ing also shows that the inductance of a solenoid is approximately 
proportional to its length, assuming the number of turns per unit 
length is kept constant. (This is only approximately true, because 
putting two solenoids end-to-end causes the fields just outside 
their mouths to overlap and add together in a complicated man- 
ner. In other words, the lumped-circuit approximation may not be 
very good.) 


Identical capacitances in parallel example 20 
When two identical capacitances are placed in parallel, any charge 
deposited at the terminals of the combined double capacitor will 
divide itself evenly between the two parts. The electric fields sur- 
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h/A variable capacitor. 
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i / Discussion question B. 


rounding each capacitor will be half the intensity, and therefore 
store one quarter the energy. Two capacitors, each storing one 
quarter the energy, give half the total energy storage. Since ca- 
pacitance is inversely related to energy storage, this implies that 
identical capacitances in parallel give double the capacitance. In 
general, capacitances in parallel add. This is unlike the behav- 
ior of inductors and resistors, for which series configurations give 
addition. 


This is consistent with the result of example 18, which had the 
capacitance of a single parallel-plate capacitor proportional to the 
area of the plates. If we have two parallel-plate capacitors, and 
we combine them in parallel and bring them very close together 
side by side, we have produced a single capacitor with plates of 
double the area, and it has approximately double the capacitance, 
subject to any violation of the lumped-circuit approximation due to 
the interaction of the fields where the edges of the capacitors are 
joined together. 


Inductances in parallel and capacitances in series are explored 
in homework problems 36 and 33. 


A variable capacitor example 21 
Figure h/1 shows the construction of a variable capacitor out of 
two parallel semicircles of metal. One plate is fixed, while the 
other can be rotated about their common axis with a knob. The 
opposite charges on the two plates are attracted to one another, 
and therefore tend to gather in the overlapping area. This over- 
lapping area, then, is the only area that effectively contributes to 
the capacitance, and turning the knob changes the capacitance. 
The simple design can only provide very small capacitance val- 
ues, so in practice one usually uses a bank of capacitors, wired 
in parallel, with all the moving parts on the same shaft. 


Discussion Questions 


A Suppose that two parallel-plate capacitors are wired in parallel, and 
are placed very close together, side by side, so that the lumped circuit 
approximation is not very accurate. Will the resulting capacitance be too 
small, or too big? Could you twist the circuit into a different shape and 
make the effect be the other way around, or make the effect vanish? How 
about the case of two inductors in series? 


B Most practical capacitors do not have an air gap or vacuum gap 
between the plates; instead, they have an insulating substance called a 
dielectric. We can think of the molecules in this substance as dipoles that 
are free to rotate (at least a little), but that are not free to move around, 
since it is a solid. The figure shows a highly stylized and unrealistic way 
of visualizing this. We imagine that all the dipoles are intially turned side- 
ways, (1), and that as the capacitor is charged, they all respond by turning 
through a certain angle, (2). (In reality, the scene might be much more 
random, and the alignment effect much weaker.) 


For simplicity, imagine inserting just one electric dipole into the vacuum 
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gap. For a given amount of charge on the plates, how does this affect 
the amount of energy stored in the electric field? How does this affect the 
capacitance? 


Now redo the analysis in terms of the mechanical work needed in order 
to charge up the plates. 


10.5.2 Oscillations 


Figure j shows the simplest possible oscillating circuit. For any 
useful application it would actually need to include more compo- 
nents. For example, if it was a radio tuner, it would need to be 
connected to an antenna and an amplifier. Nevertheless, all the 
essential physics is there. 


We can analyze it without any sweat or tears whatsoever, sim- 
ply by constructing an analogy with a mechanical system. In a 
mechanical oscillator, k, we have two forms of stored energy, 


1 


Uspring = ah (1) 
1 
k= xe (2) 


In the case of a mechanical oscillator, we have usually assumed 
a friction force of the form that turns out to give the nicest math- 
ematical results, / = —bv. In the circuit, the dissipation of energy 
into heat occurs via the resistor, with no mechanical force involved, 
so in order to make the analogy, we need to restate the role of the 
friction force in terms of energy. The power dissipated by friction 
equals the mechanical work it does in a time interval dt, divided by 
dt, P = W/dt = Fda/dt = Fv = —bv?, so 


rate of heat dissipation = —bv? : (3) 


self-check F 

Equation (1) has x squared, and equations (2) and (3) have v squared. 
Because they’re squared, the results don’t depend on whether these 
variables are positive or negative. Does this make physical sense? > 
Answer, p. 870 


In the circuit, the stored forms of energy are 


1 

Uc = car (1’) 
1 

Up = 5LP ’ (2') 


and the rate of heat dissipation in the resistor is 
rate of heat dissipation = —RI? ; (3’) 


Comparing the two sets of equations, we first form analogies between 
quantities that represent the state of the system at some moment 
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in time: 


rod 
vel 


self-check G 
How is v related mathematically to x? How is / connected to g? Are the 
two relationships analogous? > Answer, p. 871 


Next we relate the ones that describe the system’s permanent 
characteristics: 


kol/c 
mol 
bOoR 


Since the mechanical system naturally oscillates with a frequency? 


w & \/k/m , we can immediately solve the electrical version by anal- 
ogy, giving 


Since the resistance R is analogous to b in the mechanical case, 
we find that the Q (quality factor, not charge) of the resonance 
is inversely proportional to R, and the width of the resonance is 
directly proportional to R. 


Tuning a radio receiver example 22 

A radio receiver uses this kind of circuit to pick out the desired 
station. Since the receiver resonates at a particular frequency, 
stations whose frequencies are far off will not excite any response 
in the circuit. The value of R has to be small enough so that only 
one station at a time is picked up, but big enough so that the 
tuner isn’t too touchy. The resonant frequency can be tuned by 
adjusting either L or C, but variable capacitors are easier to build 
than variable inductors. 


A numerical calculation example 23 
The phone company sends more than one conversation at a time 
over the same wire, which is accomplished by shifting each voice 
signal into different range of frequencies during transmission. The 
number of signals per wire can be maximized by making each 
range of frequencies (known as a bandwidth) as small as possi- 
ble. It turns out that only a relatively narrow range of frequencies 
is necessary in order to make a human voice intelligible, so the 


3 As in chapter 2, we use the word “frequency” to mean either f or w = 2af 
when the context makes it clear which is being referred to. 
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phone company filters out all the extreme highs and lows. (This is 
why your phone voice sounds different from your normal voice.) 


> If the filter consists of an LRC circuit with a broad resonance 
centered around 1.0 kHz, and the capacitor is 1 uF (microfarad), 
what inductance value must be used? 


> Solving for L, we have 
1 


~ Cw? 
{ 


(10-6 F)(27 x 103 s—')? 
=25x 10-3 F's? 


Checking that these really are the same units as henries is a little 
tedious, but it builds character: 


Fo'52 = (Coys? 


The result is 25 mH (millinenries). 


This is actually quite a large inductance value, and would require 
a big, heavy, expensive coil. In fact, there is a trick for making 
this kind of circuit small and cheap. There is a kind of silicon 
chip called an op-amp, which, among other things, can be used 
to simulate the behavior of an inductor. The main limitation of the 
op-amp is that it is restricted to low-power applications. 


10.5.3 Voltage and current 


What is physically happening in one of these oscillating circuits? 
Let’s first look at the mechanical case, and then draw the analogy 
to the circuit. For simplicity, let’s ignore the existence of damping, 
so there is no friction in the mechanical oscillator, and no resistance 
in the electrical one. 


Suppose we take the mechanical oscillator and pull the mass 
away from equilibrium, then release it. Since friction tends to resist 
the spring’s force, we might naively expect that having zero friction 
would allow the mass to leap instantaneously to the equilibrium 
position. This can’t happen, however, because the mass would have 
to have infinite velocity in order to make such an instantaneous leap. 
Infinite velocity would require infinite kinetic energy, but the only 
kind of energy that is available for conversion to kinetic is the energy 
stored in the spring, and that is finite, not infinite. At each step on 
its way back to equilibrium, the mass’s velocity is controlled exactly 
by the amount of the spring’s energy that has so far been converted 
into kinetic energy. After the mass reaches equilibrium, it overshoots 
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due to its own momentum. It performs identical oscillations on both 
sides of equilibrium, and it never loses amplitude because friction is 
not available to convert mechanical energy into heat. 


Now with the electrical oscillator, the analog of position is charge. 
Pulling the mass away from equilibrium is like depositing charges 
+q and —q on the plates of the capacitor. Since resistance tends 
to resist the flow of charge, we might imagine that with no fric- 
tion present, the charge would instantly flow through the inductor 
(which is, after all, just a piece of wire), and the capacitor would 
discharge instantly. However, such an instant discharge is impossi- 
ble, because it would require infinite current for one instant. Infinite 
current would create infinite magnetic fields surrounding the induc- 
tor, and these fields would have infinite energy. Instead, the rate 
of flow of current is controlled at each instant by the relationship 
between the amount of energy stored in the magnetic field and the 
amount of current that must exist in order to have that strong a 
field. After the capacitor reaches q = 0, it overshoots. The circuit 
has its own kind of electrical “inertia,” because if charge was to stop 
flowing, there would have to be zero current through the inductor. 
But the current in the inductor must be related to the amount of 
energy stored in its magnetic fields. When the capacitor is at q = 0, 
all the circuit’s energy is in the inductor, so it must therefore have 
strong magnetic fields surrounding it and quite a bit of current going 
through it. 


The only thing that might seem spooky here is that we used to 
speak as if the current in the inductor caused the magnetic field, 
but now it sounds as if the field causes the current. Actually this is 
symptomatic of the elusive nature of cause and effect in physics. It’s 
equally valid to think of the cause and effect relationship in either 
way. This may seem unsatisfying, however, and for example does not 
really get at the question of what brings about a voltage difference 
across the resistor (in the case where the resistance is finite); there 
must be such a voltage difference, because without one, Ohm’s law 
would predict zero current through the resistor. 


Voltage, then, is what is really missing from our story so far. 


Let’s start by studying the voltage across a capacitor. Voltage is 
electrical potential energy per unit charge, so the voltage difference 
between the two plates of the capacitor is related to the amount by 
which its energy would increase if we increased the absolute values 
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of the charges on the plates from q to q + dq: 


Vo= (Uataq 7 Uy) /dq 


Many books use this as the definition of capacitance. This equation, 
by the way, probably explains the historical reason why C' was de- 
fined so that the energy was inversely proportional to C for a given 
value of C’: the people who invented the definition were thinking of a 
capacitor as a device for storing charge rather than energy, and the 
amount of charge stored for a fixed voltage (the charge “capacity” ) 
is proportional to C. 


In the case of an inductor, we know that if there is a steady, con- 
stant current flowing through it, then the magnetic field is constant, 
and so is the amount of energy stored; no energy is being exchanged 
between the inductor and any other circuit element. But what if 
the current is changing? The magnetic field is proportional to the 
current, so a change in one implies a change in the other. For con- 
creteness, let’s imagine that the magnetic field and the current are 
both decreasing. The energy stored in the magnetic field is there- 
fore decreasing, and by conservation of energy, this energy can’t just 
go away — some other circuit element must be taking energy from 
the inductor. The simplest example, shown in figure l, is a series 
circuit consisting of the inductor plus one other circuit element. It 
doesn’t matter what this other circuit element is, so we just call it a 
black box, but if you like, we can think of it as a resistor, in which 
case the energy lost by the inductor is being turned into heat by 
the resistor. The junction rule tells us that both circuit elements 
have the same current through them, so J could refer to either one, 
and likewise the loop rule tells us Vinductor + Volack bor = 0, so the 
two voltage drops have the same absolute value, which we can refer 
to as V. Whatever the black box is, the rate at which it is taking 
energy from the inductor is given by |P| = |IV]|, so 


[IV] = 
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which in many books is taken to be the definition of inductance. 
The direction of the voltage drop (plus or minus sign) is such that 
the inductor resists the change in current. 


There’s one very intriguing thing about this result. Suppose, 
for concreteness, that the black box in figure 1 is a resistor, and 
that the inductor’s energy is decreasing, and being converted into 
heat in the resistor. The voltage drop across the resistor indicates 
that it has an electric field across it, which is driving the current. 
But where is this electric field coming from? There are no charges 
anywhere that could be creating it! What we’ve discovered is one 
special case of a more general principle, the principle of induction: a 
changing magnetic field creates an electric field, which is in addition 
to any electric field created by charges. (The reverse is also true: 
any electric field that changes over time creates a magnetic field.) 
Induction forms the basis for such technologies as the generator and 
the transformer, and ultimately it leads to the existence of light, 
which is a wave pattern in the electric and magnetic fields. These 
are all topics for chapter 11, but it’s truly remarkable that we could 
come to this conclusion without yet having learned any details about 
magnetism. 


=i 0 ~| 
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m / Electric fields made by charges, 1, and by changing magnetic fields, 2 and 3. 


The cartoons in figure m compares electric fields made by charges, 
1, to electric fields made by changing magnetic fields, 2-3. In m/1, 
two physicists are in a room whose ceiling is positively charged and 
whose floor is negatively charged. The physicist on the bottom 
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throws a positively charged bowling ball into the curved pipe. The 
physicist at the top uses a radar gun to measure the speed of the 
ball as it comes out of the pipe. They find that the ball has slowed 
down by the time it gets to the top. By measuring the change in the 
ball’s kinetic energy, the two physicists are acting just like a volt- 
meter. They conclude that the top of the tube is at a higher voltage 
than the bottom of the pipe. A difference in voltage indicates an 
electric field, and this field is clearly being caused by the charges in 
the floor and ceiling. 


In m/2, there are no charges anywhere in the room except for 
the charged bowling ball. Moving charges make magnetic fields, so 
there is a magnetic field surrounding the helical pipe while the ball 
is moving through it. A magnetic field has been created where there 
was none before, and that field has energy. Where could the energy 
have come from? It can only have come from the ball itself, so 
the ball must be losing kinetic energy. The two physicists working 
together are again acting as a voltmeter, and again they conclude 
that there is a voltage difference between the top and bottom of 
the pipe. This indicates an electric field, but this electric field can’t 
have been created by any charges, because there aren’t any in the 
room. This electric field was created by the change in the magnetic 
field. 


The bottom physicist keeps on throwing balls into the pipe, until 
the pipe is full of balls, m/3, and finally a steady current is estab- 
lished. While the pipe was filling up with balls, the energy in the 
magnetic field was steadily increasing, and that energy was being 
stolen from the balls’ kinetic energy. But once a steady current is 
established, the energy in the magnetic field is no longer changing. 
The balls no longer have to give up energy in order to build up the 
field, and the physicist at the top finds that the balls are exiting the 
pipe at full speed again. There is no voltage difference any more. 
Although there is a current, d//dt is zero. 


Ballasts example 24 

In a gas discharge tube, such as a neon sign, enough voltage 
is applied to a tube full of gas to ionize some of the atoms in the 
gas. Once ions have been created, the voltage accelerates them, 
and they strike other atoms, ionizing them as well and resulting 
in a chain reaction. This is a spark, like a bolt of lightning. But 
once the spark starts up, the device begins to act as though it has 
no resistance: more and more current flows, without the need to 
apply any more voltage. The power, P = /V, would grow without 
limit, and the tube would burn itself out. 


The simplest solution is to connect an inductor, known as the 
“ballast,” in series with the tube, and run the whole thing on an 
AC voltage. During each cycle, as the voltage reaches the point 
where the chain reaction begins, there is a surge of current, but 


n/ Ballasts for fluorescent lights. 
Top: a big, heavy inductor used 
as a ballast in an old-fashioned 
fluorescent bulb. Bottom: a 
small solid-state ballast, built into 
the base of a modern compact 
fluorescent bulb. 
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the inductor resists such a sudden change of current, and the 
energy that would otherwise have burned out the bulb is instead 
channeled into building a magnetic field. 


A common household fluorescent lightbulb consists of a gas dis- 
charge tube in which the glass is coated with a fluorescent mate- 
rial. The gas in the tube emits ultraviolet light, which is absorbed 
by the coating, and the coating then glows in the visible spectrum. 


Until recently, it was common for a fluroescent light’s ballast to 
be a simple inductor, and for the whole device to be operated at 
the 60 Hz frequency of the electrical power lines. This caused 
the lights to flicker annoyingly at 120 Hz, and could also cause an 
audible hum, since the magnetic field surrounding the inductor 
could exert mechanical forces on things. These days, the trend 
is toward using a solid-state circuit that mimics the behavior of 
an inductor, but at a frequency in the kilohertz range, eliminating 
the flicker and hum. Modern compact fluorescent bulbs electronic 
have ballasts built into their bases, so they can be used as plug-in 
replacements for incandescent bulbs. A compact fluorescent bulb 
uses about 1/4 the electricity of an incandescent bulb, lasts ten 
times longer, and saves $30 worth of electricity over its lifetime. 


Discussion Question 


A What happens when the physicist at the bottom in figure m/3 starts 
getting tired, and decreases the current? 


10.5.4 Decay 


Up until now I’ve soft-pedaled the fact that by changing the char- 
acteristics of an oscillator, it is possible to produce non-oscillatory 
behavior. For example, imagine taking the mass-on-a-spring system 
and making the spring weaker and weaker. In the limit of small 
k, it’s as though there was no spring whatsoever, and the behavior 
of the system is that if you kick the mass, it simply starts slowing 
down. For friction proportional to v, as we’ve been assuming, the re- 
sult is that the velocity approaches zero, but never actually reaches 
zero. This is unrealistic for the mechanical oscillator, which will not 
have vanishing friction at low velocities, but it is quite realistic in 
the case of an electrical circuit, for which the voltage drop across the 
resistor really does approach zero as the current approaches zero. 


We do not even have to reduce k to exactly zero in order to get 
non-oscillatory behavior. There is actually a finite, critical value be- 
low which the behavior changes, so that the mass never even makes 
it through one cycle. This is the case of overdamping, discussed on 
page 185. 


Electrical circuits can exhibit all the same behavior. For sim- 
plicity we will analyze only the cases of LRC circuits with D = 0 or 
C=0. 
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The RC circuit 


We first analyze the RC circuit, o. In reality one would have 
to “kick” the circuit, for example by briefly inserting a battery, in 
order to get any interesting behavior. We start with Ohm’s law and 
the equation for the voltage across a capacitor: 


Ve=IR 
Vo =4/C 
The loop rule tells us 
Verve sO. 4g 


and combining the three equations results in a relationship between 


q and I: 
1 


RC! 

The negative sign tells us that the current tends to reduce the charge 
on the capacitor, i.e., to discharge it. It makes sense that the current 
is proportional to q: if q is large, then the attractive forces between 
the +q and —q charges on the plates of the capacitor are large, 
and charges will flow more quickly through the resistor in order to 
reunite. If there was zero charge on the capacitor plates, there would 
be no reason for current to flow. Since amperes, the unit of current, 
are the same as coulombs per second, it appears that the quantity 
RC must have units of seconds, and you can check for yourself that 
this is correct. RC is therefore referred to as the time constant of 
the circuit. 


: 


How exactly do I and q vary with time? Rewriting I as dq/dt, 


we have 
dq sl 


dt RC? 
We need a function g(t) whose derivative equals itself, but multiplied 
by a negative constant. A function of the form ae’, where e = 
2.718... is the base of natural logarithms, is the only one that has its 
derivative equal to itself, and ae” has its derivative equal to itself 
multiplied by 6. Thus our solution is 


t 
qd = dW &Xp (-z5) 


The REL circuit 


The RL circuit, q, can be attacked by similar methods, and it 
can easily be shown that it gives 


I =I,exp (-F#) 


The RL time constant equals L/R. 
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Death by solenoid; spark plugs example 25 
When we suddenly break an RL circuit, what will happen? It might 
seem that we’re faced with a paradox, since we only have two 
forms of energy, magnetic energy and heat, and if the current 
stops suddenly, the magnetic field must collapse suddenly. But 
where does the lost magnetic energy go? It can’t go into resistive 
heating of the resistor, because the circuit has now been broken, 
and current can’t flow! 


The way out of this conundrum is to recognize that the open gap 
in the circuit has a resistance which is large, but not infinite. This 
large resistance causes the RL time constant L/R to be very 
small. The current thus continues to flow for a very brief time, 
and flows straight across the air gap where the circuit has been 
opened. In other words, there is a spark! 


We can determine based on several different lines of reasoning 
that the voltage drop from one end of the spark to the other must 
be very large. First, the airs resistance is large, so V = /R re- 
quires a large voltage. We can also reason that all the energy 
in the magnetic field is being dissipated in a short time, so the 
power dissipated in the spark, P = IV, is large, and this requires 
a large value of V. (/ isn’t large — it is decreasing from its initial 
value.) Yet a third way to reach the same result is to consider the 
equation V, = d//dt: since the time constant is short, the time 
derivative d//dt is large. 


This is exactly how a car’s spark plugs work. Another application 
is to electrical safety: it can be dangerous to break an inductive 
circuit suddenly, because so much energy is released in a short 
time. There is also no guarantee that the spark will discharge 
across the air gap; it might go through your body instead, since 
your body might have a lower resistance. 


A spark-gap radio transmitter example 26 
Figure r shows a primitive type of radio transmitter, called a spark 
gap transmitter, used to send Morse code around the turn of the 
twentieth century. The high voltage source, V, is typically about 
10,000 volts. When the telegraph switch, S, is closed, the RC 
circuit on the left starts charging up. An increasing voltage differ- 
ence develops between the electrodes of the spark gap, G. When 
this voltage difference gets large enough, the electric field in the 
air between the electrodes causes a spark, partially discharging 
the RC circuit, but charging the LC circuit on the right. The LC 
circuit then oscillates at its resonant frequency (typically about 1 
MHz), but the energy of these oscillations is rapidly radiated away 
by the antenna, A, which sends out radio waves (chapter 11). 


Discussion Questions 


A gopher gnaws through one of the wires in the DC lighting system 


in your front yard, and the lights turn off. At the instant when the circuit 
becomes open, we can consider the bare ends of the wire to be like the 
plates of a capacitor, with an air gap (or gopher gap) between them. What 
kind of capacitance value are we talking about here? What would this tell 
you about the AC time constant? 


10.5.5 Review of complex numbers 


For a more detailed treatment of complex numbers, see ch. 3 of 
James Nearing’s free book at 
http://www.physics.miami.edu/nearing/mathmethods/. 


We assume there is a number, i, such that i? = —1. The square 
roots of —1 are then 7 and —i. (In electrical engineering work, where 
i stands for current, j is sometimes used instead.) This gives rise to 
a number system, called the complex numbers, containing the real 
numbers as a subset. Any complex number z can be written in the 
form z = a+01, where a and b are real, and a and B are then referred 
to as the real and imaginary parts of z. A number with a zero real 
part is called an imaginary number. The complex numbers can be 
visualized as a plane, with the real number line placed horizontally 
like the x axis of the familiar x—y plane, and the imaginary numbers 
running along the y axis. The complex numbers are complete in a 
way that the real numbers aren’t: every nonzero complex number 
has two square roots. For example, 1 is a real number, so it is also 
a member of the complex numbers, and its square roots are —1 and 
1. Likewise, —1 has square roots 7 and —7, and the number 7 has 


square roots 1//2 + i/V2 and —1/V2—i/vV2. 


Complex numbers can be added and subtracted by adding or 
subtracting their real and imaginary parts. Geometrically, this is 
the same as vector addition. 


The complex numbers a+ bi and a — 01, lying at equal distances 
above and below the real axis, are called complex conjugates. The 
results of the quadratic formula are either both real, or complex 
conjugates of each other. The complex conjugate of a number z is 
notated as Z or 2*. 


The complex numbers obey all the same rules of arithmetic as 
the reals, except that they can’t be ordered along a single line. That 
is, it’s not possible to say whether one complex number is greater 
than another. We can compare them in terms of their magnitudes 
(their distances from the origin), but two distinct complex numbers 
may have the same magnitude, so, for example, we can’t say whether 
1 is greater than 7 or 7 is greater than 1. 


A square root of i 
> Prove that 1/\/2 + i/\/2 is a square root of i. 


> Our proof can use any ordinary rules of arithmetic, except for 


example 27 
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s/ Visualizing complex num- 
bers as points in a plane. 


t/ Addition of complex num- 
bers is just like addition of 
vectors, although the real and 
imaginary axes don’t actually 
represent directions in space. 


u/A complex number and 


its conjugate. 
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arg z 


v/A complex number can 
be described in terms of its 
magnitude and argument. 


e UV 


w/The argument of uv is 
the sum of the arguments of u 
and v. 


ordering. 
Ageia oh Og Set gee gee te 
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Example 27 showed one method of multiplying complex num- 
bers. However, there is another nice interpretation of complex mul- 
tiplication. We define the argument of a complex number as its angle 
in the complex plane, measured counterclockwise from the positive 
real axis. Multiplying two complex numbers then corresponds to 
multiplying their magnitudes, and adding their arguments. 


self-check H 
Using this interpretation of multiplication, how could you find the square 
roots of a complex number? > Answer, p. 871 


An identity example 28 
The magnitude |z| of a complex number z obeys the identity |z|? = 
zz. To prove this, we first note that Z has the same magnitude 
as Z, since flipping it to the other side of the real axis doesn’t 
change its distance from the origin. Multiplying z by Z gives a 
result whose magnitude is found by multiplying their magnitudes, 
so the magnitude of zZ must therefore equal |z|?._ Now we just 
have to prove that 22 is a positive real number. But if, for example, 
Z lies counterclockwise from the real axis, then Z lies clockwise 
from it. If z has a positive argument, then Zz has a negative one, or 
vice-versa. The sum of their arguments is therefore zero, so the 


result has an argument of zero, and is on the positive real axis. 
4 


This whole system was built up in order to make every number 
have square roots. What about cube roots, fourth roots, and so on? 
Does it get even more weird when you want to do those as well? No. 
The complex number system we’ve already discussed is sufficient to 
handle all of them. The nicest way of thinking about it is in terms 
of roots of polynomials. In the real number system, the polynomial 
x?—1 has two roots, i.e., two values of x (plus and minus one) that we 
can plug in to the polynomial and get zero. Because it has these two 
real roots, we can rewrite the polynomial as (2 —1)(a+1). However, 
the polynomial x? + 1 has no real roots. It’s ugly that in the real 
number system, some second-order polynomials have two roots, and 
can be factored, while others can’t. In the complex number system, 
they all can. For instance, x? + 1 has roots i and —i, and can be 


‘I cheated a little. If z’s argument is 30 degrees, then we could say Z’s was 
-30, but we could also call it 330. That’s OK, because 330+30 gives 360, and an 
argument of 360 is the same as an argument of zero. 
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factored as (x — i)(a +7). In general, the fundamental theorem of 
algebra states that in the complex number system, any nth-order 
polynomial can be factored completely into n linear factors, and we 
can also say that it has n complex roots, with the understanding that 
some of the roots may be the same. For instance, the fourth-order 
polynomial «*+<? can be factored as (a —i)(x+7)(x—0)(x2—0), and 
we say that it has four roots, 7, —i, 0, and 0, two of which happen 
to be the same. This is a sensible way to think about it, because 
in real life, numbers are always approximations anyway, and if we 
make tiny, random changes to the coefficients of this polynomial, it 
will have four distinct roots, of which two just happen to be very 
close to zero. 


Discussion Questions 


A Find arg/, arg(—/), and arg 37, where arg z denotes the argument of 
the complex number 2. 


B Visualize the following multiplications in the complex plane using 
the interpretation of multiplication in terms of multiplying magnitudes and 
adding arguments: (/)(i) = —1, (/)(—/) = 1, (—i)(—i) = -1. 


Cc If we visualize Zz as a point in the complex plane, how should we 
visualize —z? What does this mean in terms of arguments? Give similar 
interpretations for z* and \/z. 


D Find four different complex numbers z such that z* = 1. 


E Compute the following. Use the magnitude and argument, not the 
real and imaginary parts. 
ar d 
, Itai : 


Based on the results above, compute the real and imaginary parts of 


1/(1 +/). 


10.5.6 Euler’s formula 


l1+/) , arg(1+/ , ce 


Having expanded our horizons to include the complex numbers, 
it’s natural to want to extend functions we knew and loved from 
the world of real numbers so that they can also operate on complex 
numbers. The only really natural way to do this in general is to 
use Taylor series. A particularly beautiful thing happens with the 
functions e”, sinx, and cos 2: 


1 1 
(a \ 2 3 
CS ae er Peasctns 
= Tog Dd 
cosx = 1 at aaa 
2 1 3, 1 5 
sinz =x at aie 


If x = 7¢ is an imaginary number, we have 


e'? =cosd+isind ; 
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x/The complex number e/? 
lies on the unit circle. 


y / Leonhard Euler (1707-1783) 


a result known as Euler’s formula. The geometrical interpretation 
in the complex plane is shown in figure x. 


Although the result may seem like something out of a freak show 
at first, applying the definition of the exponential function makes it 
clear how natural it is: 


When x = i¢ is imaginary, the quantity (1 + i¢/n) represents a 
number lying just above 1 in the complex plane. For large n, (1 + 
i@/n) becomes very close to the unit circle, and its argument is the 
small angle ¢/n. Raising this number to the nth power multiplies 
its argument by n, giving a number with an argument of ¢. 


Euler’s formula is used frequently in physics and engineering. 
‘Trig functions in terms of complex exponentials example 29 
> Write the sine and cosine functions in terms of exponentials. 


> Euler’s formula for x = —id gives cos —/sin d, since cos(—6) = 
cos 8, and sin(—@) = —sin@. 


elx + ex 
cos X = ——__— 
2 
elx _ e7!x 
sinx= = ——_ 
2! 
‘A hard integral made easy example 30 
> Evaluate 
/ e* cos xdx 


> This seemingly impossible integral becomes easy if we rewrite 
the cosine in terms of exponentials: 


/ e* cos xdx 


ix — ix 


_ 5 fem + elt) dy 


1 /eltt)x — @(t-ix : 
“2\ ter” tay.” 


Since this result is the integral of a real-valued function, we'd like 
it to be real, and in fact it is, since the first and second terms are 
complex conjugates of one another. If we wanted to, we could 
use Euler’s theorem to convert it back to a manifestly real result.° 


5In general, the use of complex number techniques to do an integral could 
result in a complex number, but that complex number would be a constant, 
which could be subsumed within the usual constant of integration. 
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10.5.7 Impedance 


So far we have been thinking in terms of the free oscillations of a 
circuit. This is like a mechanical oscillator that has been kicked but 
then left to oscillate on its own without any external force to keep 
the vibrations from dying out. Suppose an LRC circuit is driven 
with a sinusoidally varying voltage, such as will occur when a radio 
tuner is hooked up to a receiving antenna. We know that a current 
will flow in the circuit, and we know that there will be resonant 
behavior, but it is not necessarily simple to relate current to voltage 
in the most general case. Let’s start instead with the special cases 
of LRC circuits consisting of only a resistance, only a capacitance, 
or only an inductance. We are interested only in the steady-state 
response. 


The purely resistive case is easy. Ohm’s law gives 


l= 
R 


In the purely capacitive case, the relation V = q/C lets us cal- 
culate 


This is partly analogous to Ohm’s law. For example, if we double 
the amplitude of a sinusoidally varying AC voltage, the derivative 
dV /dt will also double, and the amplitude of the sinusoidally varying 
current will also double. However, it is not true that J = V/R, be- 
cause taking the derivative of a sinusoidal function shifts its phase by 
90 degrees. If the voltage varies as, for example, V(t) = Vo sin(wt), 
then the current will be I(t) = wCV, cos(wt). The amplitude of the 
current is wCVo, which is proportional to Vo, but it’s not true that 
I(t) = V(t)/R for some constant R. 


A second problem that crops up is that our entire analysis of 
DC resistive circuits was built on the foundation of the loop rule 
and the junction rule, both of which are statements about sums. To 
apply the junction rule to an AC circuit, for exampe, we would say 
that the sum of the sine waves describing the currents coming into 
the junction is equal (at every moment in time) to the sum of the 
sine waves going out. Now sinusoidal functions have a remarkable 
property, which is that if you add two different sinusoidal functions 
having the same frequency, the result is also a sinusoid with that 
frequency. For example, coswt + sinwt = V2sin(wt + 7/4), which 
can be proved using trig identities. The trig identities can get very 
cumbersome, however, and there is a much easier technique involv- 
ing complex numbers. 
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z/ln a capacitor, the current 
is 90° ahead of the voltage in 
phase. 
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~Sesin (t+ 1/4) 


0 sinwt 2sinot 


aa / Representing functions 
with points in polar coordinates. 


cos @te--.  esinwt+cos wt 
= a 
sin ot 
ab / Adding two sinusoidal 
functions. 
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Figure aa shows a useful way to visualize what’s going on. When 
a circuit is oscillating at a frequency w, we use points in the plane to 
represent sinusoidal functions with various phases and amplitudes. 


self-check | 
Which of the following functions can be represented in this way? cos(6f— 
4), cos? f, tant > Answer, p. 871 


The simplest examples of how to visualize this in polar coordi- 
nates are ones like coswt + coswt = 2coswt, where everything has 
the same phase, so all the points lie along a single line in the polar 
plot, and addition is just like adding numbers on the number line. 
The less trivial example coswt + sinwt = /2sin(wt + 7/4), can be 
visualized as in figure ab. 


Figure ab suggests that all of this can be tied together nicely 
if we identify our plane with the plane of complex numbers. For 
example, the complex numbers 1 and 7 represent the functions sin wt 
and coswt. In figure z, for example, the voltage across the capacitor 
is a sine wave multiplied by a number that gives its amplitude, so 
we associate that function with a number V lying on the real axis. 
Its magnitude, |V|, gives the amplitude in units of volts, while its 
argument arg V, gives its phase angle, which is zero. The current 
is a multiple of a sine wave, so we identify it with a number [ 
lying on the imaginary axis. We have arg = 90°, and |J| is the 
amplitude of the current, in units of amperes. But comparing with 
our result above, we have || = wC|V]|. Bringing together the phase 
and magnitude information, we have [ = iwC'V. This looks very 
much like Ohm’s law, so we write 

ite. « 
Zc 
where the quantity 
i 
ZC = WC ; 
having units of ohms, is called the impedance of the capacitor at 
this frequency. 


[impedance of a capacitor] 


It makes sense that the impedance becomes infinite at zero fre- 
quency. Zero frequency means that it would take an infinite time 
before the voltage would change by any amount. In other words, 
this is like a situation where the capacitor has been connected across 
the terminals of a battery and been allowed to settle down to a state 
where there is constant charge on both terminals. Since the elec- 
tric fields between the plates are constant, there is no energy being 
added to or taken out of the field. A capacitor that can’t exchange 
energy with any other circuit component is nothing more than a 
broken (open) circuit. 


Note that we have two types of complex numbers: those that 
represent sinusoidal functions of time, and those that represent 
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impedances. The ones that represent sinusoidal functions have tildes 
on top, which look like little sine waves. 


self-check J 
Why can’t a capacitor have its impedance printed on it along with its 
capacitance? > Answer, p. 871 


Similar math (but this time with an integral instead of a deriva- 
tive) gives 


Dy, = mad [impedance of an inductor] 


for an inductor. It makes sense that the inductor has lower impedance 
at lower frequencies, since at zero frequency there is no change in 
the magnetic field over time. No energy is added to or released 
from the magnetic field, so there are no induction effects, and the 
inductor acts just like a piece of wire with negligible resistance. The 
term “choke” for an inductor refers to its ability to “choke out” high 
frequencies. 


The phase relationships shown in figures z and ac can be re- 
membered using my own mnemonic, “eVIL,” which shows that the 
voltage (V) leads the current (I) in an inductive circuit, while the 
opposite is true in a capacitive one. A more traditional mnemonic 
is “ELI the ICE man,” which uses the notation E for emf, a concept 
closely related to voltage (see p. 640). 


Summarizing, the impedances of resistors, capacitors, and in- 
ductors are 


Ze=R 
a 
tO = 5G 
Zr = iwh 
Low-pass and high-pass filters example 31 


An LRC circuit only responds to a certain range (band) of fre- 
quencies centered around its resonant frequency. As a filter, this 
is known as a bandpass filter. If you turn down both the bass and 
the treble on your stereo, you have created a bandpass filter. 


To create a high-pass or low-pass filter, we only need to insert 
a capacitor or inductor, respectively, in series. For instance, a 
very basic surge protector for a computer could be constructed 
by inserting an inductor in series with the computer. The desired 
60 Hz power from the wall is relatively low in frequency, while the 
surges that can damage your computer show much more rapid 
time variation. Even if the surges are not sinusoidal signals, we 
can think of a rapid “spike” qualitatively as if it was very high in 
frequency — like a high-frequency sine wave, it changes very 
rapidly. 
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ac/The current through an 
inductor lags behind the voltage 
by a phase angle of 90°. 
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Inductors tend to be big, heavy, expensive circuit elements, so a 
simple surge protector would be more likely to consist of a capac- 
itor in parallel! with the computer. (In fact one would normally just 
connect one side of the power circuit to ground via a capacitor.) 
The capacitor has a very high impedance at the low frequency of 
the desired 60 Hz signal, so it siphons off very little of the current. 
But for a high-frequency signal, the capacitor’s impedance is very 
small, and it acts like a zero-impedance, easy path into which the 
current is diverted. 


The main things to be careful about with impedance are that 
(1) the concept only applies to a circuit that is being driven sinu- 
soidally, (2) the impedance of an inductor or capacitor is frequency- 
dependent. 


Discussion Question 


A __ Figure z on page 561 shows the voltage and current for a capacitor. 
Sketch the q-t graph, and use it to give a physical explanation of the 
phase relationship between the voltage and current. For example, why is 
the current zero when the voltage is at a maximum or minimum? 


B_ Figure ac on page 563 shows the voltage and current for an inductor. 
The power is considered to be positive when energy is being put into the 
inductor’s magnetic field. Sketch the graph of the power, and then the 
graph of U, the energy stored in the magnetic field, and use it to give 
a physical explanation of the P-t graph. In particular, discuss why the 
frequency is doubled on the P-t graph. 


C Relate the features of the graph in figure ac on page 563 to the story 
told in cartoons in figure m/2-3 on page 552. 


10.5.8 Power 


How much power is delivered when an oscillating voltage is ap- 
plied to an impedance? The equation P = IV is generally true, 
since voltage is defined as energy per unit charge, and current is 
defined as charge per unit time: multiplying them gives energy per 
unit time. In a DC circuit, all three quantities were constant, but 
in an oscillating (AC) circuit, all three display time variation. 


A resistor 


First let’s examine the case of a resistor. For instance, you’re 
probably reading this book from a piece of paper illuminated by 
a glowing lightbulb, which is driven by an oscillating voltage with 
amplitude V,. In the special case of a resistor, we know that J and 
V are in phase. For example, if V varies as V, coswt, then I will be 
a cosine as well, [,coswt. The power is then [,V, cos? wt, which is 
always positive,° and varies between 0 and J,Vo. Even if the time 
variation was cos wt or sin(wt+7/4), we would still have a maximum 
power of I,V,, because both the voltage and the current would reach 


°A resistor always turns electrical energy into heat. It never turns heat into 
electrical energy! 
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their maxima at the same time. In a lightbulb, the moment of 
maximum power is when the circuit is most rapidly heating the 
filament. At the instant when P = 0, a quarter of a cycle later, no 
current is flowing, and no electrical energy is being turned into heat. 
Throughout the whole cycle, the filament is getting rid of energy by 
radiating light.” Since the circuit oscillates at a frequency® of 60 Hz, 
the temperature doesn’t really have time to cycle up or down very 
much over the 1/60 s period of the oscillation, and we don’t notice 
any significant variation in the brightness of the light, even with a 
short-exposure photograph. 


Thus, what we really want to know is the average power, “aver- 
age” meaning the average over one full cycle. Since we’re covering 
a whole cycle with our average, it doesn’t matter what phase we 
assume. Let’s use a cosine. The total amount of energy transferred 
over one cycle is 


T 
-| TVo cos? wtdt 
0 
Tv 
=1,Vo ‘i cos? wtdt 
0 


fa 
= 10 | 
0 


(1 + cos 2wt) dt 


Nile 


"To many people, the word “radiation” implies nuclear contamination. Ac- 
tually, the word simply means something that “radiates” outward. Natural 
sunlight is “radiation.” So is the light from a lightbulb, or the infrared light 
being emitted by your skin right now. 

®Note that this time “frequency” means f, not w! Physicists and engineers 
generally use w because it simplifies the equations, but electricians and techni- 
cians always use f. The 60 Hz frequency is for the U.S. 
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ad/Power in a resistor: the 
rate at which electrical energy is 
being converted into heat. 
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The reason for using the trig identity cos*x = (1 + cos2x)/2 in 


the last step is that it lets us get the answer without doing a hard 
integral. Over the course of one full cycle, the quantity cos 2wt goes 
positive, negative, positive, and negative again, so the integral of it 
is zero. We then have 


eat 
E=1¥, f <dt 
ae) 


EVeE 
“9 


The average power is 


energy transferred in one full cycle 


P,,, = 
- time for one full cycle 


devel? 
mos ae 
_ InVo 
= 


i.e., the average is half the maximum. The power varies from 0 
to IjVo, and it spends equal amounts of time above and below the 
maximum, so it isn’t surprising that the average power is half-way 
in between zero and the maximum. Summarizing, we have 


IV 
Paw = 


[average power in a resistor] 


for a resistor. 


Rms quantities 


Suppose one day the electric company decided to start supplying 
your electricity as DC rather than AC. How would the DC voltage 
have to be related to the amplitude Vg of the AC voltage previously 
used if they wanted your lightbulbs to have the same brightness as 
before? The resistance of the bulb, R, is a fixed value, so we need 
to relate the power to the voltage and the resistance, eliminating 
the current. In the DC case, this gives P = IV = (V/R)V = V?/R. 
(For DC, P and Pyy are the same.) In the AC case, Pay = IpVo/2 = 
V2/2R. Since there is no factor of 1/2 in the DC case, the same 
power could be provided with a DC voltage that was smaller by a 
factor of 1/\/2. Although you will hear people say that household 
voltage in the U.S. is 110 V, its amplitude is actually (110 V) x 
V2 = 160 V. The reason for referring to Vo/V2 as “the” voltage is 
that people who are naive about AC circuits can plug Vo/V/2 into 
a familiar DC equation like P = V?/R and get the right average 
answer. The quantity Vo//2 is called the “RMS” voltage, which 
stands for “root mean square.” The idea is that if you square the 
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function V(t), take its average (mean) over one cycle, and then take 
the square root of that average, you get Vo//2. Many digital meters 
provide RMS readouts for measuring AC voltages and currents. 


A capacitor 


For a capacitor, the calculation starts out the same, but ends up 
with a twist. If the voltage varies as a cosine, V,coswt, then the 
relation I = CdV/dt tells us that the current will be some constant 
multiplied by minus the sine, —V, sinwt. The integral we did in the 
case of a resistor now becomes 


T 
=| —IjVo sin wt cos wtdt ; 
0 


and based on figure ae, you can easily convince yourself that over 
the course of one full cycle, the power spends two quarter-cycles 
being negative and two being positive. In other words, the average 
power is zero! 


Why is this? It makes sense if you think in terms of energy. 
A resistor converts electrical energy to heat, never the other way 
around. A capacitor, however, merely stores electrical energy in an 
electric field and then gives it back. For a capacitor, 


Pay = 0 [average power in a capacitor] 


Notice that although the average power is zero, the power at any 
given instant is not typically zero, as shown in figure ae. The capac- 
itor does transfer energy: it’s just that after borrowing some energy, 
it always pays it back in the next quarter-cycle. 


An inductor 


The analysis for an inductor is similar to that for a capacitor: the 
power averaged over one cycle is zero. Again, we’re merely storing 
energy temporarily in a field (this time a magnetic field) and getting 
it back later. 


10.5.9 Impedance matching 


Figure af shows a commonly encountered situation: we wish to 
maximize the average power, P,,, delivered to the load for a fixed 
value of Vo, the amplitude of the oscillating driving voltage. We 
assume that the impedance of the transmission line, Zr is a fixed 
value, over which we have no control, but we are able to design the 
load, Z, with any impedance we like. For now, we’ll also assume 
that both impedances are resistive. For example, Zp could be the 
resistance of a long extension cord, and Z, could be a lamp at the 
end of it. The result generalizes immediately, however, to any kind 
of impedance impedance. For example, the load could be a stereo 
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ae/Power in a_ capacitor: 
the rate at which energy is being 
stored in (+) or removed from (-) 
the electric field. 


af/We wish to maximize the 
power delivered to the load, 2, 
by adjusting its impedance. 
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speaker’s magnet coil, which is displays both inductance and resis- 
tance. (For a purely inductive or capacitive load, P,, equals zero, 
so the problem isn’t very interesting!) 


Since we’re assuming both the load and the transmission line are 
resistive, their impedances add in series, and the amplitude of the 
current is given by 

Vo 


(ae 
Zo + 27 


so 
Pay = ToVar2 

= 17Z,/2 
Vez. 


~ Gea 


The maximum of this expression occurs where the derivative is zero, 


0-55 | Vez | 
~ -2dZ |(Z, + Zr) 


i a Zo 
“raz lara 
0=(Z.+ Zr)? — 22, (Zo + Zr) * 
0=(Z.+ Zr) —2Z, 
Lo = 27 


In other words, to maximize the power delivered to the load, we 
should make the load’s impedance the same as the transmission 
line’s. This result may seem surprising at first, but it makes sense 
if you think about it. If the load’s impedance is too high, it’s like 
opening a switch and breaking the circuit; no power is delivered. 
On the other hand, it doesn’t pay to make the load’s impedance too 
small. Making it smaller does give more current, but no matter how 
small we make it, the current will still be limited by the transmission 
line’s impedance. As the load’s impedance approaches zero, the 
current approaches this fixed value, and the the power delivered, 
I?Z,, decreases in proportion to Zo. 


Maximizing the power transmission by matching Zr to Zp is 
called impedance matching. For example, an 8-ohm home stereo 
speaker will be correctly matched to a home stereo amplifier with 
an internal impedance of 8 ohms, and 4-ohm car speakers will be 
correctly matched to a car stereo with a 4-ohm internal impedance. 
You might think impedance matching would be unimportant be- 
cause even if, for example, we used a car stereo to drive 8-ohm 
speakers, we could compensate for the mismatch simply by turn- 
ing the volume knob higher. This is indeed one way to compensate 
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for any impedance mismatch, but there is always a price to pay. 


When the impedances are matched, half the power is dissipated in 
the transmission line and half in the load. By connecting a 4-ohm 
amplifier to an 8-ohm speaker, however, you would be setting up a 
situation in two watts were being dissipated as heat inside the amp 
for every amp being delivered to the speaker. In other words, you 
would be wasting energy, and perhaps burning out your amp when 
you turned up the volume to compensate for the mismatch. 


10.5.10 Impedances in series and parallel 


How do impedances combine in series and parallel? The beauty 
of treating them as complex numbers is that they simply combine 
according to the same rules you’ve already learned as resistances. 


Series impedance example 32 
> A capacitor and an inductor in series with each other are driven 
by a sinusoidally oscillating voltage. At what frequency is the cur- 
rent maximized? 


> Impedances in series, like resistances in series, add. The ca- 
pacitor and inductor act as if they were a single circuit element 
with an impedance 


The current is then 


i V 
~ iwL—i/wC 

We don’t care about the phase of the current, only its amplitude, 
which is represented by the absolute value of the complex num- 
ber /, and this can be maximized by making |iwL —i/wC| as small 
as possible. But there is some frequency at which this quantity is 
zero — 


At this frequency, the current is infinite! What is going on phys- 
ically? This is an LRC circuit with R = 0. It has a resonance at 
this frequency, and because there is no damping, the response 
at resonance is infinite. Of course, any real LRC circuit will have 
some damping, however small (cf. figure j on page 180). 
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Resonance with damping example 33 
> What is the amplitude of the current in a series LRC circuit? 


> Generalizing from example 32, we add a third, real impedance: 


kd 
fice 
|/| Z| 
_ W 
|R+iwl —i/wC| 


: iv 
JR? + (wl — 1/wC)2 


This result would have taken pages of algebra without the com- 
plex number technique! 


A second-order stereo crossover filter example 34 
A stereo crossover filter ensures that the high frequencies go to 
the tweeter and the lows to the woofer. This can be accomplished 
simply by putting a single capacitor in series with the tweeter and 
a single inductor in series with the woofer. However, such a filter 
does not cut off very sharply. Suppose we model the speakers 
as resistors. (They really have inductance as well, since they 
have coils in them that serve as electromagnets to move the di- 
aphragm that makes the sound.) Then the power they draw is 
PR. Putting an inductor in series with the woofer, ag/1, gives 
a total impedance that at high frequencies is dominated by the 
inductor’s, so the current is proportional to w~', and the power 
drawn by the woofer is proportional to w~?. 


A second-order filter, like ag/2, is one that cuts off more sharply: 
at high frequencies, the power goes like w~*. To analyze this 
circuit, we first calculate the total impedance: 


ZeZi4(Z5'+Z,') | 


All the current passes through the inductor, so if the driving volt- 
age being supplied on the left is Vj, we have 


Va=hZ  , 
and we also have ce tt 
Vi =12Z, 
The loop rule, applied to the outer perimeter of the circuit, gives 
Vy = V+ Vp 


Straightforward algebra now results in 
i, Vas 
WS Te Zl Ze + Zi) 2a 


At high frequencies, the Z,/Zc term, which varies as w, dom- 
inates, so Vg and /p are proportional to w~*, and the power is 
proportional to w~4. 


10.6 Fields by Gauss’ Law 


10.6.1 Gauss’ law 


The flea of subsection 10.3.2 had a long and illustrious scientific 
career, and we’re now going to pick up her story where we left. off. 
This flea, whose name is Gauss’, has derived the equation E; = 
2nko for the electric field very close to a charged surface with charge 
density 0. Next we will describe two improvements she is going to 
make to that equation. 


First, she realizes that the equation is not as useful as it could be, 
because it only gives the part of the field due to the surface. If other 
charges are nearby, then their fields will add to this field as vectors, 
and the equation will not be true unless we carefully subtract out 
the field from the other charges. This is especially problematic for 
her because the planet on which she lives, known for obscure reasons 
as planet Flatcat, is itself electrically charged, and so are all the fleas 
— the only thing that keeps them from floating off into outer space 
is that they are negatively charged, while Flatcat carries a positive 
charge, so they are electrically attracted to it. When Gauss found 
the original version of her equation, she wanted to demonstrate it to 
her skeptical colleagues in the laboratory, using electric field meters 
and charged pieces of metal foil. Even if she set up the measurements 
by remote control, so that her the charge on her own body would 
be too far away to have any effect, they would be disrupted by the 
ambient field of planet Flatcat. Finally, however, she realized that 
she could improve her equation by rewriting it as follows: 


Poured, on side 1 + Einar, on side 2 — Atkao 


The tricky thing here is that “outward” means a different thing, 
depending on which side of the foil we’re on. On the left side, 
“outward” means to the left, while on the right side, “outward” is 
right. A positively charged piece of metal foil has a field that points 
leftward on the left side, and rightward on its right side, so the two 
contributions of 27ko are both positive, and we get 4a7ko. On the 
other hand, suppose there is a field created by other charges, not 
by the charged foil, that happens to point to the right. On the 
right side, this externally created field is in the same direction as 
the foil’s field, but on the left side, the it reduces the strength of the 
leftward field created by the foil. The increase in one term of the 
equation balances the decrease in the other term. This new version 
of the equation is thus exactly correct regardless of what externally 
generated fields are present! 


Her next innovation starts by multiplying the equation on both 
sides by the area, A, of one side of the foil: 


( Houewara: on side 1 + outward, on side 2) A=4rkoA 


°no relation to the human mathematician of the same name 
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a/The area vector is  de- 
fined to be perpendicular to the 
surface, in the outward direction. 
Its magnitude tells how much the 
area is. 


b / Gauss contemplates a 
map of the known world. 


or 


outward, on side 1A 7 outward, on side 2A = Arkq ’ 


where q is the charge of the foil. The reason for this modification is 
that she can now make the whole thing more attractive by defining 
a new vector, the area vector A. As shown in figure a, she defines 
an area vector for side 1 which has magnitude A and points outward 
from side 1, and an area vector for side 2 which has the same mag- 
nitude and points outward from that side, which is in the opposite 
direction. The dot product of two vectors, u-v, can be interpreted 
aS Uparallel to v|/V|, and she can therefore rewrite her equation as 


E, - A, + Eg- Ao = 4rkq 


The quantity on the left side of this equation is called the flux 
through the surface, written ®. 


Gauss now writes a grant proposal to her favorite funding agency, 
the BSGS (Blood-Suckers’ Geological Survey), and it is quickly ap- 
proved. Her audacious plan is to send out exploring teams to chart 
the electric fields of the whole planet of Flatcat, and thereby de- 
termine the total electric charge of the planet. The fleas’ world 
is commonly assumed to be a flat disk, and its size is known to 
be finite, since the sun passes behind it at sunset and comes back 
around on the other side at dawn. The most daring part of the plan 
is that it requires surveying not just the known side of the planet 
but the uncharted Far Side as well. No flea has ever actually gone 
around the edge and returned to tell the tale, but Gauss assures 
them that they won’t fall off — their negatively charged bodies will 
be attracted to the disk no matter which side they are on. 


Of course it is possible that the electric charge of planet Flatcat 
is not perfectly uniform, but that isn’t a problem. As discussed in 
subsection 10.3.2, as long as one is very close to the surface, the field 
only depends on the local charge density. In fact, a side-benefit of 
Gauss’s program of exploration is that any such local irregularities 
will be mapped out. But what the newspapers find exciting is the 
idea that once all the teams get back from their voyages and tabulate 
their data, the total charge of the planet will have been determined 
for the first time. Each surveying team is assigned to visit a certain 
list of republics, duchies, city-states, and so on. They are to record 
each territory’s electric field vector, as well as its area. Because the 
electric field may be nonuniform, the final equation for determining 
the planet’s electric charge will have many terms, not just one for 
each side of the planet: 


O= S> Ej - Aj = 4tkdtotal 
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Gauss herself leads one of the expeditions, which heads due east, 
toward the distant Tail Kingdom, known only from fables and the 
occasional account from a caravan of traders. A strange thing hap- 
pens, however. Gauss embarks from her college town in the wetlands 
of the Tongue Republic, travels straight east, passes right through 
the Tail Kingdom, and one day finds herself right back at home, all 
without ever seeing the edge of the world! What can have happened? 
All at once she realizes that the world isn’t flat. 


Now what? The surveying teams all return, the data are tabu- 
lated, and the result for the total charge of Flatcat is (1/47k) >> E;- 
A; = 87 nC (units of nanocoulombs). But the equation was derived 
under the assumption that Flatcat was a disk. If Flatcat is really 
round, then the result may be completely wrong. Gauss and two 
of her grad students go to their favorite bar, and decide to keep 
on ordering Bloody Marys until they either solve their problems or 
forget them. One student suggests that perhaps Flatcat really is a 
disk, but the edges are rounded. Maybe the surveying teams really 
did flip over the edge at some point, but just didn’t realize it. Under 
this assumption, the original equation will be approximately valid, 
and 37 nC really is the total charge of Flatcat. 


A second student, named Newton, suggests that they take seri- 
ously the possibility that Flatcat is a sphere. In this scenario, their 
planet’s surface is really curved, but the surveying teams just didn’t 
notice the curvature, since they were close to the surface, and the 
surface was so big compared to them. They divided up the surface 
into a patchwork, and each patch was fairly small compared to the 
whole planet, so each patch was nearly flat. Since the patch is nearly 
flat, it makes sense to define an area vector that is perpendicular to 
it. In general, this is how we define the direction of an area vector, 
as shown in figure d. This only works if the areas are small. For in- 
stance, there would be no way to define an area vector for an entire 
sphere, since “outward” is in more than one direction. 


If Flatcat is a sphere, then the inside of the sphere must be 
vast, and there is no way of knowing exactly how the charge is 
arranged below the surface. However, the survey teams all found 
that the electric field was approximately perpendicular to the surface 
everywhere, and that its strength didn’t change very much from one 
location to another. The simplest explanation is that the charge 
is all concentrated in one small lump at the center of the sphere. 
They have no way of knowing if this is really the case, but it’s a 
hypothesis that allows them to see how much their 37 nC result 
would change if they assumed a different geometry. Making this 
assumption, Newton performs the following simple computation on 
a napkin. The field at the surface is related to the charge at the 
center by 


c/Each part of the surface 
has its own area vector. 
the differences in lengths of the 
vectors, corresponding to the 


unequal areas. 


d/An_ area_ vector 


can 


Note 


be 


defined for a sufficiently small 


part of a curved surface. 
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Kdtotal 
B= ee 
iB] = “Hs 


where r is the radius of Flatcat. The flux is then 


b=S°B-A; 


and since the E; and A; vectors are parallel, the dot product equals 
|E;|| Aj], so 


Kdtotal 
0 = tata, 


But the field strength is always the same, so we can take it outside 
the sum, giving 


Kdtotal 
oe ar do IAs 


_ kdtotal A 
= 5) total 
r 


2 


kdtotal 
= “total arp 
r 


= 4rk Gtotal 


Not only have all the factors of r canceled out, but the result is 
the same as for a disk! 


Everyone is pleasantly surprised by this apparent mathematical 
coincidence, but is it anything more than that? For instance, what 
if the charge wasn’t concentrated at the center, but instead was 
evenly distributed throughout Flatcat’s interior volume? Newton, 
however, is familiar with a result called the shell theorem (page 102), 
which states that the field of a uniformly charged sphere is the same 
as if all the charge had been concentrated at its center.!° We now 
have three different assumptions about the shape of Flatcat and the 
arrangement of the charges inside it, and all three lead to exactly the 
same mathematical result, ® = 47kqotqi. This is starting to look 
like more than a coincidence. In fact, there is a general mathematical 
theorem, called Gauss’ theorem, which states the following: 


For any region of space, the flux through the surface equals 
Atkqin, where gin is the total charge in that region. 


Don’t memorize the factor of 47 in front — you can rederive it 
any time you need to, by considering a spherical surface centered on 
a point charge. 


'0Newton’s human namesake actually proved this for gravity, not electricity, 
but they’re both 1/r? forces, so the proof works equally well in both cases. 


Chapter 10 Fields 


Note that although region and its surface had a definite physical 
existence in our story — they are the planet Flatcat and the surface 
of planet Flatcat — Gauss’ law is true for any region and surface we 
choose, and in general, the Gaussian surface has no direct physical 
significance. It’s simply a computational tool. 


Rather than proving Gauss’ theorem and then presenting some 
examples and applications, it turns out to be easier to show some ex- 
amples that demonstrate its salient properties. Having understood 
these properties, the proof becomes quite simple. 


self-check K 

Suppose we have a negative point charge, whose field points inward, 
and we pick a Gaussian surface which is a sphere centered on that 
charge. How does Gauss’ theorem apply here? > Answer, p. 871 


10.6.2 Additivity of flux 


Figure e shows two two different ways in which flux is additive. 
Figure e/1, additivity by charge, shows that we can break down a 
charge distribution into two or more parts, and the flux equals the 
sum of the fluxes due to the individual charges. This follows directly 
from the fact that the flux is defined in terms of a dot product, E-A, 
and the dot product has the additive property (a+b)-c = a-c+b-c. 


To understand additivity of flux by region, e/2, we have to con- 
sider the parts of the two surfaces that were eliminated when they 
were joined together, like knocking out a wall to make two small 
apartments into one big one. Although the two regions shared this 
wall before it was removed, the area vectors were opposite: the di- 
rection that is outward from one region is inward with respect to 
the other. Thus if the field on the wall contributes positive flux to 
one region, it contributes an equal amount of negative flux to the 
other region, and we can therefore eliminate the wall to join the two 
regions, without changing the total flux. 


10.6.3 Zero flux from outside charges 


A third important property of Gauss’ theorem is that it only 
refers to the charge inside the region we choose to discuss. In other 
words, it asserts that any charge outside the region contributes zero 
to the flux. This makes at least some sense, because a charge outside 
the region will have field vectors pointing into the surface on one 
side, and out of the surface on the other. Certainly there should 
be at least partial cancellation between the negative (inward) flux 
on one side and the positive (outward) flux on the other. But why 
should this cancellation be exact? 


To see the reason for this perfect cancellation, we can imagine 
space as being built out of tiny cubes, and we can think of any charge 
distribution as being composed of point charges. The additivity-by- 
charge property tells us that any charge distribution can be handled 


- . 


eT 


Soe 


e/1. The flux due to two 
charges equals the sum of the 
fluxes from each one. 2. When 
two regions are joined together, 
the flux through the new region 
equals the sum of the fluxes 
through the two parts. 
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f/ The flux through a tiny cube 
due to a point charge. 


by considering its point charges individually, and the additivity-by- 
region property tells us that if we have a single point charge outside 
a big region, we can break the region down into tiny cubes. If we 
can prove that the flux through such a tiny cube really does cancel 
exactly, then the same must be true for any region, which we could 
build out of such cubes, and any charge distribution, which we can 
build out of point charges. 


For simplicity, we will carry out this calculation only in the spe- 
cial case shown in figure f, where the charge lies along one axis of 
the cube. Let the sides of the cube have length 2b, so that the area 
of each side is (2b)? = 4b?. The cube extends a distance b above, 
below, in front of, and behind the horizontal x axis. There is a dis- 
tance d — b from the charge to the left side, and d+ 6 to the right 
side. 

There will be one negative flux, through the left side, and five 
positive ones. Of these positive ones, the one through the right side 
is very nearly the same in magnitude as the negative flux through 
the left side, but just a little less because the field is weaker on 
the right, due to the greater distance from the charge. The fluxes 
through the other four sides are very small, since the field is nearly 
perpendicular to their area vectors, and the dot product E; - A; is 
zero if the two vectors are perpendicular. In the limit where 0 is 
very small, we can approximate the flux by evaluating the field at 
the center of each of the cube’s six sides, giving 


® = Defi + 4® side + Bright 
= |E je ft|| Ave ft| cos 180° + 4|Eside||Aside| COS Oside 
+ |Eright| |Aright| COs 0° r) 


and a little trig gives cos Aside © b/d, so 
b 
® = —|Eje || Ate fe| + 4|Eside||Aside| 5 + |E,ignt||Arigne| 
b 
= (4b”) (-iBierl + 4|Esside| 5 + [Erol 
kq kq b kq 
= (40° p44 
@) (-gie Eat wa) 
_ (Akgb? ee eee 
i d? (1—b/d)? d= (1+4+6b/d)? 


Using the approximation (1+.¢)~? ~ 1—2e for small €, this becomes 


Akqb? 2b. Ab 2b 
= it +1 
( a2 yee dd 7) 


=0 


Thus in the limit of a very small cube, b < d, we have proved 
that the flux due to this exterior charge is zero. The proof can be 
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extended to the case where the charge is not along any axis of the 
cube,!! and based on additivity we then have a proof that the flux 
due to an outside charge is always zero. 


Discussion Questions 


c ———— 
<— 
increasing 
current 


A One question that might naturally occur to you about Gauss’s law 
is what happens for charge that is exactly on the surface — should it be 
counted toward the enclosed charge, or not? If charges can be perfect, 
infinitesimal points, then this could be a physically meaningful question. 
Suppose we approach this question by way of a limit: start with charge q 
spread out over a sphere of finite size, and then make the size of the 
sphere approach zero. The figure shows a uniformly charged sphere 
that’s exactly half-way in and half-way out of the cubical Gaussian sur- 
face. What is the flux through the cube, compared to what it would be if 
the charge was entirely enclosed? (There are at least three ways to find 
this flux: by direct integration, by Gauss’s law, or by the additivity of flux 
by region.) 


B___ The dipole is completely enclosed in the cube. What does Gauss’s 
law say about the flux through the cube? If you imagine the dipole’s field 
pattern, can you verify that this makes sense? 


C __ The wire passes in through one side of the cube and out through 
the other. If the current through the wire is increasing, then the wire will 
act like an inductor, and there will be a voltage difference between its 
ends. (The inductance will be relatively small, since the wire isn’t coiled 
up, and the AV will therefore also be fairly small, but still not zero.) The 
AV implies the existence of electric fields, and yet Gauss’s law says the 
flux must be zero, since there is no charge inside the cube. Why isn’t 
Gauss’s law violated? 


“The math gets messy for the off-axis case. This part of the proof can be 
completed more easily and transparently using the techniques of section 10.7, 
and that is exactly we’ll do in example 36 on page 584. 


g / Discussion question A-D. 
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h / Completing 
Gauss’ theorem. 
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D The charge has been loitering near the edge of the cube, but is 
then suddenly hit with a mallet, causing it to fly off toward the left side 
of the cube. We haven't yet discussed in detail how disturbances in the 
electric and magnetic fields ripple outward through space, but it turns out 
that they do so at the speed of light. (In fact, that’s what light is: ripples 
in the electric and magnetic fields.) Because the charge is closer to the 
left side of the cube, the change in the electric field occurs there before 
the information reaches the right side. This would seem certain to lead 
to a violation of Gauss’s law. How can the ideas explored in discussion 
question C show the resolution to this paradox? 


10.6.4 Proof of Gauss’ theorem 


With the computational machinery we’ve developed, it is now 
simple to prove Gauss’ theorem. Based on additivity by charge, it 
suffices to prove the law for a point charge. We have already proved 
Gauss’ law for a point charge in the case where the point charge is 
outside the region. If we can prove it for the inside case, then we’re 
all done. 


If the charge is inside, we reason as follows. First, we forget 
about the actual Gaussian surface of interest, and instead construct 
a spherical one, centered on the charge. For the case of a sphere, 
we’ve already seen the proof written on a napkin by the flea named 
Newton (page 573). Now wherever the actual surface sticks out 
beyond the sphere, we glue appropriately shaped pieces onto the 
sphere. In the example shown in figure h, we have to add two 
Mickey Mouse ears. Since these added pieces do not contain the 
point charge, the flux through them is zero, and additivity of flux 
by region therefore tells us that the total flux is not changed when 
we make this alteration. Likewise, we need to chisel out any regions 
where the sphere sticks out beyond the actual surface. Again, there 
is no change in flux, since the region being altered doesn’t contain 
the point charge. This proves that the flux through the Gaussian 
surface of interest is the same as the flux through the sphere, and 
since we’ve already proved that that flux equals 47kq;,, our proof 
of Gauss’ theorem is complete. 


Discussion Questions 

A A critical part of the proof of Gauss’ theorem was the proof that 
a tiny cube has zero flux through it due to an external charge. Discuss 
qualitatively why this proof would fail if Coulomb’s law was a 1/r or 1/r° 
law. 


10.6.5 Gauss’ law as a fundamental law of physics 


Note that the proof of Gauss’ theorem depended on the compu- 
tation on the napkin discussed on page 10.6.1. The crucial point in 
this computation was that the electric field of a point charge falls 
off like 1/r?, and since the area of a sphere is proportional to r?, 
the result is independent of r. The 1/r? variation of the field also 
came into play on page 576 in the proof that the flux due to an out- 
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side charge is zero. In other words, if we discover some other force 
of nature which is proportional to 1/r? or r, then Gauss’ theorem 
will not apply to that force. Gauss’ theorem is not true for nuclear 
forces, which fall off exponentially with distance. However, this is 
the only assumption we had to make about the nature of the field. 
Since gravity, for instance, also has fields that fall off as 1/r?, Gauss’ 
theorem is equally valid for gravity — we just have to replace mass 
with charge, change the Coulomb constant & to the gravitational 
constant G, and insert a minus sign because the gravitational fields 
around a (positive) mass point inward. 


Gauss’ theorem can only be proved if we assume a 1/r? field, 
and the converse is also true: any field that satisfies Gauss’ theo- 
rem must be a 1/r? field. Thus although we previously thought of 
Coulomb’s law as the fundamental law of nature describing electric 
forces, it is equally valid to think of Gauss’ theorem as the basic law 
of nature for electricity. From this point of view, Gauss’ theorem is 
not a mathematical fact but an experimentally testable statement 
about nature, so we'll refer to it as Gauss’ law, just as we speak of 
Coulomb’s law or Newton’s law of gravity. 


If Gauss’ law is equivalent to Coulomb’s law, why not just use 
Coulomb’s law? First, there are some cases where calculating a 
field is easy with Gauss’ law, and hard with Coulomb’s law. More 
importantly, Gauss’ law and Coulomb’s law are only mathematically 
equivalent under the assumption that all our charges are standing 
still, and all our fields are constant over time, i.e., in the study 
of electrostatics, as opposed to electrodynamics. As we broaden 
our scope to study generators, inductors, transformers, and radio 
antennas, we will encounter cases where Gauss’ law is valid, but 
Coulomb’s law is not. 


10.6.6 Applications 


Often we encounter situations where we have a static charge 
distribution, and we wish to determine the field. Although super- 
position is a generic strategy for solving this type of problem, if the 
charge distribution is symmetric in some way, then Gauss’ law is 
often a far easier way to carry out the computation. 


Field of a long line of charge 


Consider the field of an infinitely long line of charge, holding a 
uniform charge per unit length A. Computing this field by brute- 
force superposition was fairly laborious (examples 10 on page 528 
and 13 on page 534). With Gauss’ law it becomes a very simple 
calculation. 


The problem has two types of symmetry. The line of charge, 
and therefore the resulting field pattern, look the same if we rotate 
them about the line. The second symmetry occurs because the line 
is infinite: if we slide the line along its own length, nothing changes. 


i/ Applying Gauss’ law to an 
infinite line of charge. 
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j/Applying Gauss’ law to an 
infinite charged surface. 


This sliding symmetry, known as a translation symmetry, tells us 
that the field must point directly away from the line at any given 
point. 


Based on these symmetries, we choose the Gaussian surface 
shown in figure i. If we want to know the field at a distance R 
from the line, then we choose this surface to have a radius R, as 
shown in the figure. The length, DL, of the surface is irrelevant. 


The field is parallel to the surface on the end caps, and therefore 
perpendicular to the end caps’ area vectors, so there is no contribu- 
tion to the flux. On the long, thin strips that make up the rest of the 
surface, the field is perpendicular to the surface, and therefore paral- 
lel to the area vector of each strip, so that the dot product occurring 
in the definition of the flux is E; - A; = |E;||Aj;||cos 0° = |E;||Aj]. 
Gauss’ law gives 


Atkin = SE, : Aj 
4nkAL = 5 > |B;||Ay| 


The magnitude of the field is the same on every strip, so we can 
take it outside the sum. 


4nkAL = \B| > |AG| 


In the limit where the strips are infinitely narrow, the surface be- 
comes a cylinder, with (area)=(circumference) (length)=27RL. 


4rkAL = |E| x 2nRL 
QkA 
E| =— 
IE|=— 


Field near a surface charge 


As claimed earlier, the result E = 27ko for the field near a 
charged surface is a special case of Gauss’ law. We choose a Gaussian 
surface of the shape shown in figure j, known as a Gaussian pillbox. 
The exact shape of the flat end caps is unimportant. 


The symmetry of the charge distribution tells us that the field 
points directly away from the surface, and is equally strong on both 
sides of the surface. This means that the end caps contribute equally 
to the flux, and the curved sides have zero flux through them. If the 
area of each end cap is A, then 


Atkqin = E, - Ai, + Eg: Ap ; 


where the subscripts 1 and 2 refer to the two end caps. We have 
A» = —Aj, so 


Atkin = Ej, ‘ Ay = Eo : Aj 
Atkin = (Ey = Ez) y Aj , 
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and by symmetry the magnitudes of the two fields are equal, so 


2|E|A = 4rkoA 
|E| = 2rko 


The symmetry between the two sides could be broken by the 
existence of other charges nearby, whose fields would add onto the 
field of the surface itself. Even then, Gauss’s law still guarantees 


Atkqin = (Ey — Eg) - Ay ; 
or 
|E. 1 — E,2| =4rko ; 


where the subscript | indicates the component of the field parallel 
to the surface (i.e., parallel to the area vectors). In other words, 
the electric field changes discontinuously when we pass through a 
charged surface; the discontinuity occurs in the component of the 
field perpendicular to the surface, and the amount of discontinuous 
change is 4rko. This is a completely general statement that is true 
near any charged surface, regardless of the existence of other charges 
nearby. 
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surface. 


10.7 Gauss’ Law In Differential Form 


Gauss’ law is a bit spooky. It relates the field on the Gaussian 
surface to the charges inside the surface. What if the charges have 
been moving around, and the field at the surface right now is the one 
that was created by the charges in their previous locations? Gauss’ 
law — unlike Coulomb’s law — still works in cases like these, but 
it’s far from obvious how the flux and the charges can still stay in 
agreement if the charges have been moving around. 


For this reason, it would be more physically attractive to restate 
Gauss’ law in a different form, so that it related the behavior of 
the field at one point to the charges that were actually present at 
that point. This is essentially what we were doing in the fable of 
the flea named Gauss: the fleas’ plan for surveying their planet was 
essentially one of dividing up the surface of their planet (which they 
believed was flat) into a patchwork, and then constructing small 
a Gaussian pillbox around each small patch. The equation FE, = 
2rko then related a particular property of the local electric field to 
the local charge density. 


In general, charge distributions need not be confined to a flat 
surface — life is three-dimensional — but the general approach of 
defining very small Gaussian surfaces is still a good one. Our strat- 
egy is to divide up space into tiny cubes, like the one on page 575. 
Each such cube constitutes a Gaussian surface, which may contain 
some charge. Again we approximate the field using its six values at 
the center of each of the six sides. Let the cube extend from x to 
x+dz, from y to y+dy, and from y to y + dy. 


The sides at x and «+ da have area vectors —dydzx and dydzx, 
respectively. The flux through the side at x is —E,(x)dydz, and 
the flux through the opposite side, at x + dx is F(x + dx)dydz. 
The sum of these is (E,(a + dx) — E,(x))dydz, and if the field 
was uniform, the flux through these two opposite sides would be 
zero. It will only be zero if the field’s x component changes as a 
function of «. The difference E,.(a + dx) — E(x) can be rewritten 
as dE, = (dE,)/(dx)dx, so the contribution to the flux from these 
two sides of the cube ends up being 


Ey 
g dadydz 
dz 


Doing the same for the other sides, we end up with a total flux 


dE, dk, dk 

a= x y z 

‘ (= dy 7 dz 
dE, . dé, . dE, 

-(= - dy 2 i) a 


) dadydz 


’ 
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where dv is the volume of the cube. In evaluating each of these 
three derivatives, we are going to treat the other two variables as 
constants, to emphasize this we use the partial derivative notation 
O introduced in chapter 3, 


_ (OE OE, , OE, 
ao = (F4 7) ae 


Using Gauss’ law, 


Atkdin = & ou; =) dv 


Ox Oy | Oz 


and we introduce the notation p (Greek letter rho) for the charge 
per unit volume, giving 


The quantity on the right is called the divergence of the electric 
field, written div E. Using this notation, we have 


div E = 4rkp 


This equation has all the same physical implications as Gauss’ law. 
After all, we proved Gauss’ law by breaking down space into little 
cubes like this. We therefore refer to it as the differential form of 
Gauss’ law, as opposed to ® = 4rkqin, which is called the integral 
form. 


Figure b shows an intuitive way of visualizing the meaning of 
the divergence. The meter consists of some electrically charged balls 
connected by springs. If the divergence is positive, then the whole 
cluster will expand, and it will contract its volume if it is placed at 
a point where the field has div E < 0. What if the field is constant? 
We know based on the definition of the divergence that we should 
have div E = 0 in this case, and the meter does give the right result: 
all the balls will feel a force in the same direction, but they will 
neither expand nor contract. 


Divergence of a sine wave example 35 
> Figure c shows an electric field that varies as a sine wave. This 
is in fact what you'd see in a light wave: light is a wave pattern 
made of electric and magnetic fields. (The magnetic field would 
look similar, but would be in a plane perpendicular to the page.) 
What is the divergence of such a field, and what is the physical 
significance of the result? 


> Intuitively, we can see that no matter where we put the div-meter 
in this field, it will neither expand nor contract. For instance, if we 
put it at the center of the figure, it will start spinning, but that’s it. 


b/A meter — for 
div E. 


c/ Example 35. 
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Mathematically, let the x axis be to the right and let y be up. The 
field is of the form 


E = (sinkx) y : 


where the constant K is not to be confused with Coulomb’s con- 
stant. Since the field has only a y component, the only term in 
the divergence we need to evaluate is 


OE, 
Biskeya 


but this vanishes, because Ey depends only on x, not y: we treat 
y as a constant when evaluating the partial derivative OE, /oy, 
and the derivative of an expression containing only constants 
must be zero. 


Physically this is a very important result: it tells us that a light 
wave can exist without any charges along the way to “keep it go- 
ing.” In other words, light can travel through a vacuum, a region 
with no particles in it. If this wasn’t true, we’d be dead, because 
the sun’s light wouldn’t be able to get to us through millions of 
kilometers of empty space! 


Electric field of a point charge example 36 
The case of a point charge is tricky, because the field behaves 
badly right on top of the charge, blowing up and becoming dis- 
continuous. At this point, we cannot use the component form of 
the divergence, since none of the derivatives are well defined. 
However, a little visualization using the original definition of the 
divergence will quickly convince us that div E is infinite here, and 
that makes sense, because the density of charge has to be in- 
finite at a point where there is a zero-size point of charge (finite 
charge in zero volume). 


At all other points, we have 


where f = r/r = (xX + y¥ + 2Z)/r is the unit vector pointing radially 
away from the charge. The field can therefore be written as 


kqQ(xX + yy + 22) 
7 Tes 


(x? + y? + 22 


The three terms in the divergence are all similar, e.g., 


OE, be O xX 
ax ~ “Tax (x2 + y2 + 22)°? 


ae 1 3 2x? 
(x2 +2422)? 2 (x2 4 y2 4 22)” 
=kq ae - 3x?/-5) 


Straightforward algebra shows that adding in the other two terms 
results in zero, which makes sense, because there is no charge 
except at the origin. 


Gauss’ law in differential form lends itself most easily to finding 
the charge density when we are give the field. What if we want 
to find the field given the charge density? As demonstrated in the 
following example, one technique that often works is to guess the 
general form of the field based on experience or physical intuition, 
and then try to use Gauss’ law to find what specific version of that 
general form will be a solution. 


The field inside a uniform sphere of charge example 37 
> Find the field inside a uniform sphere of charge whose charge 
density is p. (This is very much like finding the gravitational field 
at some depth below the surface of the earth.) 

> By symmetry we know that the field must be purely radial (in 
and out). We guess that the solution might be of the form 

E = brPf 

where r is the distance from the center, and b and p are con- 
stants. A negative value of p would indicate a field that was 
strongest at the center, while a positive p would give zero field 
at the center and stronger fields farther out. Physically, we know 


by symmetry that the field is zero at the center, so we expect p to 
be positive. 


As in the example 36, we rewrite fF as r/r, and to simplify the 
writing we define n = p— 1, so 


E=obr"r 
Gauss’ law in differential form is 
div E = 47kp : 


so we want a field whose divergence is constant. For a field of 
the form we guessed, the divergence has terms in it like 
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The partial derivative Or/Ox is easily calculated to be x/r, so 


OE, 
Ox 


=b (nr-2x2 rt .) 


Adding in similar expressions for the other two terms in the diver- 
gence, and making use of x? + y? + z* = r?, we have 


div E = b(n+3)r” 


This can indeed be constant, but only if nis O or —3, i.e., pis 1 
or —2. The second solution gives a divergence which is constant 
and zero: this is the solution for the outside of the sphere! The 
first solution, which has the field directly proportional to r, must 
be the one that applies to the inside of the sphere, which is what 
we care about right now. Equating the coefficient in front to the 
one in Gauss’ law, the field is 


_ ATtKP 


E 
3 


The field is zero at the center, and gets stronger and stronger as 
we approach the surface. 


Discussion Questions 


A As suggested by the figure, discuss the results you would get by 
inserting the div-meter at various locations in the sine-wave field. 


This chapter is summarized on page 902. Notation and terminology 
are tabulated on pages 885-886. 
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Problems 
The symbols Vv, “, etc. are explained on page 596. 


1 The gap between the electrodes in an automobile engine’s 
spark plug is 0.060 cm. To produce an electric spark in a gasoline- 
air mixture, an electric field of 3.0 x 10° V/m must be achieved. 
On starting a car, what minimum voltage must be supplied by the 
ignition circuit? Assume the field is uniform. 

(b) The small size of the gap between the electrodes is inconvenient 
because it can get blocked easily, and special tools are needed to 
measure it. Why don’t they design spark plugs with a wider gap? 


2 (a) As suggested in example 9 on page 527, use approximations 
to show that the expression given for the electric field approaches 
kQ/d? for large d. 

(b) Do the same for the result of example 12 on page 531. L 


3 Astronomers believe that the mass distribution (mass per 
unit volume) of some galaxies may be approximated, in spherical 
coordinates, by p = ae~°", for 0 < r < oo, where p is the density. 
Find the total mass. ~ 


4 (a) At time t = 0, a positively charged particle is placed, 
at rest, in a vacuum, in which there is a uniform electric field of 
magnitude E. Write an equation giving the particle’s speed, v, in 
terms of t, &, and its mass and charge m and q. 

(b) If this is done with two different objects and they are observed 
to have the same motion, what can you conclude about their masses 
and charges? (For instance, when radioactivity was discovered, it 
was found that one form of it had the same motion as an electron 
in this type of experiment.) 


5 Show that the alternative definition of the magnitude of the 
electric field, |E| = 7/D;sin 6, has units that make sense. 


6 Redo the calculation of example 5 on page 520 using a different 
origin for the coordinate system, and show that you get the same 
result. & 


7 The definition of the dipole moment, D = )> qr;, involves the 
vector r; stretching from the origin of our coordinate system out to 
the charge q;. There are clearly cases where this causes the dipole 
moment to be dependent on the choice of coordinate system. For 
instance, if there is only one charge, then we could make the dipole 
moment equal zero if we chose the origin to be right on top of the 
charge, or nonzero if we put the origin somewhere else. 

(a) Make up a numerical example with two charges of equal mag- 
nitude and opposite sign. Compute the dipole moment using two 
different coordinate systems that are oriented the same way, but 
differ in the choice of origin. Comment on the result. 
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(b) Generalize the result of part a to any pair of charges with equal 
magnitude and opposite sign. This is supposed to be a proof for any 
arrangement of the two charges, so don’t assume any numbers. 

(c) Generalize further, to n charges. & 


8 Compare the two dipole moments. z 


9 Find an arrangement of charges that has zero total charge and 
zero dipole moment, but that will make nonvanishing electric fields. 


~ 


10 As suggested in example 11 on page 529, show that you can 
get the same result for the on-axis field by differentiating the voltage 


11 Three charges are arranged on a square as shown. All three 
charges are positive. What value of q2/q1 will produce zero electric 
field at the center of the square? vo 


12 This is a one-dimensional problem, with everything confined 
to the x axis. Dipole A consists of a —1.000 C charge at x = 0.000 
m and a 1.000 C charge at x = 1.000 m. Dipole B has a —2.000 C 
charge at x = 0.000 m and a 2.000 C charge at x = 0.500 m. 

(a) Compare the two dipole moments. 

(b) Calculate the field created by dipole A at x = 10.000 m, and 
compare with the field dipole B would make. Comment on the 
result. Vv 


13 In our by-now-familiar neuron, the voltage difference be- 
tween the inner and outer surfaces of the cell membrane is about 
Vout — Vin = —70 mV in the resting state, and the thickness of the 
membrane is about 6.0 nm (i.e., only about a hundred atoms thick). 
What is the electric field inside the membrane? v OE 


14 A proton is in a region in which the electric field is given by 
E =a+bzx?. If the proton starts at rest at 2; = 0, find its speed, 
v, when it reaches position x2. Give your answer in terms of a, b, 
x2, and e and m, the charge and mass of the proton. vo 


15 (a) Given that the on-axis field of a dipole at large distances is 
proportional to D/r?, show that its voltage varies as D/r?. (Ignore 
positive and negative signs and numerical constants of proportion- 
ality.) 

(b) Write down an exact expression for the voltage of a two-charge 
dipole at an on-axis point, without assuming that the distance is 
large compared to the size of the dipole. Your expression will have 
to contain the actual charges and size of the dipole, not just its dipole 
moment. Now use approximations to show that, at large distances, 
this is consistent with your answer to part a. b> Hint, p. 865 & 


16 A hydrogen atom is electrically neutral, so at large distances, 
we expect that it will create essentially zero electric field. This is 
not true, however, near the atom or inside it. Very close to the 
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proton, for example, the field is very strong. To see this, think of 
the electron as a spherically symmetric cloud that surrounds the 
proton, getting thinner and thinner as we get farther away from the 
proton. (Quantum mechanics tells us that this is a more correct 
picture than trying to imagine the electron orbiting the proton.) 
Near the center of the atom, the electron cloud’s field cancels out 
by symmetry, but the proton’s field is strong, so the total field is 
very strong. The voltage in and around the hydrogen atom can be 
approximated using an expression of the form V = r-te~". (The 
units come out wrong, because I’ve left out some constants.) Find 
the electric field corresponding to this voltage, and comment on its 
behavior at very large and very small r. > Solution, p. 881 & 


17 A carbon dioxide molecule is structured like O-C-O, with 
all three atoms along a line. The oxygen atoms grab a little bit of 
extra negative charge, leaving the carbon positive. The molecule’s 
symmetry, however, means that it has no overall dipole moment, 
unlike a V-shaped water molecule, for instance. Whereas the voltage 
of a dipole of magnitude D is proportional to D/r? (see problem 15), 
it turns out that the voltage of a carbon dioxide molecule at a distant 
point along the molecule’s axis equals b/r?, where r is the distance 
from the molecule and 6 is a constant (cf. problem 9). What would 
be the electric field of a carbon dioxide molecule at a point on the 
molecule’s axis, at a distance r from the molecule? v 


18 A hydrogen atom in a particular state has the charge density 
(charge per unit volume) of the electron cloud given by p = ae" z?, 
where r is the distance from the proton, and z is the coordinate mea- 
sured along the z axis. Given that the total charge of the electron 
cloud must be —e, find a in terms of the other variables. & 


\<———_ .R ——>e 


ae 
19 A dipole has a midplane, i.e., the plane that cuts through the | | 
dipole’s center, and is perpendicular to the dipole’s axis. Consider +-q 
a two-charge dipole made of point charges +q located at z = +¢/2. 
Use approximations to find the field at a distant point in the mid- Problem 19. 
plane, and show that its magnitude comes out to be kD/R?® (half 


what it would be at a point on the axis lying an equal distance from 
the dipole). = 
20 ~The figure shows a vacuum chamber surrounded by four metal i + 
electrodes shaped like hyperbolas. (Yes, physicists do sometimes ask 
their university machine shops for things machined in mathematical 
shapes like this. They have to be made on computer-controlled 
mills.) We assume that the electrodes extend far into and out of 
the page along the unseen z axis, so that by symmetry, the electric 
fields are the same for all z. The problem is therefore effectively + — 
two-dimensional. T'wo of the electrodes are at voltage +V,, and 
the other two at —Vo, as shown. The equations of the hyperbolic 


surfaces are |xy| = 6”, where b is a constant. (We can interpret b as 
giving the locations « = +b, y = +b of the four points on the surfaces 


Problem 20. 
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that are closest to the central axis.) There is no obvious, pedestrian 
way to determine the field or voltage in the central vacuum region, 
but there’s a trick that works: with a little mathematical insight, 
we see that the voltage V = Vob~?xy is consistent with all the 
given information. (Mathematicians could prove that this solution 
was unique, but a physicist knows it on physical grounds: if there 
were two different solutions, there would be no physical way for 
the system to decide which one to do!) (a) Use the techniques of 
subsection 10.2.2 to find the field in the vacuum region, and (b) 
sketch the field as a “sea of arrows.” Vv 


21 (a) A certain region of three-dimensional space has a voltage 
that varies as V = br?, where r is the distance from the origin. Use 
the techniques of subsection 10.2.2 to find the field. v 
(b) Write down another voltage that gives exactly the same field. 


~— 


22 (a) Example 10 on page 528 gives the field of a charged rod in 
its midplane. Starting from this result, take the limit as the length 
of the rod approaches infinity. Note that is not changing, so as D 
gets bigger, the total charge Q increases. > Answer, p. 875 
(b) In the text, I have shown (by several different methods) that the 
field of an infinite, uniformly charged plane is 27ka. Now you’re 
going to rederive the same result by a different method. Suppose 
that it is the x — y plane that is charged, and we want to find the 
field at the point (0,0,z). (Since the plane is infinite, there is no 
loss of generality in assuming x = 0 and y = 0.) Imagine that we 
slice the plane into an infinite number of straight strips parallel to 
the y axis. Each strip has infinitesimal width dx, and extends from 
x to x+dz. The contribution any one of these strips to the field 
at our point has a magnitude which can be found from part a. By 
vector addition, prove the desired result for the field of the plane of 
charge. z 


23 Consider the electric field created by a uniformly charged 
cylindrical surface that extends to infinity in one direction. 

(a) Show that the field at the center of the cylinder’s mouth is 27ko, 
which happens to be the same as the field of an infinite flat sheet of 
charge! 

(b) This expression is independent of the radius of the cylinder. 
Explain why this should be so. For example, what would happen if 
you doubled the cylinder’s radius? z 


24 In an electrical storm, the cloud and the ground act like a 
parallel-plate capacitor, which typically charges up due to frictional 
electricity in collisions of ice particles in the cold upper atmosphere. 
Lightning occurs when the magnitude of the electric field builds up 
to a critical value, E., at which air is ionized. 

(a) Treat the cloud as a flat square with sides of length L. If it is at 
a height h above the ground, find the amount of energy released in 
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the lightning strike. Vv 
(b) Based on your answer from part a, which is more dangerous, a 
lightning strike from a high-altitude cloud or a low-altitude one? 
(c) Make an order-of-magnitude estimate of the energy released by 
a typical lightning bolt, assuming reasonable values for its size and 
altitude. E, is about 10° V/m. E 


25 (a) Show that the energy in the electric field of a point charge 
is infinite! Does the integral diverge at small distances, at large dis- 
tances, or both? > Hint, p. 865 
(b) Now calculate the energy in the electric field of a uniformly 
charged sphere with radius 6. Based on the shell theorem, it can 
be shown that the field for r > 6 is the same as for a point charge, 
while the field for r < b is kgqr/b®. (Example 37 shows this using a 
different technique.) 


Remark: The calculation in part a seems to show that infinite energy would 
be required in order to create a charged, pointlike particle. However, there are 
processes that, for example, create electron-positron pairs, and these processes 
don’t require infinite energy. According to Einstein’s famous equation E = mc’, 
the energy required to create such a pair should only be 2mc?, which is finite. 
One way out of this difficulty is to assume that no particle is really pointlike, and 
this is in fact the main motivation behind a speculative physical theory called 
string theory, which posits that charged particles are actually tiny loops, not 
points. _ 


26 The neuron in the figure has been drawn fairly short, but 
some neurons in your spinal cord have tails (axons) up to a meter 
long. The inner and outer surfaces of the membrane act as the 
“plates” of a capacitor. (The fact that it has been rolled up into a 
cylinder has very little effect.) In order to function, the neuron must 
create a voltage difference V between the inner and outer surfaces 
of the membrane. Let the membrane’s thickness, radius, and length 
be t, r, and L. (a) Calculate the energy that must be stored in 
the electric field for the neuron to do its job. (In real life, the 
membrane is made out of a substance called a dielectric, whose 
electrical properties increase the amount of energy that must be 
stored. For the sake of this analysis, ignore this fact.) 

> Hint, p. 865 Vv 
(b) An organism’s evolutionary fitness should be better if it needs 
less energy to operate its nervous system. Based on your answer to 
part a, what would you expect evolution to do to the dimensions t 
and r? What other constraints would keep these evolutionary trends 
from going too far? g 


27 The figure shows cross-sectional views of two cubical ca- 
pacitors, and a cross-sectional view of the same two capacitors put 
together so that their interiors coincide. A capacitor with the plates 
close together has a nearly uniform electric field between the plates, 
and almost zero field outside; these capacitors don’t have their plates 
very close together compared to the dimensions of the plates, but 
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for the purposes of this problem, assume that they still have ap- 
proximately the kind of idealized field pattern shown in the figure. 
Each capacitor has an interior volume of 1.00 m°, and is charged up 
to the point where its internal field is 1.00 V/m. 

(a) Calculate the energy stored in the electric field of each capacitor 
when they are separate. v 
(b) Calculate the magnitude of the interior field when the two ca- 
pacitors are put together in the manner shown. Ignore effects arising 
from the redistribution of each capacitor’s charge under the influ- 
ence of the other capacitor. Vv 
(c) Calculate the energy of the put-together configuration. Does as- 
sembling them like this release energy, consume energy, or neither? 


~ 


28 Find the capacitance of the surface of the earth, assuming 
there is an outer spherical “plate” at infinity. (In reality, this outer 
plate would just represent some distant part of the universe to which 


we carried away some of the earth’s charge in order to charge up the 
earth.) Vv 


29 (a) Show that the field found in example 10 on page 528 


b reduces to F = 2kX/R in the limit of L > co. 
(b) An infinite strip of width 6 has a surface charge density o. Find 
Problem 29. the field at a point at a distance z from the strip, lying in the plane 


perpendicularly bisecting the strip. Vv 
(c) Show that this expression has the correct behavior in the limit 
where z approaches zero, and also in the limit of z >> b. For the 
latter, you’ll need the result of problem 22a, which is given on page 
875. g 


30 A solid cylinder of radius b and length @ is uniformly charged 
with a total charge Q. Find the electric field at a point at the center 
of one of the flat ends. z 


31 Find the voltage at the edge of a uniformly charged disk. 
(Define V = 0 to be infinitely far from the disk.) 
Vv p Hint, p. 865 


32 Find the energy stored in a capacitor in terms of its capaci- 
tance and the voltage difference across it. Vv 
33 (a) Find the capacitance of two identical capacitors in series. 


(b) Based on this, how would you expect the capacitance of a 
parallel-plate capacitor to depend on the distance between the plates? 


~ 


34 (a) Use complex number techniques to rewrite the function 
f(t) =4sinwt + 3coswt in the form Asin(wt + 6). v 
(b) Verify the result using the trigonometric identity sin(a + 8) = 
sina cos 6 + sin 8 cosa. & 


35 (a) Show that the equation V; = LdI/dt has the right units. 
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(b) Verify that RC has units of time. 
(c) Verify that L/R has units of time. 


36 Find the inductance of two identical inductors in parallel. 
37 Calculate the quantity i’. E 
38 The wires themselves in a circuit can have resistance, induc- 


tance, and capacitance. Would “stray” inductance and capacitance 
be most important for low-frequency or for high-frequency circuits? 
For simplicity, assume that the wires act like they’re in series with 
an inductor or capacitor. 


39 Starting from the relation V = LdI/dt for the voltage dif- 
ference across an inductor, show that an inductor has an impedance 
equal to Lw. 


40 A rectangular box is uniformly charged with a charge density 
p. The box is extremely long and skinny, and its cross-section is a 
square with sides of length b. The length is so great in comparison 
to 6 that we can consider it as being infinite. Find the electric field 
at a point lying on the box’s surface, at the midpoint between the 
two edges. Your answer will involve an integral that is most easily 
done using computer software. z 


Al A hollow cylindrical pipe has length ¢ and radius 6. Its 
ends are open, but on the curved surface it has a charge density 
o. A charge gq with mass m is released at the center of the pipe, 
in unstable equilibrium. Because the equilibrium is unstable, the 
particle acclerates off in one direction or the other, along the axis 
of the pipe, and comes shooting out like a bullet from the barrel of 
a gun. Find the velocity of the particle when it’s infinitely far from 
the “gun.” Your answer will involve an integral that is difficult to 
do by hand; you may want to look it up in a table of integrals, do 
it online at integrals.com, or download and install the free Maxima 
symbolic math software from maxima.sourceforge.net. B 


42 If an FM radio tuner consisting of an LRC circuit contains 
a 1.0 wH inductor, what range of capacitances should the variable 
capacitor be able to provide? Vv 


43 (a) Find the parallel impedance of a 37 kQ resistor and a 1.0 
nF capacitor at f = 1.0 x 104 Hz. Vv 
(b) A voltage with an amplitude of 1.0 mV drives this impedance 
at this frequency. What is the amplitude of the current drawn from 
the voltage source, what is the current’s phase angle with respect to 
the voltage, and does it lead the voltage, or lag behind it? V & 


44 A series LRC circuit consists of a 1.000 2 resistor, a 1.000 F 
capacitor, and a 1.000 H inductor. (These are not particularly easy 
values to find on the shelf at Radio Shack!) 

(a) Plot its impedance as a point in the complex plane for each of 
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the following frequencies: w=0.250, 0.500, 1.000, 2.000, and 4.000 
Hz. 
(b) What is the resonant angular frequency, w;e;, and how does this 


relate to your plot? Vv 
(c) What is the resonant frequency f,e; corresponding to your an- 
swer in part b? Vv E 


45 Atafrequency w, acertain series LR circuit has an impedance 
of 1 Q+ (2 Q)i. Suppose that instead we want to achieve the same 
impedance using two circuit elements in parallel. What must the 
elements be? g 


46 (a) Use Gauss’ law to find the fields inside and outside an 
infinite cylindrical surface with radius 6 and uniform surface charge 
density o. Vv 
(b) Show that there is a discontinuity in the electric field equal to 
4rko between one side of the surface and the other, as there should 
be (see page 581). 

(c) Reexpress your result in terms of the charge per unit length, and 
compare with the field of a line of charge. 

(d) A coaxial cable has two conductors: a central conductor of radius 
a, and an outer conductor of radius b. These two conductors are 
separated by an insulator. Although such a cable is normally used 
for time-varying signals, assume throughout this problem that there 
is simply a DC voltage between the two conductors. The outer 
conductor is thin, as in part c. The inner conductor is solid, but, 
as is always the case with a conductor in electrostatics, the charge 
is concentrated on the surface. Thus, you can find all the fields in 
part b by superposing the fields due to each conductor, as found in 
part c. (Note that on a given length of the cable, the total charge of 
the inner and outer conductors is zero, so Ay = —A2, but o1 # 02, 
since the areas are unequal.) Find the capacitance per unit length 
of such a cable. vo 


AT In a certain region of space, the electric field is constant 
(i.e., the vector always has the same magnitude and direction). For 
simplicity, assume that the field points in the positive x direction. 
(a) Use Gauss’s law to prove that there is no charge in this region 
of space. This is most easily done by considering a Gaussian surface 
consisting of a rectangular box, whose edges are parallel to the z, 
y, and z axes. 

(b) If there are no charges in this region of space, what could be 
making this electric field? a 


48 (a) In a series LC circuit driven by a DC voltage (w = 0), 
compare the energy stored in the inductor to the energy stored in 
the capacitor. 

(b) Carry out the same comparison for an LC circuit that is oscil- 
lating freely (without any driving voltage). 

(c) Now consider the general case of a series LC circuit driven by 
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an oscillating voltage at an arbitrary frequency. Let Uz, and be the 
average energy stored in the inductor, and similarly for Uc. Define 
a quantity u = Uc/(U, + Uc), which can be interpreted as the ca- 
pacitor’s average share of the energy, while 1 — wu is the inductor’s 
average share. Find u in terms of L, C, and w, and sketch a graph 
of u and 1 — u versus w. What happens at resonance? Make sure 
your result is consistent with your answer to part a. v 


~ 


49 Use Gauss’ law to find the field inside an infinite cylinder 
with radius b and uniform charge density p. (The external field has 
the same form as the one in problem 46.) Vv o 


50 (a) In a certain region of space, the electric field is given 
by E = bax, where 6 is a constant. Find the amount of charge 
contained within a cubical volume extending from + = 0 to x = a, 
from y = 0 to y =a, and from z = 0 to z =a. 

(b) Repeat for E = brz. 

(c) Repeat for E = 13b2% — Tczy. 

(d) Repeat for E = brzz. E 


51 Light is a wave made of electric and magnetic fields, and the 
fields are perpendicular to the direction of the wave’s motion, i.e., 
they’re transverse. An example would be the electric field given by 
E = bxsincz, where b and c are constants. (There would also be an 
associated magnetic field.) We observe that light can travel through 
a vacuum, so we expect that this wave pattern is consistent with the 
nonexistence of any charge in the space it’s currently occupying. Use 
Gauss’s law to prove that this is true. g 


52 This is an alternative approach to problem 49, using a dif- 
ferent technique. Suppose that a long cylinder contains a uniform 
charge density p throughout its interior volume. 

(a) Use the methods of section 10.7 to find the electric field inside 
the cylinder. Vv 
(b) Extend your solution to the outside region, using the same tech- 
nique. Once you find the general form of the solution, adjust it so 
that the inside and outside fields match up at the surface. V o& 


53 The purpose of this homework problem is to prove that 
the divergence is invariant with respect to translations. That is, it 
doesn’t matter where you choose to put the origin of your coordinate 
system. Suppose we have a field of the form E = axx + byy + czz. 
This is the most general field we need to consider in any small region 
as far as the divergence is concerned. (The dependence on x, y, and 
z is linear, but any smooth function looks linear close up. We also 
don’t need to put in terms like ry, because they don’t contribute to 
the divergence.) Define a new set of coordinates (u,v, w) related to 
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(x,y,z) by 


L=uUutrp 
= UG 
Z=wet+r F 


where p, qg, and r are constants. Show that the field’s divergence is 
the same in these new coordinates. Note that x and 0 are identical, 
and similarly for the other coordinates. g 


54 Using a techniques similar to that of problem 53, show that 
the divergence is rotationally invariant, in the special case of ro- 
tations about the z axis. In such a rotation, we rotate to a new 
(u,v, Z) coordinate system, whose axes are rotated by an angle 0 
with respect to those of the (x,y,z) system. The coordinates are 
related by 


x = ucosé+ vsind 


y = —usiné + vcosé 


Find how the u and v components the field E depend on wu and 
v, and show that its divergence is the same in this new coordinate 
system. ~ 


55 An electric field is given in cylindrical coordinates (R, ¢, z) 
by Er = ce~“?|R-! cos? 6, where the notation Ep indicates the 
component of the field pointing directly away from the axis, and 
the components in the other directions are zero. (This isn’t a com- 
pletely impossible expression for the field near a radio transmitting 
antenna.) (a) Find the total charge enclosed within the infinitely 
long cylinder extending from the axis out to R = b. (b) Interpret 
the R-dependence of your answer to part a. g 


56 Use Euler’s theorem to derive the addition theorems that 
express sin(a +b) and cos(a+ b) in terms of the sines and cosines of 
a and 0b. > Solution, p. 882 © 


57 Find every complex number z such that 2? = 1. 
> Solution, p. 882 L 


58 Factor the expression x? — y® into factors of the lowest 
possible order, using complex coefficients. (Hint: use the result of 
problem 57.) Then do the same using real coefficients. 

> Solution, p. 882 © 


Key to symbols: 
easy © typical challenging Mdifficult © very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 

Exercise 10A: Field Vectors 
Apparatus: 
3 solenoids 
DC power supply 
compass 
ruler 
cut-off plastic cup 


At this point you’ve studied the gravitational field, g, and the electric field, E, but not the 
magnetic field, B. However, they all have some of the same mathematical behavior: they act 
like vectors. Furthermore, magnetic fields are the easiest to manipulate in the lab. Manipulating 
gravitational fields directly would require futuristic technology capable of moving planet-sized 
masses around! Playing with electric fields is not as ridiculously difficult, but static electric 
charges tend to leak off through your body to ground, and static electricity effects are hard to 
measure numerically. Magnetic fields, on the other hand, are easy to make and control. Any 
moving charge, i.e., any current, makes a magnetic field. 


A practical device for making a strong magnetic field is simply a coil of wire, formally known 
as a solenoid. The field pattern surrounding the solenoid gets stronger or weaker in proportion 
to the amount of current passing through the wire. 


1. With a single solenoid connected to the power supply and laid with its axis horizontal, use a 
magnetic compass to explore the field pattern inside and outside it. The compass shows you the 
field vector’s direction, but not its magnitude, at any point you choose. Note that the field the 
compass experiences is a combination (vector sum) of the solenoid’s field and the earth’s field. 


2. What happens when you bring the compass extremely far away from the solenoid? 


What does this tell you about the way the solenoid’s field varies with distance? 


Thus although the compass doesn’t tell you the field vector’s magnitude numerically, you can 
get at least some general feel for how it depends on distance. 
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3. The figure below is a cross-section of the solenoid in the plane containing its axis. Make a 
sea-of-arrows sketch of the magnetic field in this plane. The length of each arrow should at least 
approximately reflect the strength of the magnetic field at that point. 


Does the field seem to have sources or sinks? 


4. What do you think would happen to your sketch if you reversed the wires? 


Try it. 
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5. Now hook up the two solenoids in parallel. You are going to measure what happens when 
their two fields combine in the at a certain point in space. As you’ve seen already, the solenoids’ 
nearby fields are much stronger than the earth’s field; so although we now theoretically have 
three fields involved (the earth’s plus the two solenoids’), it will be safe to ignore the earth’s 
field. The basic idea here is to place the solenoids with their axes at some angle to each other, 
and put the compass at the intersection of their axes, so that it is the same distance from each 
solenoid. Since the geometry doesn’t favor either solenoid, the only factor that would make one 
solenoid influence the compass more than the other is current. You can use the cut-off plastic 
cup as a little platform to bring the compass up to the same level as the solenoids’ axes. 


a)What do you think will happen with the solenoids’ axes at 90 degrees to each other, and equal 
currents? Try it. Now represent the vector addition of the two magnetic fields with a diagram. 
Check your diagram with your instructor to make sure you’re on the right track. 


S| @ 


b) Now try to make a similar diagram of what would happen if you switched the wires on one 
of the solenoids. 


After predicting what the compass will do, try it and see if you were right. 


c)Now suppose you were to go back to the arrangement you had in part a, but you changed one 
of the currents to half its former value. Make a vector addition diagram, and use trig to predict 


the angle. 


Try it. To cut the current to one of the solenoids in half, an easy and accurate method is 
simply to put the third solenoid in series with it, and put that third solenoid so far away that 
its magnetic field doesn’t have any significant effect on the compass. 
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Chapter 11 
Electromagnetism 


Think not that I am come to destroy the law, or the prophets: I am 
not come to destroy, but to fulfill. Matthew 5:17 


11.1 More About the Magnetic Field 


11.1.1 Magnetic forces 


In this chapter, I assume you know a few basic ideas about Ein- 
stein’s theory of relativity, as described in sections 7.1 and 7.2. Un- 
less your typical workday involves rocket ships or particle accelera- 
tors, all this relativity stuff might sound like a description of some 
bizarre futuristic world that is completely hypothetical. There is, 
however, a relativistic effect that occurs in everyday life, and it is 
obvious and dramatic: magnetism. Magnetism, as we discussed 
previously, is an interaction between a moving charge and another 
moving charge, as opposed to electric forces, which act between any 
pair of charges, regardless of their motion. Relativistic effects are 
weak for speeds that are small compared to the speed of light, and 
the average speed at which electrons drift through a wire is quite 
low (centimeters per second, typically), so how can relativity be be- 
hind an impressive effect like a car being lifted by an electromagnet (1) © 
hanging from a crane? The key is that matter is almost perfectly 
electrically neutral, and electric forces therefore cancel out almost er 
perfectly. Magnetic forces really aren’t very strong, but electric 
forces are even weaker. © 
What about the word “relativity” in the name of the theory? It 
would seem problematic if moving charges interact differently than (2) =6 
stationary charges, since motion is a matter of opinion, depending 
on your frame of reference. Magnetism, however, comes not to de- 
stroy relativity but to fulfill it. Magnetic interactions must exist 
according to the theory of relativity. To understand how this can 
be, consider how time and space behave in relativity. Observers 
in different frames of reference disagree about the lengths of mea- 
suring sticks and the speeds of clocks, but the laws of physics are 
valid and self-consistent in either frame of reference. Similarly, ob- 
servers in different frames of reference disagree about what electric 
and magnetic fields and forces there are, but they agree about con- 
crete physical events. For instance, figure a/1 shows two particles, 
with opposite charges, which are not moving at a particular mo- 


a/The pair of charged parti- 
cles, as seen in two different 
frames of reference. 
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b/A large current is created 
by shorting across the leads of 
the battery. The moving charges 
in the wire attract the moving 
charges in the electron beam, 
causing the electrons to curve. 


(1) Go —v 
—>u 
© © 666 OG Ge @ 
@©oeeeee8 © © 
| ——— 
(2) © 
—u-v 
6 © © © © © @ 
SCOCCOCCOCCECOCOCOCOE 
U-Vv <r 


c/A charged particle and a 
current, seen in two different 
frames of reference. The second 
frame is moving at velocity v 
with respect to the first frame, 
so all the velocities have v sub- 
tracted from them. (As discussed 
in the main text, this is only 
approximately correct.) 
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ment in time. An observer in this frame of reference says there are 
electric fields around the particles, and predicts that as time goes 
on, the particles will begin to accelerate towards one another, even- 
tually colliding. A different observer, a/2, says the particles are 
moving. This observer also predicts that the particles will collide, 
but explains their motion in terms of both an electric field, E, and a 
magnetic field, B. As we’ll see shortly, the magnetic field is required 
in order to maintain consistency between the predictions made in 
the two frames of reference. 


To see how this really works out, we need to find a nice sim- 
ple example that is easy to calculate. An example like figure a is 
not easy to handle, because in the second frame of reference, the 
moving charges create fields that change over time at any given lo- 
cation. Examples like figure b are easier, because there is a steady 
flow of charges, and all the fields stay the same over time.! What is 
remarkable about this demonstration is that there can be no elec- 
tric fields acting on the electron beam at all, since the total charge 
density throughout the wire is zero. Unlike figure a/2, figure b is 
purely magnetic. 


To see why this must occur based on relativity, we make the 
mathematically idealized model shown in figure c. The charge by 
itself is like one of the electrons in the vacuum tube beam of fig- 
ure b, and a pair of moving, infinitely long line charges has been 
substituted for the wire. The electrons in a real wire are in rapid 
thermal motion, and the current is created only by a slow drift su- 
perimposed on this chaos. A second deviation from reality is that 
in the real experiment, the protons are at rest with respect to the 
tabletop, and it is the electrons that are in motion, but in c/1 we 
have the positive charges moving in one direction and the negative 
ones moving the other way. If we wanted to, we could construct a 
third frame of reference in which the positive charges were at rest, 
which would be more like the frame of reference fixed to the table- 
top in the real demonstration. However, as we’ll see shortly, frames 
c/1 and c/2 are designed so that they are particularly easy to ana- 
lyze. It’s important to note that even though the two line charges 
are moving in opposite directions, their currents don’t cancel. A 
negative charge moving to the left makes a current that goes to the 
right, so in frame c/1, the total current is twice that contributed by 
either line charge. 


Frame 1 is easy to analyze because the charge densities of the 
two line charges cancel out, and the electric field experienced by the 


'For a more practical demonstration of this effect, you can put an ordinary 
magnet near a computer monitor. The picture will be distorted. Make sure that 
the monitor has a demagnetizing (“degaussing”) button, however! Otherwise 
you may permanently damage it. Don’t use a television tube, because TV tubes 
don’t have demagnetizing buttons. 
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lone charge is therefore zero: 
E, =0 


In frame 1, any force experienced by the lone charge must therefore 
be attributed solely to magnetism. 


Frame 2 shows what we’d see if we were observing all this from 
a frame of reference moving along with the lone charge. Why don’t 
the charge densities also cancel in this frame? MHere’s where the 
relativity comes in. Relativity tells us that moving objects appear 
contracted to an observer who is not moving along with them. Both 
line charges are in motion in both frames of reference, but in frame 1, 
the line charges were moving at equal speeds, so their contractions 
were equal, and their charge densities canceled out. In frame 2, 
however, their speeds are unequal. The positive charges are moving 
more slowly than in frame 1, so in frame 2 they are less contracted. 
The negative charges are moving more quickly, so their contraction 
is greater now. Since the charge densities don’t cancel, there is an 
electric field in frame 2, which points into the wire, attracting the 
lone charge. Furthermore, the attraction felt by the lone charge 
must be purely electrical, since the lone charge is at rest in this 
frame of reference, and magnetic effects occur only between moving 
charges and other moving charges.” 


To summarize, frame 1 displays a purely magnetic attraction, 
while in frame 2 it is purely electrical. 


A common source of confusion in this argument is that it seems 
as though the excess of negative electric charge must have been 
stolen from some other part of the wire, leaving that part with a 
net positive charge. That wouldn’t make sense, because there is no 
physical reason why one part of the wire would behave differently 
than any other. The flaw in this reasoning has to do with the fact 
that simultaneity is not well defined in relativity. In frame c/1, sup- 
pose that we label all the positive charges with integers, and likewise 
all the negative ones, so that the positive charge labeled 42 is on top 
of the negative charge labeled 42, and so on. In this frame of refer- 
ence, every charge has a partner that cancels it, and the net charge 
everywhere is zero. If simultaneity were a valid concept in relativity, 


?One could object that this is circular reasoning, since the whole purpose of 
this argument is to prove from first principles that magnetic effects follow from 
the theory of relativity. Could there be some extra interaction which occurs 
between a moving charge and any other charge, regardless of whether the other 
charge is moving or not? We can argue, however, that such a theory would lack 
self-consistency, since we have to define the electric field somehow, and the only 
way to define it is in terms of F’/q, where F is the force on a test charge q which is 
at rest. In other words, we’d have to say that there was some extra contribution 
to the electric field if the charge making it was in motion. This would, however, 
violate Gauss’ law, and Gauss’ law is amply supported by experiment, even when 
the sources of the electric field are moving. It would also violate the time-reversal 
symmetry of the laws of physics. 
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then not only would 42’s pairing with 42, and 43’s pairing with 43, 
occur simultaneously in frame c/1, but these same pairings would 
occur all at the same time in frame c/2. But observers in differ- 
ent frames of reference do not agree on simultaneity. For simplicity, 
let’s imagine that the Lorentz contractions are such that the spacing 
between the negative charges in frame c/2 is exactly half as much 
as the spacing between the positive charges. Then we may have 
negative charge number 42 paired up with positive charge 21, and 
negative charge 44 paired with positive charge 22, while negative 
charge 43 has no partner. 


Now we can calculate the force in frame 2, and equating it to 
the force in frame 1, we can find out how much magnetic force 
occurs. To keep the math simple, and to keep from assuming too 
much about your knowledge of relativity, we’re going to carry out 
this whole calculation in the approximation where all the speeds 
are fairly small compared to the speed of light.? For instance, if 
we find an expression such as (v/c)? + (v/c)*, we will assume that 
the fourth-order term is negligible by comparison. This is known as 
a calculation “to leading order in u/c.” In fact, I’ve already used 
the leading-order approximation twice without saying so! The first 
time I used it implicitly was in figure c, where I assumed that the 
velocities of the two line charges were u—v and —u—v. Relativistic 
velocities don’t just combine by simple addition and subtraction like 
this, but this is an effect we can ignore in the present approximation. 
The second sleight of hand occurred when I stated that we could 
equate the forces in the two frames of reference. Force, like time 
and distance, is distorted relativistically when we change from one 
frame of reference to another. Again, however, this is an effect that 
we can ignore to the desired level of approximation. 


Let +X be the charge per unit length of each line charge with- 
out relativistic contraction, i.e., in the frame moving with that line 
charge. Using the approximation y = (1 — v?/c?)~1/? & 1 + v?/2¢c? 
for v < c, the total charge per unit length in frame 2 is 


m _(u-v)P Oe ea 
Atotal, 22 c y D2 A\14 2 


—2duv 
2 


Cc 


Let R be the distance from the line charge to the lone charge. Ap- 
plying Gauss’ law to a cylinder of radius R centered on the line 
charge, we find that the magnitude of the electric field experienced 
by the lone charge in frame 2 is 


Be AkAuv . 
c2R 


’The reader who wants to see the full relativistic treatment is referred to 
E.M. Purcell, Electricity and Magnetism, McGraw Hill, 1985, p. 174. 
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and the force acting on the lone charge q is 


4kX 
F- quu 
eR 
In frame 1, the current is J = 2A,u (see homework problem 5), 
which we can approximate as J = 2Xu, since the current, unlike 


Mtotal, 2, doesn’t vanish completely without the relativistic effect. 
The magnetic force on the lone charge g due to the current I is 


_ 2kIqu 
— @R 


11.1.2 The magnetic field 
Definition in terms of the force on a moving particle 


With electricity, it turned out to be useful to define an electric 
field rather than always working in terms of electric forces. Likewise, 
we want to define a magnetic field, B. Let’s look at the result of 
the preceding subsection for insight. The equation 


2kIqu 
Fe 
eR 


shows that when we put a moving charge near other moving charges, 
there is an extra magnetic force on it, in addition to any electric 
forces that may exist. Equations for electric forces always have a 
factor of & in front — the Coulomb constant k is called the coupling 
constant for electric forces. Since magnetic effects are relativistic 
in origin, they end up having a factor of k/c? instead of just k. 
In a world where the speed of light was infinite, relativistic effects, 
including magnetism, would be absent, and the coupling constant 
for magnetism would be zero. A cute feature of the metric system is 
that we have k/c? = 10-* N-s?/C? exactly, as a matter of definition. 


Naively, we could try to work by analogy with the electric field, 
and define the magnetic field as the magnetic force per unit charge. 
However, if we think of the lone charge in our example as the test 
charge, we’ll find that this approach fails, because the force depends 
not just on the test particle’s charge, but on its velocity, v, as well. 
Although we only carried out calculations for the case where the 
particle was moving parallel to the wire, in general this velocity is 
a vector, v, in three dimensions. We can also anticipate that the 
magnetic field will be a vector. The electric and gravitational fields 
are vectors, and we expect intuitively based on our experience with 
magnetic compasses that a magnetic field has a particular direction 
in space. Furthermore, reversing the current J in our example would 
have reversed the force, which would only make sense if the magnetic 
field had a direction in space that could be reversed. Summarizing, 
we think there must be a magnetic field vector B, and the force 


d/ The right-hand relation- 
ship between the velocity of a 
positively charged particle, the 
magnetic field through which it is 
moving, and the magnetic force 
on it. 


e/The unit of magnetic field, 
the tesla, is named after Serbian- 
American inventor Nikola Tesla. 
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f/A standard dipole made 
from a square loop of wire short- 
ing across a battery. It acts very 
much like a bar magnet, but its 
strength is more easily quantified. 


B B B 
g/A dipole tends to align it- 


self to the surrounding magnetic 
field. 


direction of mM 
and A vectors 


4 


Tes 


w/ 
| 


va 


Vs 


h/ The m and A vectors. 
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on a test particle moving through a magnetic field is proportional 
both to the B vector and to the particle’s own v vector. In other 
words, the magnetic force vector F is found by some sort of vector 
multiplication of the vectors v and B. As proved on page 857, 
however, there is only one physically useful way of defining such a 
multiplication, which is the cross product. 


We therefore define the magnetic field vector, B, as the vector that 
determines the force on a charged particle according to the following 
rule: 


F=qvxB [definition of the magnetic field] 


From this definition, we see that the magnetic field’s units are 
N-s/C-m, which are usually abbreviated as teslas, 1 T=1N- 
s/C-m. The definition implies a right-hand-rule relationship among 
the vectors, figure d, if the charge q is positive, and the opposite 
handedness if it is negative. 


This is not just a definition but a bold prediction! Is it really 
true that for any point in space, we can always find a vector B that 
successfully predicts the force on any passing particle, regardless 
of its charge and velocity vector? Yes — it’s not obvious that it 
can be done, but experiments verify that it can. How? Well for 
example, the cross product of parallel vectors is zero, so we can try 
particles moving in various directions, and hunt for the direction 
that produces zero force; the B vector lies along that line, in either 
the same direction the particle was moving, or the opposite one. We 
can then go back to our data from one of the other cases, where the 
force was nonzero, and use it to choose between these two directions 
and find the magnitude of the B vector. We could then verify that 
this vector gave correct force predictions in a variety of other cases. 


Even with this empirical reassurance, the meaning of this equa- 
tion is not intuitively transparent, nor is it practical in most cases 
to measure a magnetic field this way. For these reasons, let’s look 
at an alternative method of defining the magnetic field which, al- 
though not as fundamental or mathematically simple, may be more 
appealing. 


Definition in terms of the torque on a dipole 


A compass needle in a magnetic field experiences a torque which 
tends to align it with the field. This is just like the behavior of an 
electric dipole in an electric field, so we consider the compass needle 
to be a magnetic dipole. In subsection 10.1.3 on page 521, we gave 
an alternative definition of the electric field in terms of the torque 
on an electric dipole. 


To define the strength of a magnetic field, however, we need 
some way of defining the strength of a test dipole, i.c., we need a 
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definition of the magnetic dipole moment. We could use an iron 
permanent magnet constructed according to certain specifications, 
but such an object is really an extremely complex system consisting 
of many iron atoms, only some of which are aligned with each other. 
A more fundamental standard dipole is a square current loop. This 
could be little resistive circuit consisting of a square of wire shorting 
across a battery, f. 


Applying F = v x B, we find that such a loop, when placed in 
a magnetic field, g, experiences a torque that tends to align plane 
so that its interior “face” points in a certain direction. Since the 
loop is symmetric, it doesn’t care if we rotate it like a wheel with- 
out changing the plane in which it lies. It is this preferred facing 
direction that we will end up using as our alternative definition of 
the magnetic field. 


If the loop is out of alignment with the field, the torque on it 
is proportional to the amount of current, and also to the interior 
area of the loop. The proportionality to current makes sense, since 
magnetic forces are interactions between moving charges, and cur- 
rent is a measure of the motion of charge. The proportionality to 
the loop’s area is also not hard to understand, because increasing 
the length of the sides of the square increases both the amount of 
charge contained in this circular “river” and the amount of lever- 
age supplied for making torque. Two separate physical reasons for 
a proportionality to length result in an overall proportionality to 
length squared, which is the same as the area of the loop. For these 
reasons, we define the magnetic dipole moment of a square current 
loop as 

m=IJA ; 


where the direction of the vectors is defined as shown in figure h. 


We can now give an alternative definition of the magnetic field: 


The magnetic field vector, B, at any location in space is defined by 
observing the torque exerted on a magnetic test dipole m; consisting 
of a square current loop. The field’s magnitude is 


|B| 


_ “p 
~ |m,|sin@ 


where @ is the angle between the dipole vector and the field. This 
is equivalent to the vector cross product T = m; x B. 


Let’s show that this is consistent with the previous definition, 
using the geometry shown in figure i. The velocity vector that point 
in and out of the page are shown using the convention defined in 
figure j. Let the mobile charge carriers in the wire have linear density 
A, and let the sides of the loop have length h, so that we have 
I = Xv, and m = h?Xv. The only nonvanishing torque comes from 
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velocity out 
of page 


i/The torque on a_ current 
loop in a magnetic field. The 
current comes out of the page, 
goes across, goes back into the 
page, and then back across the 
other way in the hidden side of 
the loop. 


© out of the page 
® into the page 


j/A vector coming out of the 
page is shown with the tip of an 
arrowhead. A vector going into 
the page is represented using the 
tailfeathers of the arrow. 


k / Dipole vectors can be added. 
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1/An_ irregular loop can be 
broken up into little squares. 
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m/The 
tern around a bar magnet is 
created by the superposition of 
the dipole fields of the individual 


magnetic field pat- 


iron atoms. Roughly speaking, 
it looks like the field of one big 
dipole, especially farther away 
from the magnet. Closer in, 
however, you can see a hint of 
the magnet’s rectangular shape. 
The picture was made by placing 
iron filings on a piece of paper, 
and then bringing a magnet up 
underneath. 
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the forces on the left and right sides. The currents in these sides 
are perpendicular to the field, so the magnitude of the cross product 
F = qv x B is simply |F| = qvB. The torque supplied by each of 
these forces is r x F, where the lever arm r has length h/2, and 
makes an angle @ with respect to the force vector. The magnitude 
of the total torque acting on the loop is therefore 


h 
Fe] = 25 IF sin 0 
=hquvB siné ; 
and substituting g = Ah and v = m/h?A, we have 


|r| =hAh 77 Bsind 


= mB sind ; 


which is consistent with the second definition of the field. 


It undoubtedly seems artificial to you that we have discussed 
dipoles only in the form of a square loop of current. A permanent 
magnet, for example, is made out of atomic dipoles, and atoms 
aren’t square! However, it turns out that the shape doesn’t matter. 
To see why this is so, consider the additive property of areas and 
dipole moments, shown in figure k. Each of the square dipoles has 
a dipole moment that points out of the page. When they are placed 
side by side, the currents in the adjoining sides cancel out, so they 
are equivalent to a single rectangular loop with twice the area. We 
can break down any irregular shape into little squares, as shown in 
figure 1, so the dipole moment of any planar current loop can be 
calculated based on its area, regardless of its shape. 


The magnetic dipole moment of an atom example 1 
Let’s make an order-of-magnitude estimate of the magnetic dipole 
moment of an atom. A hydrogen atom is about 10~'° m in diam- 
eter, and the electron moves at speeds of about 10-*c. We don’t 
know the shape of the orbit, and indeed it turns out that accord- 
ing to the principles of quantum mechanics, the electron doesn’t 
even have a well-defined orbit, but if we’re brave, we can still es- 
timate the dipole moment using the cross-sectional area of the 
atom, which will be on the order of (10~'1° m)? = 10-29 m?. The 
electron is a single particle, not a steady current, but again we 
throw caution to the winds, and estimate the current it creates as 
e/At, where At, the time for one orbit, can be estimated by divid- 
ing the size of the atom by the electron’s velocity. (This is only 
a rough estimate, and we don’t know the shape of the orbit, so it 
would be silly, for instance, to bother with multiplying the diameter 
by 7 based on our intuitive visualization of the electron as moving 
around the circumference of a circle.) The result for the dipole 
moment is m ~ 10-23 A-m?. 
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Should we be impressed with how small this dipole moment is, 
or with how big it is, considering that it’s being made by a single 
atom? Very large or very small numbers are never very interest- 
ing by themselves. To get a feeling for what they mean, we need 
to compare them to something else. An interesting comparison 
here is to think in terms of the total number of atoms in a typical 
object, which might be on the order of 107° (Avogadro’s number). 
Suppose we had this many atoms, with their moments all aligned. 
The total dipole moment would be on the order of 10° Am?, which 
is a pretty big number. To get a dipole moment this strong using 
human-scale devices, we’d have to send a thousand amps of cur- 
rent through a one-square meter loop of wire! The insight to be 
gained here is that, even in a permanent magnet, we must not 
have all the atoms perfectly aligned, because that would cause 
more spectacular magnetic effects than we really observe. Ap- 
parently, nearly all the atoms in such a magnet are oriented ran- 
domly, and do not contribute to the magnet’s dipole moment. 


Discussion Questions 


A __ The physical situation shown in figure c on page 602 was analyzed 
entirely in terms of forces. Now let’s go back and think about it in terms of 
fields. The charge by itself up above the wire is like a test charge, being 
used to determine the magnetic and electric fields created by the wire. 
In figures c/1 and c/2, are there fields that are purely electric or purely 
magnetic? Are there fields that are a mixture of E and B? How does this 
compare with the forces? 


B Continuing the analysis begun in discussion question A, can we 
come up with a scenario involving some charged particles such that the 
fields are purely magnetic in one frame of reference but a mixture of E and 
B in another frame? How about an example where the fields are purely 
electric in one frame, but mixed in another? Or an example where the 
fields are purely electric in one frame, but purely magnetic in another? 


11.1.3 Some applications 


Magnetic levitation example 2 
In figure n, a small, disk-shaped permanent magnet is stuck on 
the side of a battery, and a wire is clasped loosely around the 
battery, shorting it. A large current flows through the wire. The F 
electrons moving through the wire feel a force from the magnetic 
field made by the permanent magnet, and this force levitates the 
wire. 


From the photo, it’s possible to find the direction of the magnetic 

field made by the permanent magnet. The electrons in the copper 

wire are negatively charged, so they flow from the negative (flat) 

terminal of the battery to the positive terminal (the one with the 

bump, in front). As the electrons pass by the permanent magnet, y 
we can imagine that they would experience a field either toward 

the magnet, or away from it, depending on which way the magnet n/ Example 2. 
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o/Magnetic forces cause a 
beam of electrons to move in a 
circle. 
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p/ You can’t isolate the poles of a 
magnet by breaking it in half. 


+ 


q/A magnetic dipole is made 
out of other dipoles, not out of 
monopoles. 
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was flipped when it was stuck onto the battery. By the right-hand 
rule (figure d on page 605), the field must be toward the battery. 


‘Hallucinations during an MRI scan example 3 
During an MRI scan of the head, the patient’s nervous system 
is exposed to intense magnetic fields. The average velocities of 
the charge-carrying ions in the nerve cells is fairly low, but if the 
patient moves her head suddenly, the velocity can be high enough 
that the magnetic field makes significant forces on the ions. This 
can result in visual and auditory hallucinations, e.g., frying bacon 
sounds. 


‘A circular orbit example 4 
The magnetic force is always perpendicular to the motion of the 
particle, so it can never do any work, and a charged particle mov- 
ing through a magnetic field does not experience any change in 
its kinetic energy: its velocity vector can change its direction, but 
not its magnitude. If the velocity vector is initially perpendicular 
to the field, then the curve of its motion will remain in the plane 
perpendicular to the field, so the magnitude of the magnetic force 
on it will stay the same. When an object experiences a force with 
constant magnitude, which is always perpendicular to the direc- 
tion of its motion, the result is that it travels in a circle. 


Figure o shows a beam of electrons in a spherical vacuum tube. 
In the top photo, the beam is emitted near the right side of the 
tube, and travels straight up. In the bottom photo, a magnetic field 
has been imposed by an electromagnet surrounding the vacuum 
tube; the ammeter on the right shows that the current through the 
electromagnet is now nonzero. We observe that the beam is bent 
into a circle. 


self-check A 
Infer the direction of the magnetic field. Don’t forget that the beam is 
made of electrons, which are negatively charged! > Answer, p. 871 


Homework problem 12 is a quantitative analysis of circular orbits. 
‘A velocity filter example 5 
Suppose you see the electron beam in figure 0, and you want to 
determine how fast the electrons are going. You certainly can’t 
do it with a stopwatch! Physicists may also encounter situations 
where they have a beam of unknown charged particles, and they 
don’t even know their charges. This happened, for instance, when 
alpha and beta radiation were discovered. One solution to this 
problem relies on the fact that the force experienced by a charged 
particle in an electric field, Fe = gE, is independent of its veloc- 
ity, but the force due to a magnetic field, Fg = gv x B, isn’t. One 
can send a beam of charged particles through a space containing 
both an electric and a magnetic field, setting up the fields so that 
the two forces will cancel out perfectly for a certain velocity. Note 
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that since both forces are proportional to the charge of the parti- 
cles, the cancellation is independent of charge. Such a velocity 
filter can be used either to determine the velocity of an unknown 
beam or particles, or to select from a beam of particles only those 
having velocities within a certain desired range. Homework prob- 
lem 7 is an analysis of this application. 


11.1.4 No magnetic monopoles 


If you could play with a handful of electric dipoles and a handful 
of bar magnets, they would appear very similar. For instance, a pair 
of bar magnets wants to align themselves head-to-tail, and a pair of 
electric dipoles does the same thing. (It is unfortunately not that 
easy to make a permanent electric dipole that can be handled like 
this, since the charge tends to leak.) 


You would eventually notice an important difference between the 
two types of objects, however. The electric dipoles can be broken 
apart to form isolated positive charges and negative charges. The 
two-ended device can be broken into parts that are not two-ended. 
But if you break a bar magnet in half, p, you will find that you have 
simply made two smaller two-ended objects. 


The reason for this behavior is not hard to divine from our mi- 
croscopic picture of permanent iron magnets. An electric dipole has 
extra positive “stuff” concentrated in one end and extra negative in 
the other. The bar magnet, on the other hand, gets its magnetic 
properties not from an imbalance of magnetic “stuff” at the two 
ends but from the orientation of the rotation of its electrons. One 
end is the one from which we could look down the axis and see the 
electrons rotating clockwise, and the other is the one from which 
they would appear to go counterclockwise. There is no difference 
between the “stuff” in one end of the magnet and the other, q. 


Nobody has ever succeeded in isolating a single magnetic pole. 
In technical language, we say that magnetic monopoles not seem to 
exist. Electric monopoles do exist — that’s what charges are. 


Electric and magnetic forces seem similar in many ways. Both 
act at a distance, both can be either attractive or repulsive, and 
both are intimately related to the property of matter called charge. 
(Recall that magnetism is an interaction between moving charges.) 
Physicists’s aesthetic senses have been offended for a long time be- 
cause this seeming symmetry is broken by the existence of elec- 
tric monopoles and the absence of magnetic ones. Perhaps some 
exotic form of matter exists, composed of particles that are mag- 
netic monopoles. If such particles could be found in cosmic rays 
or moon rocks, it would be evidence that the apparent asymmetry 
was only an asymmetry in the composition of the universe, not in 
the laws of physics. For these admittedly subjective reasons, there 
have been several searches for magnetic monopoles. Experiments 
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have been performed, with negative results, to look for magnetic 
monopoles embedded in ordinary matter. Soviet physicists in the 
1960s made exciting claims that they had created and detected mag- 
netic monopoles in particle accelerators, but there was no success 
in attempts to reproduce the results there or at other accelerators. 
The most recent search for magnetic monopoles, done by reanalyz- 
ing data from the search for the top quark at Fermilab, turned up 
no candidates, which shows that either monopoles don’t exist in 
nature or they are extremely massive and thus hard to create in 
accelerators. 


The nonexistence of magnetic monopoles means that unlike an 
electric field, a magnetic one, can never have sources or sinks. The 
magnetic field vectors lead in paths that loop back on themselves, 
without ever converging or diverging at a point, as in the fields 
shown in figure r. Gauss’ law for magnetism is therefore much sim- 
pler than Gauss’ law for electric fields: 


bp =) _>B;- A; =0 


The magnetic flux through any closed surface is zero. 


self-check B 

Draw a Gaussian surface on the electric dipole field of figure r that has 
nonzero electric flux through it, and then draw a similar surface on the 
magnetic field pattern. What happens? > Answer, p. 871 


The field of a wire example 6 
> On page 605, we showed that a long, straight wire carrying 
current / exerts a magnetic force 


2kIqv 
a c?R 
on a particle with charge q moving parallel to the wire with velocity 
v. What, then, is the magnetic field of the wire? 


> Comparing the equation above to the first definition of the mag- 
netic field, F = v x B, it appears that the magnetic field is one that 
falls off like 1/R, where R is the distance from the wire. However, 
it's not so easy to determine the direction of the field vector. There 
are two other axes along which the particle could have been mov- 
ing, and the brute-force method would be to carry out relativistic 
calculations for these cases as well. Although this would probably 
be enough information to determine the field, we don’t want to do 
that much work. 


Instead, let’s consider what the possibilities are. The field can’t 
be parallel to the wire, because a cross product vanishes when 
the two vectors are parallel, and yet we know from the case we 
analyzed that the force doesn’t vanish when the particle is moving 
parallel to the wire. The other two possibilities that are consistent 
with the symmetry of the problem are shown in figure s. One is 
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like a bottle brush, and the other is like a spool of thread. The 
bottle brush pattern, however, violates Gauss’ law for magnetism. 
If we made a cylindrical Gaussian surface with its axis coinciding 
with the wire, the flux through it would not be zero. We there- 
fore conclude that the spool-of-thread pattern is the correct one.* 
Since the particle in our example was moving perpendicular to 
the field, we have |F| = |q||v||B|, so 


1B = |F| 

lgllv| 
_ 2k! 
~ C2R 


11.1.5 Symmetry and handedness 


The physicist Richard Feynman helped to get me hooked on 
physics with an educational film containing the following puzzle. 
Imagine that you establish radio contact with an alien on another 
planet. Neither of you even knows where the other one’s planet 
is, and you aren’t able to establish any landmarks that you both 
recognize. You manage to learn quite a bit of each other’s languages, 
but you’re stumped when you try to establish the definitions of left 
and right (or, equivalently, clockwise and counterclockwise). Is there 
any way to do it? 


If there was any way to do it without reference to external land- 
marks, then it would imply that the laws of physics themselves were 
asymmetric, which would be strange. Why should they distinguish 
left from right? The gravitational field pattern surrounding a star 
or planet looks the same in a mirror, and the same goes for electric 
fields. However, the magnetic field patterns shown in figure s seems 
to violate this principle. Could you use these patterns to explain left 
and right to the alien? No. If you look back at the definition of the 
magnetic field, it also contains a reference to handedness: the di- 
rection of the vector cross product. The aliens might have reversed 
their definition of the magnetic field, in which case their drawings 


“Strictly speaking, there is a hole in this logic, since I’ve only ruled out a 
field that is purely along one of these three perpendicular directions. What if it 
has components along more than one of them? A little more work is required 
to eliminate these mixed possibilities. For example, we can rule out a field 
with a nonzero component parallel to the wire based on the following symmetry 
argument. Suppose a charged particle is moving in the plane of the page directly 
toward the wire. If the field had a component parallel to the wire, then the 
particle would feel a force into or out of the page, but such a force is impossible 
based on symmetry, since the whole arrangement is symmetric with respect to 
mirror-reflection across the plane of the page. 
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t/ Left-handed and right-handed definitions. 


of field patterns would look like mirror images of ours, as in the left 
panel of figure t. 


Until the middle of the twentieth century, physicists assumed 
that any reasonable set of physical laws would have to have this 
kind of symmetry between left and right. An asymmetry would 
be grotesque. Whatever their aesthetic feelings, they had to change 
their opinions about reality when experiments showed that the weak 
nuclear force violates right-left symmetry! It is still a mystery why 
right-left symmetry is observed so scrupulously in general, but is 
violated by one particular type of physical process. 
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11.2 Magnetic Fields by Superposition 


11.2.1 Superposition of straight wires 


In chapter 10, one of the most important goals was to learn 
how to calculate the electric field for a given charge distribution. 
The corresponding problem for magnetism would be to calculate the 
magnetic field arising from a given set of currents. So far, however, 
we only know how to calculate the magnetic field of a long, straight 
wire, 


c2R ; 


with the geometry shown in figure a. Whereas a charge distribution 
can be broken down into individual point charges, most currents 
cannot be broken down into a set of straight-line currents. Never- 
theless, let’s see what we can do with the tools that we have. 


‘A ground fault interrupter example 7 
Electric current in your home is supposed to flow out of one side 
of the outlet, through an appliance, and back into the wall through 
the other side of the outlet. If that’s not what happens, then we 
have a problem — the current must be finding its way to ground 
through some other path, perhaps through someone’s body. If 
you have outlets in your home that have “test” and “reset” but- 
tons on them, they have a safety device built into them that is 
meant to protect you in this situation. The ground fault interrupter 
(GFI) shown in figure b, routes the outgoing and returning cur- 
rents through two wires that lie very close together. The clockwise 
and counterclockwise fields created by the two wires combine by 
vector addition, and normally cancel out almost exactly. However, 
if current is not coming back through the circuit, a magnetic field 
is produced. The doughnut-shaped collar detects this field (us- 
ing an effect called induction, to be discussed in section 11.5), 
and sends a signal to a logic chip, which breaks the circuit within 
about 25 milliseconds. 

‘An example with vector addition example 8 
> Two long, straight wires each carry current / parallel to the y 
axis, but in opposite directions. They are separated by a gap 
2h in the x direction. Find the magnitude and direction of the 
magnetic field at a point located at a height z above the plane of 
the wires, directly above the center line. 


> The magnetic fields contributed by the two wires add like vec- 
tors, which means we can add their x and z components. The x 
components cancel by symmetry. The magnitudes of the individ- 
ual fields are equal, 


a/The magnetic field of a 


long, straight wire. 


— 
™ 


b/A ground fault interrupter. 


ly © @ Ip 
view along the y axis 


c/ Example 8. 
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so the total field in the z direction is 


2k! 
B, = 22R8n 0 ) 
where 6 is the angle the field vectors make above the x axis. The 
sine of this angle equals h/R, so 


4kIh 
~ C2 R2° 


(Putting this explicitly in terms of z gives the less attractive form 
B, = 4kIh/c?(h? + 2°).) 


At large distances from the wires, the individual fields are mostly 
in the +x direction, so most of their strength cancels out. It’s not 
surprising that the fields tend to cancel, since the currents are 
in opposite directions. What's more interesting is that not only 
is the field weaker than the field of one wire, it also falls off as 
R-? rather than R-'. If the wires were right on top of each other, 
their currents would cancel each other out, and the field would be 
zero. From far away, the wires appear to be almost on top of each 
other, which is what leads to the more drastic R~* dependence 
on distance. 


B, 


self-check C 

In example 8, what is the field right between the wires, at z = 0, and 
how does this simpler result follow from vector addition? > Answer, 
p. 871 


An alarming infinity 


An interesting aspect of the R~? dependence of the field in ex- 
ample 8 is the energy of the field. We’ve already established in 
the preceding chapter that the energy density of the magnetic field 
must be proportional to the square of the field strength, B?, the 
same as for the gravitational and electric fields. Suppose we try to 
calculate the energy per unit length stored in the field of a single 
wire. We haven’t yet found the proportionality factor that goes in 
front of the B?, but that doesn’t matter, because the energy per 
unit length turns out to be infinite! To see this, we can construct 
concentric cylindrical shells of length LD, with each shell extending 
from R to R+dR. The volume of the shell equals its circumference 
times its thickness times its length, du = (27R)(dR)(L) = 2nLdR. 
For a single wire, we have B ~ R7!, so the energy density is pro- 
portional to R~?, and the energy contained in each shell varies as 
R-?dv ~ R7'dr. Integrating this gives a logarithm, and as we let R 
approach infinity, we get the logarithm of infinity, which is infinite. 


Taken at face value, this result would imply that electrical cur- 
rents could never exist, since establishing one would require an in- 
finite amount of energy per unit length! In reality, however, we 
would be dealing with an electric circuit, which would be more like 


Electromagnetism 


the two wires of example 8: current goes out one wire, but comes 
back through the other. Since the field really falls off as R~?, we 
have an energy density that varies as R~*, which does not give in- 
finity when integrated out to infinity. (There is still an infinity at 
R = 0, but this doesn’t occur for a real wire, which has a finite 
diameter.) 


Still, one might worry about the physical implications of the 
single-wire result. For instance, suppose we turn on an electron 
gun, like the one in a TV tube. It takes perhaps a microsecond for 
the beam to progress across the tube. After it hits the other side 
of the tube, a return current is established, but at least for the first 
microsecond, we have only a single current, not two. Do we have 
infinite energy in the resulting magnetic field? No. It takes time for 
electric and magnetic field disturbances to travel outward through 
space, so during that microsecond, the field spreads only to some 
finite value of R, not R= oo. 


This reminds us of an important fact about our study of mag- 
netism so far: we have only been considering situations where the 
currents and magnetic fields are constant over time. The equation 
B = 2kI/c?R was derived under this assumption. This equation is 
only valid if we assume the current has been established and flowing 
steadily for a long time, and if we are talking about the field at 
a point in space at which the field has been established for a long 
time. The generalization to time-varying fields is nontrivial, and 
qualitatively new effects will crop up. We have already seen one 
example of this on page 552, where we inferred that an inductor’s 
time-varying magnetic field creates an electric field — an electric 
field which is not created by any charges anywhere. Effects like 
these will be discussed in section 11.5 


A sheet of current 


There is a saying that in computer science, there are only three 
nice numbers: zero, one, and however many you please. In other 
words, computer software shouldn’t have arbitrary limitations like 
a maximum of 16 open files, or 256 e-mail messages per mailbox. 
When superposing the fields of long, straight wires, the really inter- 
esting cases are one wire, two wires, and infinitely many wires. With 
an infinite number of wires, each carrying an infinitesimal current, 
we can create sheets of current, as in figure d. Such a sheet has a 
certain amount of current per unit length, which we notate 7 (Greek 
letter eta). The setup is similar to example 8, except that all the 
currents are in the same direction, and instead of adding up two 
fields, we add up an infinite number of them by doing an integral. 


perspective view 
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d/A sheet of charge. 
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where in the last step we have used the identity tan7!(—r) = 
—tan~!x, combined with the relation tan! b/z + tan-ta/z = 4, 
which can be verified with a little geometry and trigonometry. The 
calculation of B, is left as an exercise (problem 23). More interest- 
ing is what happens underneath the sheet: by the right-hand rule, 
all the currents make rightward contributions to the field there, so 


a sheet of charge B, abruptly reverses itself as we pass through the sheet. 


E, =+2rko Close to the sheet, the angle y approaches 7, so we have 
Sets ste ies py = 20 
<« ttt +ts > c 
at Na Be Va Veen Figure e shows the similarity between this result and the result for 
a sheet of charge. In one case the sources are charges and the field 
a sheet of current is electric; in the other case we have currents and magnetic fields. 
a +2nkn/c@ In both cases we find that the field changes suddenly when we pass 
through a sheet of sources, and the amount of this change doesn’t 
g <<< << ee € XK depend on the size of the sheet. It was this type of reasoning that 
FATOROROKOXOKORONORONO! eventually led us to Gauss’ law in the case of electricity, and in 
a ee eee section 11.3 we will see that a similar approach can be used with 
magnetism. The difference is that, whereas Gauss’ law involves the 
e/A sheet of charge and a flux, a measure of how much the field spreads out, the corresponding 
sheet of current. law for magnetism will measure how much the field curls. 


Is it just dumb luck that the magnetic-field case came out so 
similar to the electric field case? Not at all. We’ve already seen 
that what one observer perceives as an electric field, another ob- 
server may perceive as a magnetic field. An observer flying along 
above a charged sheet will say that the charges are in motion, and 
will therefore say that it is both a sheet of current and a sheet of 
charge. Instead of a pure electric field, this observer will experience 
a combination of an electric field and a magnetic one. (We could 
also construct an example like the one in figure c on page 602, in 
which the field was purely magnetic.) 
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11.2.2 Energy in the magnetic field 


In section 10.4, I’ve already argued that the energy density of 
the magnetic field must be proportional to |B|?, which we can write 
as B? for convenience. To pin down the constant of proportionality, 
we now need to do something like the argument on page 536: find 
one example where we can calculate the mechanical work done by 
the magnetic field, and equate that to the amount of energy lost by 
the field itself. The easiest example is two parallel sheets of charge, 
with their currents in opposite directions. Homework problem 53 is 
such a calculation, which gives the result 


C2 
dU,, = —~ B?’dv 


11.2.3 Superposition of dipoles 


To understand this subsection, you’ll have to have studied sec- 
tion 4.2.4, on iterated integrals. 


The distant field of a dipole, in its midplane 


Most current distributions cannot be broken down into long, 
straight wires, and subsection 11.2.1 has exhausted most of the in- 
teresting cases we can handle in this way. A much more useful 
building block is a square current loop. We have already seen how 
the dipole moment of an irregular current loop can be found by 
breaking the loop down into square dipoles (figure 1 on page 608), 
because the currents in adjoining squares cancel out on their shared 
edges. Likewise, as shown in figure f, if we could find the magnetic 
field of a square dipole, then we could find the field of any planar 
loop of current by adding the contributions to the field from all the 
squares. 


The field of a square-loop dipole is very complicated close up, 
but luckily for us, we only need to know the current at distances 
that are large compared to the size of the loop, because we’re free 
to make the squares on our grid as small as we like. The distant field 
of a square dipole turns out to be simple, and is no different from the 
distant field of any other dipole with the same dipole moment. We 
can also save ourselves some work if we only worry about finding the 
field of the dipole in its own plane, i.e., the plane perpendicular to 
its dipole moment. By symmetry, the field in this plane cannot have 
any component in the radial direction (inward toward the dipole, or 
outward away from it); it is perpendicular to the plane, and in the 
opposite direction compared to the dipole vector. (The field inside 
the loop is in the same direction as the dipole vector, but we’re 
interested in the distant field.) Letting the dipole vector be along 
the z axis, we find that the field in the x — y plane is of the form 
B. = f(r), where f(r) is some function that depends only on r, the 
distance from the dipole. 


f/ The field of any planar current 
loop can be found by breaking it 


down into square dipoles. 
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g/A_ long, — straight current- 
carrying wire can be constructed 
by filling half of a plane with 
square dipoles. 
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h/ Setting up the _ integral. 
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We can pin down the result even more without any math. We 
know that the magnetic field made by a current always contains a 
factor of k/c?, which is the coupling constant for magnetism. We 
also know that the field must be proportional to the dipole moment, 
m = IA. Fields are always directly proportional to currents, and 
the proportionality to area follows because dipoles add according 
to their area. For instance, a square dipole that is 2 micrometers 
by 2 micrometers in size can be cut up into four dipoles that are 
1 micrometer on a side. This tells us that our result must be of 
the form B, = (k/c*)(IA)g(r). Now if we multiply the quantity 
(k/c”)(IA) by the function g(r), we have to get units of teslas, and 
this only works out if g(r) has units of m~* (homework problem 15), 
so our result must be of the form 


BkIA 
8 Bs 


where £ is a unitless constant. Thus our only task is to determine 
2, and we will have determined the field of the dipole (in the plane 
of its current, i.e., the midplane with respect to its dipole moment 
vector). 


If we wanted to, we could simply build a dipole, measure its 
field, and determine 6 empirically. Better yet, we can get an exact 
result if we take a current loop whose field we know exactly, break 
it down into infinitesimally small squares, integrate to find the total 
field, set this result equal to the known expression for the field of the 
loop, and solve for 6. There’s just one problem here. We don’t yet 
know an expression for the field of any current loop of any shape — 
all we know is the field of a long, straight wire. Are we out of luck? 
No, because, as shown in figure g, we can make a long, straight 
wire by putting together square dipoles! Any square dipole away 
from the edge has all four of its currents canceled by its neighbors. 
The only currents that don’t cancel are the ones on the edge, so by 
superimposing all the square dipoles, we get a straight-line current. 


This might seem strange. If the squares on the interior have all 
their currents canceled out by their neighbors, why do we even need 
them? Well, we need the squares on the edge in order to make the 
straight-line current. We need the second row of squares to cancel 
out the currents at the top of the first row of squares, and so on. 


Integrating as shown in figure h, we have 
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where dB, is the contribution to the total magnetic field at our point 
of interest, which lies a distance R from the wire. 
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This can be simplified with the substitutions x = Ru, y = Rv, and 
da dy = R? du dv: 
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The wu integral is of the form [°° (u? + b) 3/2 du = 2/b?, so 


p. — BRE 1 
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and the remaining v integral is equals 2, so 


2BkI 
B. = 
c?R 


This is the field of a wire, which we already know equals 2kI/c?R, 
so we have G=1. Remember, the point of this whole calculation 
was not to find the field of a wire, which we already knew, but 
to find the unitless constant @ in the expression for the field of a 
dipole. The distant field of a dipole, in its midplane, is therefore 
B, = BkIA/c?r? = kIA/c?r, or, in terms of the dipole moment, 


km 
ers 


The distant field of a dipole, out of its midplane 


What about the field of a magnetic dipole outside of the dipole’s 
midplane? Let’s compare with an electric dipole. An electric dipole, 
unlike a magnetic one, can be built out of two opposite monopoles, 
i.e., charges, separated by a certain distance, and it is then straight- 
forward to show by vector addition that the field of an electric dipole 
is 


E,=kD (3 cos” 6 — 1) r 
Ep = kD (3sin@ cos) r~? ; 
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where r is the distance from the dipole to the point of interest, 0 
is the angle between the dipole vector and the line connecting the 
dipole to this point, and FE, and Ep are, respectively, the compo- 
nents of the field parallel to and perpendicular to the dipole vector. 


In the midplane, 6 equals 7/2, which produces E, = —kDr~3 
and Er = 0. This is the same as the field of a magnetic dipole in its 
midplane, except that the electric coupling constant k replaces the 
magnetic version k/c?, and the electric dipole moment D is substi- 
tuted for the magnetic dipole moment m. It is therefore reasonable 
to conjecture that by using the same presto-change-o recipe we can 
find the field of a magnetic dipole outside its midplane: 


km 
= eo 


B,= (3 cos? 9 — 1) r3 


a= km 


ge (3sin 6 cos@)r~? 


This turns out to be correct. ® 


Concentric, counterrotating currents example 9 
> Two concentric circular current loops, with radii a and b, carry 
the same amount of current /, but in opposite directions. What is 
the field at the center? 


> We can produce these currents by tiling the region between the 
circles with square current loops, whose currents all cancel each 
other except at the inner and outer edges. The flavor of the calcu- 
lation is the same as the one in which we made a line of current 
by filling a half-plane with square loops. The main difference is 
that this geometry has a different symmetry, so it will make more 
sense to use polar coordinates instead of x and y. The field at 
the center is 


The positive sign indicates that the field is out of the page. 


°If you’ve taken a course in differential equations, this won’t seem like a very 
surprising assertion. The differential form of Gauss’ law is a differential equation, 
and by giving the value of the field in the midplane, we’ve specified a boundary 
condition for the differential equation. Normally if you specify the boundary 
conditions, then there is a unique solution to the differential equation. In this 
particular case, it turns out that to ensure uniqueness, we also need to demand 
that the solution satisfy the differential form of Ampére’s law, which is discussed 
in section 11.4. 
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Field at the center of a circular loop example 10 
> What is the magnetic field at the center of a circular current loop 
of radius a, which carries a current /? 


> This is like example 9, but with the outer loop being very large, 
and therefore too distant to make a significant field at the center. 
Taking the limit of that result as b approaches infinity, we have 


amk! 


7" a 


Comparing the results of examples 9 and 10, we see that the 
directions of the fields are both out of the page. In example 9, the 
outer loop has a current in the opposite direction, so it contributes 
a field that is into the page. This, however, is weaker than the field 
due to the inner loop, which dominates because it is less distant. 


11.2.4 The Biot-Savart law (optional) 


In section 11.2.3 we developed a method for finding the field due 
to a given current distribution by tiling a plane with square dipoles. 
This method has several disadvantages: 


e The currents all have to lie in a single plane, and the point at 
which we’re computing the field must be in that plane as well. 


e We need to do integral over an area, which means one integral 
inside another, e.g., [ [...dzdy. That can get messy. 


e It’s physically bizarre to have to construct square dipoles in 
places where there really aren’t any currents. 


Figure k shows the first step in eliminating these defects: instead 
of spreading our dipoles out in a plane, we bring them out along an 
axis. As shown in figure l, this eliminates the restriction to currents 
that lie in a plane. Now we have to use the general equations for 
a dipole field from page 622, rather than the simpler expression for 
the field in the midplane of a dipole. This increase in complication 
is more than compensated for by a fortunate feature of the new 
geometry, which is that the infinite tube can be broken down into 
strips, and we can find the field of such a strip for once and for all. 
This means that we no longer have to do one integral inside another. 
The derivation of the most general case is a little messy, so Ill just 
present the case shown in figure m, where the point of interest is 
assumed to lie in the y — z plane. Intuitively, what we’re really 
finding is the field of the short piece of length dé on the end of the 
U; the two long parallel segments are going to be canceled out by 
their neighbors when we assemble all the strips to make the tube. 
We expect that the field of this end-piece will form a pattern that 
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k / Two ways of making a current 
loop out of square dipoles. 
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|1/The new method can _ han- 
dle non-planar currents. 
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m/The field of an infinite U. 


n/The geometry of the Biot- 
Savart law. The small arrows 
show the result of the Biot-Savart 
law at various positions relative 
to the current segment dé. The 
Biot-Savart law involves a cross 
product, and the right-hand 
rule for this cross product is 
demonstrated for one case. 
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circulates around the y axis, so at the point of interest, it’s really 
the x component of the field that we want to compute: 


dB, = [ aBpcose 
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In the more general case, 1, the current loop is not planar, the point 
of interest is not in the end-planes of the U’s, and the U shapes 
have their ends staggered, so the end-piece dé is not the only part of 
each U whose current is not canceled. Without going into the gory 
details, the correct general result is as follows: 


which is known as the Biot-Savart law. (It rhymes with “leo bazaar.” 
Both t’s are silent.) The distances dé and r are now defined as 
vectors, d@ and r, which point, respectively, in the direction of the 
current in the end-piece and the direction from the end-piece to 
the point of interest. The new equation looks different, but it is 
consistent with the old one. The vector cross product dé x r has a 
magnitude r dé sing, which cancels one of r’s in the denominator 
and makes the dé x r/r? into a vector with magnitude dé sin ¢/r?. 


The field at the center of a circular loop example 11 
Previously we had to do quite a bit of work (examples 9 and 10), 
to calculate the field at the center of a circular loop of current of 
radius a. It’s much easier now. Dividing the loop into many short 
segments, each dé is perpendicular to the r vector that goes from 
it to the center of the circle, and every r vector has magnitude a. 
Therefore every cross product dé x r has the same magnitude, 
ad, as well as the same direction along the axis perpendicular 
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to the loop. The field is 


kladé 
B= 
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Out-of-the-plane field of a circular loop example 12 
> What is the magnetic field of a circular loop of current at a point 
on the axis perpendicular to the loop, lying a distance z from the 
loop’s center? 

> Again, let’s write a for the loop’s radius. The r vector now has 
magnitude Vv a2 + z?, but it is still perpendicular to the dé vector. 
By symmetry, the only nonvanishing component of the field is 
along the Z axis, 


B. = | |B cos « 
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Is it the field of a particle? 


We have a simple equation, based on Coulomb’s law, for the elec- 
tric field surrounding a charged particle. Looking at figure n, we can 
imagine that if the current segment dé was very short, then it might 
only contain one electron. It’s tempting, then, to interpret the Biot- 
Savart law as a similar equation for the magnetic field surrounding a 
moving charged particle. Tempting but wrong! Suppose you stand 
at a certain point in space and watch a charged particle move by. 
It has an electric field, and since it’s moving, you will also detect 
a magnetic field on top of that. Both of these fields change over 
time, however. Not only do they change their magnitudes and di- 
rections due to your changing geometric relationship to the particle, 
but they are also time-delayed, because disturbances in the electro- 
magnetic field travel at the speed of light, which is finite. The fields 
you detect are the ones corresponding to where the particle used 
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to be, not where it is now. Coulomb’s law and the Biot-Savart law 
are both false in this situation, since neither equation includes time 
as a variable. It’s valid to think of Coulomb’s law as the equation 
for the field of a stationary charged particle, but not a moving one. 
The Biot-Savart law fails completely as a description of the field of 
a charged particle, since stationary particles don’t make magnetic 
fields, and the Biot-Savart law fails in the case where the particle is 
moving. 


If you look back at the long chain of reasoning that led to the 
Biot-Savart law, it all started from the relativistic arguments at the 
beginning of this chapter, where we assumed a steady current in an 
infinitely long wire. Everything that came later was built on this 
foundation, so all our reasoning depends on the assumption that the 
currents are steady. In a steady current, any charge that moves away 
from a certain spot is replaced by more charge coming up behind it, 
so even though the charges are all moving, the electric and magnetic 
fields they produce are constant. Problems of this type are called 
electrostatics and magnetostatics problems, and it is only for these 
problems that Coulomb’s law and the Biot-Savart law are valid. 


You might think that we could patch up Coulomb’s law and the 
Biot-Savart law by inserting the appropriate time delays. However, 
we’ve already seen a clear example of a phenomenon that wouldn’t 
be fixed by this patch: on page 552, we found that a changing 
magnetic field creates an electric field. Induction effects like these 
also lead to the existence of light, which is a wave disturbance in the 
electric and magnetic fields. We could try to apply another band- 
aid fix to Coulomb’s law and the Biot-Savart law to make them deal 
with induction, but it won’t work. 


So what are the fundamental equations that describe how sources 
give rise to electromagnetic fields? We’ve already encountered two 
of them: Gauss’ law for electricity and Gauss’ law for magnetism. 
Experiments show that these are valid in all situations, not just 
static ones. But Gauss’ law for magnetism merely says that the 
magnetic flux through a closed surface is zero. It doesn’t tell us 
how to make magnetic fields using currents. It only tells us that we 
can’t make them using magnetic monopoles. The following section 
develops a new equation, called Ampére’s law, which is equivalent 
to the Biot-Savart law for magnetostatics, but which, unlike the 
Biot-Savart law, can easily be extended to nonstatic situations. 
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11.3 Magnetic Fields by Ampeére’s Law 


11.3.1 Ampere’s law 


As discussed at the end of subsection 11.2.4, our goal now is to 
find an equation for magnetism that, unlike the Biot-Savart law, will 
not end up being a dead end when we try to extend it to nonstatic 
situations.° Experiments show that Gauss’ law is valid in both static 
and nonstatic situations, so it would be reasonable to look for an 
approach to magnetism that is similar to the way Gauss’ law deals 
with electricity. 


How can we do this? Figure a, reproduced from page 618, is our 
roadmap. Electric fields spread out from charges. Magnetic fields 
curl around currents. In figure b/1, we define a Gaussian surface, 
and we define the flux in terms of the electric field pointing out 
through this surface. In the magnetic case, b/2, we define a surface, 
called an Ampeérian surface, and we define a quantity called the 
circulation, [ (uppercase Greek gamma), in terms of the magnetic 
field that points along the edge of the Ampérian surface, c. We 
break the edge into tiny parts s;, and for each of these parts, we 
define a contribution to the circulation using the dot product of ds 
with the magnetic field: 

r=) °s;-B 


The circulation is a measure of how curly the field is. Like a Gaus- 
sian surface, an Ampérian surface is purely a mathematical con- 
struction. It is not a physical object. 


In figure b/2, the field is perpendicular to the edges on the ends, 
but parallel to the top and bottom edges. A dot product is zero 
when the vectors are perpendicular, so only the top and bottom 
edges contribute to I. Let these edges have length s. Since the 
field is constant along both of these edges, we don’t actually have 
to break them into tiny parts; we can just have s; on the top edge, 
pointing to the left, and sg on the bottom edge, pointing to the right. 
The vector s; is in the same direction as the field B,, and sg is in 
the same direction as Bz, so the dot products are simply equal to 
the products of the vectors’ magnitudes. The resulting circulation 
is 


= |s:||Bi| + |s2||Ba| 
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But ns is (current /length) (length), i-e., it is the amount of current 
that pierces the Amperian surface. We’ll call this current [tnrough- 


°If you didn’t read this optional subsection, don’t worry, because the point is 
that we need to try a whole new approach anyway. 
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a/ The electric field of a sheet of 
charge, and the magnetic field of 
a sheet of current. 


(1) sheet of 
So charge 

(2) sheet of 
current 


b/A Gaussian surface and 
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c/The definition of the circu- 
lation, T. 
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We have found one specific example of the general law of nature 
known as Ampére’s law: 


Atk 
T= ee Linrough 


Positive and negative signs 


Figures d/1 and d/2 show what happens to the circulation when 
we reverse the direction of the current Itnrougn. Reversing the cur- 
rent causes the magnetic field to reverse itself as well. The dot 
products occurring in the circulation are all negative in d/2, so the 
total circulation is now negative. To preserve Ampére’s law, we need 
to define the current in d/2 as a negative number. In general, deter- 
mine these plus and minus signs using the right-hand rule shown in 
the figure. As the fingers of your hand sweep around in the direction 
of the s vectors, your thumb defines the direction of current which is 
positive. Choosing the direction of the thumb is like choosing which 
way to insert an ammeter in a circuit: on a digital meter, reversing 
the connections gives readings which are opposite in sign. 


A solenoid example 13 
> What is the field inside a long, straight solenoid of length @ and 
radius a, and having N loops of wire evenly wound along it, which 
carry acurrent /? 


> This is an interesting example, because it allows us to get a 
very good approximation to the field, but without some experi- 
mental input it wouldn’t be obvious what approximation to use. 
Figure e/1 shows what we’d observe by measuring the field of a 
real solenoid. The field is nearly constant inside the tube, as long 
as we stay far away from the mouths. The field outside is much 
weaker. For the sake of an approximate calculation, we can ideal- 
ize this field as shown in figure e/2. Of the edges of the Ampeérian 
surface shown in e/3, only AB contributes to the flux — there is 
zero field along CD, and the field is perpendicular to edges BC 
and DA. Ampére’s law gives 


Ark 
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(B)(length of AB) = a (n)(length of AB) 


self-check D 


What direction is the current in figure e? > Answer, p. 871 
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self-check E 
Based on how £¢ entered into the derivation in example 13, how should 
it be interpreted? Is it the total length of the wire? > Answer, p. 872 


self-check F 

Surprisingly, we never needed to know the radius of the solenoid in 
example 13. Why is it physically plausible that the answer would be 
independent of the radius? > Answer, p. 872 


Example 13 shows how much easier it can sometimes be to cal- 
culate a field using Ampére’s law rather than the approaches de- 
veloped previously in this chapter. However, if we hadn’t already 
known something about the field, we wouldn’t have been able to 
get started. In situations that lack symmetry, Ampére’s law may 
make things harder, not easier. Anyhow, we will have no choice in 
nonstatic cases, where Ampére’s law is true, and static equations 
like the Biot-Savart law are false. 


11.3.2 A quick and dirty proof 


Here’s an informal sketch for a proof of Ampére’s law, with no 
pretensions to rigor. Even if you don’t care much for proofs, it would 
be a good idea to read it, because it will help to build your ability 
to visualize how Ampére’s law works. 


First we establish by a direct computation (homework problem 
26) that Ampére’s law holds for the geometry shown in figure f/1, 
a circular Ampérian surface with a wire passing perpendicularly 
through its center. If we then alter the surface as in figure f/2, 
Ampére’s law still works, because the straight segments, being per- 
pendicular to the field, don’t contribute to the circulation, and the 
new arc makes the same contribution to the circulation as the old 
one it replaced, because the weaker field is compensated for by the 
greater length of the arc. It is clear that by a series of such modifi- 
cations, we could mold the surface into any shape, f/3. 


Next we prove Ampére’s law in the case shown in figure f/4: 
a small, square Ampérian surface subject to the field of a distant 
square dipole. This part of the proof can be most easily accom- 
plished by the methods of section 11.4. It should, for example, be 
plausible in the case illustrated here. The field on the left edge is 
stronger than the field on the right, so the overall contribution of 
these two edges to the circulation is slightly counterclockwise. How- 
ever, the field is not quite perpendicular to the top and bottoms 
edges, so they both make small clockwise contributions. The clock- 
wise and counterclockwise parts of the circulation end up canceling 
each other out. Once Ampére’s law is established for a square sur- 
face like f/4, it follows that it is true for an irregular surface like f/5, 
since we can build such a shape out of squares, and the circulations 
are additive when we paste the surfaces together this way. 


By pasting a square dipole onto the wire, f/6, like a flag attached 
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to a flagpole, we can cancel out a segment of the wire’s current 
and create a detour. Ampére’s law is still true because, as shown 
in the last step, the square dipole makes zero contribution to the 
circulation. We can make as many detours as we like in this manner, 
thereby morphing the wire into an arbitrary shape like f/7. 


What about a wire like f/8? It doesn’t pierce the Ampérian sur- 
face, so it doesn’t add anything to [jprough, and we need to show that 
it likewise doesn’t change the circulation. This wire, however, could 
be built by tiling the half-plane on its right with square dipoles, and 
we’ve already established that the field of a distant dipole doesn’t 
contribute to the circulation. (Note that we couldn’t have done this 
with a wire like f/7, because some of the dipoles would have been 
right on top of the Ampérian surface.) 


If Ampére’s law holds for cases like f/7 and f/8, then it holds 
for any complex bundle of wires, including some that pass through 
the Ampérian surface and some that don’t. But we can build just 
about any static current distribution we like using such a bundle of 
wires, so it follows that Ampére’s law is valid for any static current 
distribution. 


11.3.3 Maxwell’s equations for static fields 


Static electric fields don’t curl the way magnetic fields do, so we 
can state a version of Ampére’s law for electric fields, which is that 
the circulation of the electric field is zero. Summarizing what we 
know so far about static fields, we have 


Op = 4rkdin 
p= 0 
Tz=0 
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This set of equations is the static case of the more general relations 
known as Maxwell’s equations. On the left side of each equation, we 
have information about a field. On the right is information about 
the field’s sources. 


It is vitally important to realize that these equations are only 
true for statics. They are incorrect if the distribution of charges 
or currents is changing over time. For example, we saw on page 
552 that the changing magnetic field in an inductor gives rise to 
an electric field. Such an effect is completely inconsistent with the 
static version of Maxwell’s equations; the equations don’t even refer 
to time, so if the magnetic field is changing over time, they will 
not do anything special. The extension of Maxwell’s equations to 
nonstatic fields is discussed in section 11.6. 
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Discussion Questions 


A Figure g/1 shows a wire with a circular Amperian surface drawn 
around its waist; in this situation, Ampére’s law can be verified easily 
based on the equation for the field of a wire. In panel 2, a second wire has 
been added. Explain why it’s plausible that Ampére’s law still holds. 


B- Figure h is like figure g, but now the second wire is perpendicular 
to the first, and lies in the plane of, and outside of, the Ampérian surface. 
Carry out a similar analysis. 


C_ This discussion question is similar to questions A and B, but now the 
Ampérian surface has been moved off center. 


D_ The left-hand wire has been nudged over a little. Analyze as before. 


E You know what to do. 


g / Discussion question A. 


0 


as 


h/ Discussion question B. 


k / Discussion question E. 
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11.4 Ampere’s Law In Differential Form (Optional) 


11.4.1 The curl operator 


The differential form of Gauss’ law is more physically satisfying 
than the integral form, because it relates the charges that are present 
at some point to the properties of the electric field at the same point. 
Likewise, it would be more attractive to have a differential version 
of Ampére’s law that would relate the currents to the magnetic field 
at a single point. intuitively, the divergence was based on the idea 
of the div-meter, a/1. The corresponding device for measuring the 
curliness of a field is the curl-meter, a/2. If the field is curly, then 
the torques on the charges will not cancel out, and the wheel will 
twist against the resistance of the spring. If your intuition tells you 
that the curlmeter will never do anything at all, then your intuition 
is doing a beautiful job on static fields; for nonstatic fields, however, 
it is perfectly possible to get a curly electric field. 


Gauss’ law in differential form relates div E, a scalar, to the 
charge density, another scalar. Ampére’s law, however, deals with 
directions in space: if we reverse the directions of the currents, it 
makes a difference. We therefore expect that the differential form of 
Ampére’s law will have vectors on both sides of the equal sign, and 
we should be thinking of the curl-meter’s result as a vector. First 
we find the orientation of the curl-meter that gives the strongest 
torque, and then we define the direction of the curl vector using the 
right-hand rule shown in figure a/3. 


direction of 
the curl vector 


a/The div-meter, 1, and the To convert the div-meter concept to a mathematical definition, 

curl-meter, 2 and 3. we found the infinitesimal flux, d® through a tiny cubical Gaussian 
surface containing a volume dv. By analogy, we imagine a tiny 
square Ampérian surface with infinitesimal area dA. We assume this 
surface has been oriented in order to get the maximum circulation. 
The area vector dA will then be in the same direction as the one 
defined in figure a/3. Ampére’s law is 


Atk 
dP = ee dLihrough 


We define a current density per unit area, j, which is a vector 
pointing in the direction of the current and having magnitude j = 
dI/|dA|. In terms of this quantity, we have 


Ark ., 
dr = — jij] |dA| 
c 
dr = Ark ii 
dA) 2 
With this motivation, we define the magnitude of the curl as 


dr 


1 B] = — 
|curl B| dA 
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Note that the curl, just like a derivative, has a differential divided 
by another differential. In terms of this definition, we find Ampére’s 
law in differential form: 


4 
curl B = ——j 
c 


The complete set of Maxwell’s equations in differential form is col- 
lected on page 859. 


11.4.2 Properties of the curl operator 
The curl is a derivative. 


As an example, let’s calculate the curl of the field x shown in 
figure c. For our present purposes, it doesn’t really matter whether 
this is an electric or a magnetic field; we’re just getting out feet wet 
with the curl as a mathematical definition. Applying the definition 
of the curl directly, we construct an Ampérian surface in the shape 
of an infinitesimally small square. Actually, since the field is uni- 
form, it doesn’t even matter very much whether we make the square 
finite or infinitesimal. The right and left edges don’t contribute to 
the circulation, since the field is perpendicular to these edges. The 
top and bottom do contribute, but the top’s contribution is clock- 
wise, i.e., into the page according to the right-hand rule, while the 
bottom contributes an equal amount in the counterclockwise direc- 
tion, which corresponds to an out-of-the-page contribution to the 
curl. They cancel, and the circulation is zero. We could also have 
determined this by imagining a curl-meter inserted in this field: the 
torques on it would have canceled out. 


It makes sense that the curl of a constant field is zero, because 
the curl is a kind of derivative. The derivative of a constant is zero. 


The curl is rotationally invariant. 


Figure c looks just like figure c, but rotated by 90 degrees. Phys- 
ically, we could be viewing the same field from a point of view that 
was rotated. Since the laws of physics are the same regardless of 
rotation, the curl must be zero here as well. In other words, the curl 
is rotationally invariant. If a certain field has a certain curl vector, 
then viewed from some other angle, we simply see the same field and 
the same curl vector, viewed from a different angle. A zero vector 
viewed from a different angle is still a zero vector. 


As a less trivial example, let’s compute the curl of the field F = 
xy shown in figure e, at the point (« = 0,y = 0). The circulation 
around a square of side s centered on the origin can be approximated 


L, 


Z 


b/ The coordinate 


system 


used in the following examples. 
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c/ The field x. 
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e/ The field xy. 
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f / The field —yx. 
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g/ Example 14. 


h/ Example 15. 
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by evaluating the field at the midpoints of its sides, 


= s/2 y= F = (s/2)y 81: F = 57/2 
re) p= s2 F=0 so: F=0 
g=—s/2 y=0 F = —(s/2)y s3-F = s7/2 
£=D y=—s/2 F=0 s4:-F=0 : 


which gives a circulation of s?, and a curl with a magnitude of 
s*/area = s*/s? = 1. By the right-hand rule, the curl points out of 
the page, i.e., along the positive z axis, so we have 


curl cy =2 


Now consider the field —yx, shown in figure f. This is the same 
as the previous field, but with your book rotated by 90 degrees about 
the z axis. Rotating the result of the first calculation, z, about the 
z axis doesn’t change it, so the curl of this field is also z. 


Scaling 


When yow’re taking an ordinary derivative, you have the rule 


d 


cf ()| =e f(2) 


| 
dx 
In other words, multiplying a function by a constant results in a 
derivative that is multiplied by that constant. The curl is a kind of 


derivative operator, and the same is true for a curl. 


Multiplying the field by —1. 
> What is the curl of the field —xy at the origin? 


example 14 


> Using the scaling property just discussed, we can make this into 
a curl that we’ve already calculated: 


curl (—xy) = —curl (xy) 
=-2 


This is in agreement with the right-hand rule. 


The curl is additive. 


We have only calculated each field’s curl at the origin, but each 
of these fields actually has the same curl everywhere. In example 
14, for instance, it is obvious that the curl is constant along any 
vertical line. But even if we move along the « axis, there is still an 
imbalance between the torques on the left and right sides of the curl- 
meter. More formally, suppose we start from the origin and move 
to the left by one unit. We find ourselves in a region where the field 
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is very much as it was before, except that all the field vectors have 
had one unit worth of y added to them. But what do we get if we 
take the curl of —xy + y? The curl, like any god-fearing derivative 
operation, has the additive property 


curl (F + G) = curl F+ curl G ; 
so 
curl (—zy + y) = curl (—xy) + curl (y) 
But the second term is zero, so we get the same result as at the 
origin. 
A field that goes in a circle example 15 
> What is the curl of the field xy — yx? 


> Using the linearity of the curl, and recognizing each of the terms 
as one whose curl we have already computed, we find that this 
field’s curl is a constant 22. This agrees with the right-hand rule. 


The field inside a long, straight wire example 16 
> What is the magnetic field inside a long, straight wire in which 
the current density is /? 


> Let the wire be along the z axis, so j = /Z. Ampére’s law gives 


nA 


4n 
curl B= a it 


In other words, we need a magnetic field whose curl is a constant. 
We’ve encountered several fields with constant curls, but the only 
one that has the same symmetry as the cylindrical wire is xy— yx, 
so the answer must be this field or some constant multiplied by it, 


B = b(xy— yX) 
The curl of this field is 2bZ, so 


C 


and thus 


The curl in component form 


Now consider the field 


F, =ar+by+e 
F,y=dr+eyt+f ; 
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\, 


i/A cyclic permutation of x, 


y, and z. 
a ie eae oe 
t t hing 
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a eae 
j/ Example 17. 
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F = arx + byx+ cx +dzry + eyy + fy 


The only terms whose curls we haven’t yet explicitly computed are 
the a, e, and f terms, and their curls turn out to be zero (homework 
problem 50). Only the b and d terms have nonvanishing curls. The 
curl of this field is 


curl F = curl (byx) + curl (dry) 
= bcurl (yx) + dcurl (zy) [scaling] 
= b(—z) + d(z) [found previously] 
= (d— b)z 


But any field in the x — y plane can be approximated with this 
type of field, as long as we only need to get a good approximation 
within a small region. The infinitesimal Ampérian surface occurring 
in the definition of the curl is tiny enough to fit in a pretty small re- 
gion, so we can get away with this here. The d and 6 coefficients can 
then be associated with the partial derivatives OF, /0x and OF; /Oy. 
We therefore have 


for any field in the x—y plane. In three dimensions, we just need to 
generate two more equations like this by doing a cyclic permutation 
of the variables x, y, and z: 


_ OF, OF, 
(curl F), = By OE. 
OF, “OF 
(curl F), = ae ae 
OF, OF; 
(curl F), in _ dy 
A sine wave example 17 


> Find the curl of the following electric field 
E = (sin x)y ; 
and interpret the result. 
> The only nonvanishing partial derivative occurring in this curl is 


dE, 
(curl E)z = —* 


=cosx., 
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sO 


curl E = cos2 


This is visually reasonable: the curl-meter would spin if we put its 
wheel in the plane of the page, with its axle poking out along the 
Z axis. In some areas it would spin clockwise, in others counter- 
clockwise, and this makes sense, because the cosine is positive 
in some placed and negative in others. 


This is a perfectly reasonable field pattern: it the electric field 
pattern of a light wave! But Ampeére’s law for electric fields says 
the curl of E is supposed to be zero. What’s going on? What's 
wrong is that we can’t assume the static version of Ampere’s law. 
All we've really proved is that this pattern is impossible as a static 
field: we can’t have a light wave that stands still. 


Figure k is a summary of the vector calculus presented in the 
optional sections of this book. The first column shows that one func- 
tion is a related to another by a kind of differentiation. The second 
column states the fundamental theorem of calculus, which says that 
if you integrate the derivative over the interior of a region, you get 
some information about the original function at the boundary of 
that region. 
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k/ Asummary of the derivative, gradient, curl, and divergence. 
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Induced Electric Fields 


Faraday’s experiment 


Nature is simple, but the simplicity may not become evident un- 
til a hundred years after the discovery of some new piece of physics. 
We've already seen, on page 552, that the time-varying magnetic 
field in an inductor causes an electric field. This electric field is not 
created by charges. That argument, however, only seems clear with 
hindsight. The discovery of this phenomenon of induced electric 
fields — fields that are not due to charges — was a purely exper- 
imental accomplishment by Michael Faraday (1791-1867), the son 
of a blacksmith who had to struggle against the rigid class struc- 
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ture of 19th century England. Faraday, working in 1831, had only a 
vague and general idea that electricity and magnetism were related 
to each other, based on Oersted’s demonstration, a decade before, 
that magnetic fields were caused by electric currents. 


Figure b is a simplified drawing of the following experiment, as 
described in Faraday’s original paper: “Two hundred and three feet 
of copper wire ... were passed round a large block of wood; [another] 
two hundred and three feet of similar wire were interposed as a spiral 
between the turns of the first, and metallic contact everywhere pre- 
vented by twine [insulation]. One of these [coils] was connected with 
a galvanometer [voltmeter], and the other with a battery... When 
the contact was made, there was a sudden and very slight effect at 
the galvanometer, and there was also a similar slight effect when 
the contact with the battery was broken. But whilst the ... current 
was continuing to pass through the one [coil], no ...effect ... upon 
the other [coil] could be perceived, although the active power of the 
battery was proved to be great, by its heating the whole of its own 
coil [through ordinary resistive heating] ...” 


From Faraday’s notes and publications, it appears that the situ- 
ation in figure b/3 was a surprise to him, and he probably thought 
it would be a surprise to his readers, as well. That’s why he offered 
evidence that the current was still flowing: to show that the bat- 
tery hadn’t just died. The induction effect occurred during the short 
time it took for the black coil’s magnetic field to be established, b/2. 
Even more counterintuitively, we get an effect, equally strong but in 
the opposite direction, when the circuit is broken, b/4. The effect 
occurs only when the magnetic field is changing, and it appears to 
be proportional to the derivative 0OB/Ot, which is in one direction 
when the field is being established, and in the opposite direction 
when it collapses. 


The effect is proportional to 0B/Ot, but what is the effect? A 
voltmeter is nothing more than a resistor with an attachment for 
measuring the current through it. A current will not flow through 
a resistor unless there is some electric field pushing the electrons, 
so we conclude that the changing magnetic field has produced an 
electric field in the surrounding space. Since the white wire is not 
a perfect conductor, there must be electric fields in it as well. The 
remarkable thing about the circuit formed by the white wire is that 
as the electrons travel around and around, they are always being 
pushed forward by electric fields. This violates the loop rule, which 
says that when an electron makes a round trip, there is supposed 
to be just as much “uphill” (moving against the electric field) as 
“downhill” (moving with it). That’s OK. The loop rule is only true 
for statics. Faraday’s experiments show that an electron really can 
go around and around, and always be going “downhill,” as in the 
famous drawing by M.C. Escher shown in figure c. That’s just what 
happens when you have a curly field. 


£20 BANK 
Sy ae 


ENGLAND 2k 


a/Faraday on a British ban- 


knote. 


b/Faraday’s experiment, 


sim- 


plified and shown with modern 


equipment. 
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c/ Detail from Ascending and 
Descending, M.C. Escher, 1960. 


Ey \ 
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d/The_ relationship between 
the change in the magnetic field, 
and the electric field it produces. 


e/The electric circulation is 
the sum of the voltmeter read- 
ings. 
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When a field is curly, we can measure its curliness using a circu- 
lation. Unlike the magnetic circulation Ig, the electric circulation 
Ig is something we can measure directly using ordinary tools. A 
circulation is defined by breaking up a loop into tiny segments, ds, 
and adding up the dot products of these distance vectors with the 
field. But when we multiply electric field by distance, what we get 
is an indication of the amount of work per unit charge done on a 
test charge that has been moved through that distance. The work 
per unit charge has units of volts, and it can be measured using a 
voltmeter, as shown in figure e, where I'g equals the sum of the volt- 
meter readings. Since the electric circulation is directly measurable, 
most people who work with circuits are more familiar with it than 
they are with the magnetic circulation. They usually refer to gp 
using the synonym “emf,” which stands for “electromotive force,” 
and notate it as €. (This is an unfortunate piece of terminology, 
because its units are really volts, not newtons.) The term emf can 
also be used when the path is not a closed loop. 


Faraday’s experiment demonstrates a new relationship 


where the negative sign is a way of showing the observed left-handed 
relationship, d. This is similar to the structure of of Ampére’s law: 


[pa Ithrough ’ 


which also relates the curliness of a field to something that is going 
on nearby (a current, in this case). 


It’s important to note that even though the emf, Iz, has units 
of volts, it isn’t a voltage. A voltage is a measure of the electrical 
energy a charge has when it is at a certain point in space. The curly 
nature of nonstatic fields means that this whole concept becomes 
nonsense. In a curly field, suppose one electron stays at home while 
its friend goes for a drive around the block. When they are reunited, 
the one that went around the block has picked up some kinetic 
energy, while the one who stayed at home hasn’t. We simply can’t 
define an electrical energy Ue = qV so that Ue+K stays the same for 
each electron. No voltage pattern, V, can do this, because then it 
would predict the same kinetic energies for the two electrons, which 
is incorrect. When we’re dealing with nonstatic fields, we need to 
think of the electrical energy in terms of the energy density of the 
fields themselves. 


It might sound as though an electron could get a free lunch 
by circling around and around in a curly electric field, resulting in 
a violation of conservation of energy. The following examples, in 
addition to their practical interest, both show that energy is in fact 
conserved. 
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The generator example 18 
A basic generator, f, consists of a permanent magnet that rotates 
within a coil of wire. The magnet is turned by a motor or crank, 
(not shown). As it spins, the nearby magnetic field changes. This 
changing magnetic field results in an electric field, which has a 
curly pattern. This electric field pattern creates a current that 
whips around the coils of wire, and we can tap this current to light 
the lightbulb. 


If the magnet was on a frictionless bearing, could we light the bulb 
for free indefinitely, thus violating conservation of energy? No. 
Mechanical work has to be done to crank the magnet, and that’s 
where the energy comes from. If we break the light-bulb circuit, 
it suddenly gets easier to crank the magnet! This is because 
the current in the coil sets up its own magnetic field, and that field 
exerts a torque on the magnet. If we stopped cranking, this torque 
would quickly make the magnet stop turning. 


self-check G 

When you're driving your car, the engine recharges the battery continu- 
ously using a device called an alternator, which is really just a generator. 
Why can’t you use the alternator to start the engine if your car’s battery 
is dead? > Answer, p. 872 


The transformer example 19 
In example 17 on page 494, we discussed the advantages of 
transmitting power over electrical lines using high voltages and 
low currents. However, we don’t want our wall sockets to oper- 
ate at 10000 volts! For this reason, the electric company uses a 
device called a transformer, g, to convert everything to lower volt- 
ages and higher currents inside your house. The coil on the input 
side creates a magnetic field. Transformers work with alternating 
current, so the magnetic field surrounding the input coil is always 
changing. This induces an electric field, which drives a current 
around the output coil. 


Since the electric field is curly, an electron can keep gaining more 
and more energy by circling through it again and again. Thus the 
output voltage can be controlled by changing the number of coils 
of wire on the output side. Changing the number of coils on the 
input side also has an effect (homework problem 33). 


In any case, conservation of energy guarantees that the amount 
of power on the output side must equal the amount put in orig- 
inally, linVin = lout Vout, SO NO matter what factor the voltage is 
reduced by, the current is increased by the same factor. 


Discussion Questions 


A Suppose the bar magnet in figure f on page 641 has a magnetic 
field pattern that emerges from its top, circling around and coming back 
in the bottom. This field is created by electrons orbiting atoms inside 


f / A generator. 


output 


low voltage 


= 5 current 


e 


g/A transformer. 
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matter 
its the coil or the permanent 
magnet that spins. Either way, we 
get a functioning generator. 
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whether 


the magnet. Are these atomic currents clockwise or counterclockwise as 
seen from above? In what direction is the current flowing in the circuit? 


We have a circling atomic current inside the circling current in the wires. 
When we have two circling currents like this, they will make torques on 
each other that will tend to align them in a certain way. Since currents in 
the same direction attract one another, which way is the torque made by 
the wires on the bar magnet? Verify that due to this torque, mechanical 
work has to be done in order to crank the generator. 


11.5.2 Why induction? 


Faraday’s results leave us in the dark about several things: 


e They don’t explain why induction effects occur. 


e The relationship 'g x —OB/0t tells us that a changing mag- 
netic field creates an electric field in the surrounding region of 
space, but the phrase “surrounding region of space” is vague, 
and needs to be made mathematical. 


Suppose that we can make the “surrounding region of space” 
idea more well defined. We would then want to know the pro- 
portionality constant that has been hidden by the « symbol. 
Although experiments like Faraday’s could be used to find a 
numerical value for this constant, we would like to know why 
it should have that particular value. 


We can get some guidance from the example of a car’s alternator 
(which just means generator), referred to in the self-check on page 
641. To keep things conceptually simple, I carefully avoided men- 
tioning that in a real car’s alternator, it isn’t actually the permanent 
magnet that spins. The coil is what spins. The choice of design h/1 
or h/2 is merely a matter of engineering convenience, not physics. 
All that matters is the relative motion of the two objects. 


This is highly suggestive. As discussed at the beginning of this 
chapter, magnetism is a relativistic effect. From arguments about 
relative motion, we concluded that moving electric charges create 
magnetic fields. Now perhaps we can use reasoning with the same 
flavor to show that changing magnetic fields produce curly electric 
fields. Note that figure h/2 doesn’t even require induction. The 
protons and electrons in the coil are moving through a magnetic 
field, so they experience forces. The protons can’t flow, because 
the coil is a solid substance, but the electrons can, so a current is 
induced.” 


Now if we’re convinced that figure h/2 produces a current in 
the coil, then it seems very plausible that the same will happen in 


"Note that the magnetic field never does work on a charged particle, because 
its force is perpendicular to the motion; the electric power is actually coming 
from the mechanical work that had to be done to spin the coil. Spinning the coil 
is more difficult due to the presence of the magnet. 
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figure h/1, which implies the existence of induction effects. But 
this example involves circular motion, so it doesn’t quite work as a 
way of proving that induction exists. When we say that motion is 
relative, we only mean straight-line motion, not circular motion. 


A more ironclad relativistic argument comes from the arrange- 
ment shown in figure i. This is also a generator — one that is 
impractical, but much easier to understand. 


Flea 1 doesn’t believe in this modern foolishness about induction. 
She’s sitting on the bar magnet, which to her is obviously at rest. As 
the square wire loop is dragged away from her and the magnet, its 
protons experience a force out of the page, because the cross product 
F = qv x Bis out of the page. The electrons, which are negatively 
charged, feel a force into the page. The conduction electrons are 
free to move, but the protons aren’t. In the front and back sides of 
the loop, this force is perpendicular to the wire. In the right and 
left sides, however, the electrons are free to respond to the force. 
Note that the magnetic field is weaker on the right side. It’s as 
though we had two pumps in a loop of pipe, with the weaker pump 
trying to push in the opposite direction; the weaker pump loses the 
argument.® We get a current that circulates around the loop.? There 
is no induction going on in this frame of reference; the forces that 
cause the current are just the ordinary magnetic forces experienced 
by any charged particle moving through a magnetic field. 


Flea 2 is sitting on the loop, which she considers to be at rest. 
In her frame of reference, it’s the bar magnet that is moving. Like 
flea 1, she observes a current circulating around the loop, but unlike 
flea 1, she cannot use magnetic forces to explain this current. As far 
as she is concerned, the electrons were initially at rest. Magnetic 
forces are forces between moving charges and other moving charges, 
so a magnetic field can never accelerate a charged particle starting 
from rest. A force that accelerates a charge from rest can only be 
an electric force, so she is forced to conclude that there is an electric 
field in her region of space. This field drives electrons around and 
around in circles, so it is apparently violating the loop rule — it is 
a curly field. What reason can flea 2 offer for the existence of this 
electric field pattern? Well, she’s been noticing that the magnetic 


SIf the pump analogy makes you uneasy, consider what would happen if all 
the electrons moved into the page on both sides of the loop. We’d end up with 
a net negative charge at the back side, and a net positive charge on the front. 
This actually would happen in the first nanosecond after the loop was set in 
motion. This buildup of charge would start to quench both currents due to 
electrical forces, but the current in the right side of the wire, which is driven by 
the weaker magnetic field, would be the first to stop. Eventually, an equilibrium 
will be reached in which the same amount of current is flowing at every point 
around the loop, and no more charge is being piled up. 

°The wire is not a perfect conductor, so this current produces heat. The 
energy required to produce this heat comes from the hands, which are doing 
mechanical work as they separate the magnet from the loop. 
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field in her region of space has been changing, possibly because that 
bar magnet over there has been getting farther away. She observes 
that a changing magnetic field creates a curly electric field. 


We therefore conclude that induction effects must exist based on 
the fact that motion is relative. If we didn’t want to admit induction 
effects, we would have to outlaw flea 2’s frame of reference, but 
the whole idea of relative motion is that all frames of reference are 
created equal, and there is no way to determine which one is really 
at rest. 


This whole line of reasoning was not available to Faraday and 
his contemporaries, since they thought the relative nature of mo- 
tion only applied to matter, not to electric and magnetic fields.!° 
But with the advantage of modern hindsight, we can understand 
in fundamental terms the facts that Faraday had to take simply as 
mysterious experimental observations. For example, the geometric 
relationship shown in figure d follows directly from the direction of 
the current we deduced in the story of the two fleas. 


11.5.3 Faraday’s law 


We can also answer the other questions posed on page 642. The 
divide-and-conquer approach should be familiar by now. We first 
determine the circulation ['g in the case where the wire loop is very 
tiny, j. Then we can break down any big loop into a grid of small 
ones; we’ve already seen that when we make this kind of grid, the 
circulations add together. Although we’ll continue to talk about a 
physical loop of wire, as in figure i, the tiny loop can really be just 
like the edges of an Ampérian surface: a mathematical construct 
that doesn’t necessarily correspond to a real object. 


In the close-up view shown in figure j, the field looks simpler. 
Just as a tiny part of a curve looks straight, a tiny part of this 
magnetic field looks like the field vectors are just getting shorter by 
the same amount with each step to the right. Writing dx for the 
width of the loop, we therefore have 


OB 
B(x +dz) — B(x) = Du dx 
for the difference in the strength of the field between the left and 
right sides. In the frame of reference where the loop is moving, a 
charge gq moving along with the loop at velocity v will experience 
a magnetic force Fg = quBy. In the frame moving along with 
the loop, this is interpreted as an electrical force, Fg = qEy. Ob- 
servers in the two frames agree on how much force there is, so in 
the loop’s frame, we have an electric field E = vBy. This field is 


They can’t be blamed too much for this. As a consequence of Faraday’s 
work, it soon became apparent that light was an electromagnetic wave, and to 
reconcile this with the relative nature of motion requires Einstein’s version of 
relativity, with all its subversive ideas how space and time are not absolute. 
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perpendicular to the front and back sides of the loop, BC and DA, 
so there is no contribution to the circulation along these sides, but 
there is a counterclockwise contribution to the circulation on CD, 
and smaller clockwise one on AB. The result is a circulation that is 
counterclockwise, and has an absolute value 


[Pe] = |B(x)dy — E(a + dx)dy| 
= |v[B(x) — B(x + da)]||dy 
= ad dady 
= < eal dady 
= ead dA 


Using a right-hand rule, the counterclockwise circulation is repre- 
sented by pointing one’s thumb up, but the vector 0B/0Ot is down. 
This is just a rephrasing of the geometric relationship shown in fig- 
ure d on page 640. We can represent the opposing directions using 
a minus sign, 


OB 
eens 
at 


Although this derivation was carried out with everything aligned 
in a specific way along the coordinate axes, it turns out that this 
relationship can be generalized as a vector dot product, 


Finally, we can take a finite-sized loop and break down the cir- 
culation around its edges into a grid of tiny loops. The circulations 
add, so we have 

OB; 
Tre=- ——!) .dA. 
= at j 
This is known as Faraday’s law. (I don’t recommend memorizing all 
these names.) Mathematically, Faraday’s law is very similar to the 
structure of Ampére’s law: the circulation of a field around the edges 
of a surface is equal to the sum of something that points through 
the 


If the loop itself isn’t moving, twisting, or changing shape, then 
the area vectors don’t change over time, and we can move the deriva- 
tive outside the sum, and rewrite Faraday’s law in a slightly more 
transparent form: 


a 
Tg= —5; 2 By dA; 
_ 8p 
at 
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A changing magnetic flux makes a curly electric field. You might 
think based on Gauss’ law for magnetic fields that @g would be 
identically zero. However, Gauss’ law only applies to surfaces that 
are closed, i.e., have no edges. 


self-check H 

Check that the units in Faraday’s law work out. An easy way to approach 
this is to use the fact that vB has the same units as E, which can be 
seen by comparing the equations for magnetic and electric forces used 
above. > Answer, p. 872 


A pathetic generator example 20 
> The horizontal component of the earth’s magnetic field varies 
from zero, at a magnetic pole, to about 10~* T near the equator. 
Since the distance from the equator to a pole is about 10’ m, 
we can estimate, very roughly, that the horizontal component of 
the earth’s magnetic field typically varies by about 10-'' T/m as 
you go north or south. Suppose you connect the terminals of 
a one-ohm lightbulb to each other with a loop of wire having an 
area of 1 m?. Holding the loop so that it lies in the east-west- 
up-down plane, you run straight north at a speed of 10 m/s, how 
much current will flow? Next, repeat the same calculation for the 
surface of a neutron star. The magnetic field on a neutron star is 
typically 10° T, and the radius of an average neutron star is about 
104 m. 


> Let’s work in the frame of reference of the running person. In 
this frame of reference, the earth is moving, and therefore the 
local magnetic field is changing in strength by 10~° T/s. This rate 
of change is almost exactly the same throughout the interior of the 
loop, SO we can dispense with the summation, and simply write 


Faraday’s law as a 

Le= oF A 
Since what we estimated was the rate of change of the horizontal 
component, and the vector A is horizontal (perpendicular to the 
loop), we can find this dot product simply by multiplying the two 
numbers: 


Te = (10-9 T/s)(1 m?) 
= 107° T-m?/s 
=-10-29v 


This is certainly not enough to light the bulb, and would not even 
be easy to measure using the most sensitive laboratory instru- 
ments. 


Now what about the neutron star? We'll pretend you’re tough 
enough that its gravity doesn’t instantly crush you. The spatial 
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variation of the magnetic field is on the order of (109 T/10* m) = 
10° T/m. If you can run north at the same speed of 10 m/s, then 
in your frame of reference there is a temporal (time) variation of 
about 10® T/s, and a calculation similar to the previous one results 
in an emf of 10° V! This isn’t just strong enough to light the bulb, 
it's sufficient to evaporate it, and kill you as well! 


It might seem as though having access to a region of rapidly 
changing magnetic field would therefore give us an infinite supply 
of free energy. However, the energy that lights the bulb is actually 
coming from the mechanical work you do by running through the 
field. A tremendous force would be required to make the wire loop 
move through the neutron star’s field at any significant speed. 


Speed and power in a generator example 21 
> Figure k shows three graphs of the magnetic flux through a 
generator’s coils as a function of time. In graph 2, the generator 
is being cranked at twice the frequency. In 3, a permanent mag- 
net with double the strength has been used. In 4, the generator 
is being cranked in the opposite direction. Compare the power 
generated in figures 2-4 with the the original case, 1. 


> If the flux varies as ® = Asin wf, then the time derivative occur- 
ring in Faraday’s law is 00 /0t = Awcos wt. The absolute value 
of this is the same as the absolute value of the emf, Tg. The 
current through the lightbulb is proportional to this emf, and the 
power dissipated depends on the square of the current (P = /*R), 
so P x A®w*. Figures 2 and 3 both give four times the output 
power (and require four times the input power). Figure 4 gives 
the same result as figure 1; we can think of this as a negative 
amplitude, which gives the same result when squared. 


An approximate loop rule example 22 
Figure I/1 shows a simple RL circuit of the type discussed in the 
last chapter. A current has already been established in the coil, 
let's say by a battery. The battery was then unclipped from the 
coil, and we now see the circuit as the magnetic field in and 
around the inductor is beginning to collapse. I’ve already cau- 
tioned you that the loop rule doesn’t apply in nonstatic situations, 
so we can’t assume that the readings on the four voltmeters add 
up to zero. The interesting thing is that although they don’t add 
up to exactly zero in this circuit, they very nearly do. Why is the 
loop rule even approximately valid in this situation? 


The reason is that the voltmeters are measuring the emf Ic around 
the path shown in figure I/2, and the stray field of the solenoid is 
extremely weak out there. In the region where the meters are, 
the arrows representing the magnetic field would be too small to 
allow me to draw them to scale, so | have simply omitted them. 
Since the field is so weak in this region, the flux through the loop 
is nearly zero, and the rate of change of the flux, 0Dg/0t, is also 
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nearly zero. By Faraday’s law, then, the emf around this loop is 
nearly zero. 


Now consider figure |/3. The flux through the interior of this path 
is not zero, because the strong part of the field passes through 
it, and not just once but many times. To visualize this, imagine 
that we make a wire frame in this shape, dip it in a tank of soapy 
water, and pull it out, so that there is a soap-bubble film span- 
ning its interior. Faraday’s law refers to the rate of change of the 
flux through a surface such as this one. (The soap film tends 
to assume a certain special shape which results in the minimum 
possible surface area, but Faraday’s law would be true for any 
surface that filled in the loop.) In the coiled part of the wire, the 
soap makes a three-dimensional screw shape, like the shape you 
would get if you took the steps of a spiral staircase and smoothed 
them into a ramp. The loop rule is going to be strongly violated 
for this path. 


We can interpret this as follows. Since the wire in the solenoid 
has a very low resistance compared to the resistances of the light 
bulbs, we can expect that the electric field along the corkscrew 
part of loop 1/3 will be very small. As an electron passes through 
the coil, the work done on it is therefore essentially zero, and the 
true emf along the coil is zero. In figure I/1, the meter on top is 
therefore not telling us the actual emf experienced by an electron 
that passes through the coil. It is telling us the emf experienced 
by an electron that passes through the meter itself, which is a 
different quantity entirely. The other three meters, however, really 
do tell us the emf through the bulbs, since there are no magnetic 
fields where they are, and therefore no funny induction effects. 
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11.6 Maxwell’s Equations 


11.6.1 Induced magnetic fields 


We have almost, but not quite, done figuring out the complete 
set of physical laws, called Maxwell’s equations, governing electricity 
and magnetism. We are only missing one more term. For clarity, 
T’ll state Maxwell’s equations with the missing part included, and 
then discuss the physical motivation and experimental evidence for 
sticking it in: 


Maxwell’s equations 
For any closed surface, the fluxes through the surface are 


On = 4tkdin and 
Op =0 


For any surface that is not closed, the circulations around the edges 
of the surface are given by 


O® 
Tg = aera and 
O® a/James Clerk Maxwell (1831- 
CT p = ae + ATKL prough c 1879) 


The ®p equation is Gauss’ law: charges make diverging electric 
fields. The corresponding equation for ®g tells us that magnetic / 
“charges” (monopoles) don’t exist, so magnetic fields never diverge. 
The third equation says that changing magnetic fields induce curly foroy 4 | 
electric fields, whose curliness we can measure using the emf, Ip, n 
around a closed loop. The final equation, for [g, is the only one 


where anything new has been added. Without the new time deriva- ae — ie 

tive term, this equation would simply be Ampére’s law. (I’ve chosen : 

to move the c? over to the left because it simplifies the writing, and b/ Where is the moving charge 
also because it more clearly demonstrates the analogous roles played responsible for this magnetic 


by charges and currents in the ®g and I'g equations.) field? 
This new 0®,/0t term says that just as a changing magnetic 

field can induce a curly electric field, a changing electric field can 

induce a curly magnetic field. Why should this be so? The following 

examples show that Maxwell’s equations would not make sense in 

general without it. 


Figure b shows a mysterious curly magnetic field. Magnetic fields 
are supposed to be made by moving charges, but there don’t seem 
to be any moving charges in this landscape. Where are they? One 
reasonable guess would be that they’re behind your head, where you 
can’t see them. Suppose there’s a positively charged particle about 
to hit you in the back of the head. This particle is like a current 
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c/ An 


Ampérian surface  su- 


perimposed on the landscape. 


d/An electron jumps through a 


hoop. 


e/An 
surface. 
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going into the page. We’re used to dealing with currents made by 
many charged particles, but logically we can’t have some minimum 
number that would qualify as a current. This is not a static current, 
however, because the current at a given point in space is not staying 
the same over time. If the particle is pointlike, then it takes zero 
time to pass any particular location, and the current is then infinite 
at that point in space. A moment later, when the particle is passing 
by some other location, there will be an infinite current there, and 
zero current in the previous location. If this single particle qualifies 
as a current, then it should be surrounded by a curly magnetic field, 
just like any other current.!! 


This explanation is simple and reasonable, but how do we know 
it’s correct? Well, it makes another prediction, which is that the 
positively charged particle should be making an electric field as well. 
Not only that, but if it’s headed for the back of your head, then 
it’s getting closer and closer, so the electric field should be getting 
stronger over time. But this is exactly what Maxwell’s equations 
require. There is no current [through piercing the Ampérian surface 
shown in figure c, so Maxwell’s equation for 'g becomes c?I'p = 
0®;/Ot. The only reason for an electric field to change is if there are 
charged particles making it, and those charged particles are moving. 
When charged particles are moving, they make magnetic fields as 
well. 


Note that the above example is also sufficient to prove the pos- 
itive sign of the 0®,/Ot term in Maxwell’s equations, which is dif- 
ferent from the negative sign of Faraday’s —O®,/0t term. 


The addition of the 0®;/Ot term has an even deeper and more 
important physical meaning. With the inclusion of this term, Max- 
well’s equations can describe correctly the way in which disturbances 
in the electric and magnetic fields ripple outwards at the speed of 
light. Indeed, Maxwell was the first human to understand that light 
was in fact an electromagnetic wave. Legend has it that it was on a 
starry night that he first realized this implication of his equations. 
He went for a walk with his wife, and told her she was the only other 
person in the world who really knew what starlight was. 


To see how the 0®,/Ot term relates to electromagnetic waves, 
let’s look at an example where we would get nonsense without it. 
Figure d shows an electron that sits just on one side of an imagi- 
nary Ampérian surface, and then hops through it at some randomly 


“4One way to prove this rigorously is that in a frame of reference where the 
particle is at rest, it has an electric field that surrounds it on all sides. If 
the particle has been moving with constant velocity for a long time, then this 
is just an ordinary Coulomb’s-law field, extending off to very large distances, 
since disturbances in the field ripple outward at the speed of light. In a frame 
where the particle is moving, this pure electric field is experienced instead as a 
combination of an electric field and a magnetic field, so the magnetic field must 
exist throughout the same vast region of space. 
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chosen moment. Unadorned with the 0®,/0t term, Maxwell’s equa- 
tion for pg reads as CT p = AtkIthrough, Which is Ampére’s law. If 
the electron is a pointlike particle, then we have an infinite current 
TIihrough at the moment when it pierces the imaginary surface, and 
zero current at all other times. An infinite magnetic circulation Ig 
can only be produced by an infinite magnetic field, so without the 
O® 7 /Ot term, Maxwell’s equations predict nonsense: the edge of the 
surface would experience an infinite magnetic field at one instant, 
and zero magnetic field at all other times. Even if the infinity didn’t 
upset us, it doesn’t make sense that anything special would happen 
at the moment the electron passed through the surface, because the 
surface is an imaginary mathematical construct. We could just as 
well have chosen the curved surface shown in figure e, which the 
electron never crosses at all. We are already clearly getting non- 
sensical results by omitting the 0®,/Ot term, and this shouldn’t 
surprise us because Ampére’s law only applies to statics. More to 
the point, Ampére’s law doesn’t have time in it, so it predicts that 
this effect is instantaneous. According to Ampére’s law, we could 
send Morse code signals by wiggling the electron back and forth, and 
these signals would be received at distant locations instantly, with- 
out any time delay at all. This contradicts the theory of relativity, 
one of whose predictions is that information cannot be transmitted 
at speeds greater than the speed of light. 


Discussion Questions 


A __ Induced magnetic fields were introduced in the text via the imaginary 
landscape shown in figure b on page 649, and | argued that the magnetic 
field could have been produced by a positive charge coming from behind 
your head. This is a specific assumption about the number of charges 
(one), the direction of motion, and the sign of the charge. What are some 
other scenarios that could explain this field? 


11.6.2 Light waves 


We could indeed send signals using this scheme, and the signals 
would be a form of light. A radio transmitting antenna, for instance, 
is simply a device for whipping electrons back and forth at mega- 
hertz frequencies. Radio waves are just like visible light, but with a 
lower frequency. With the addition of the 0®¢/0t term, Maxwell’s 
equations are capable of describing electromagnetic waves. It would 
be possible to use Maxwell’s equations to calculate the pattern of the 
electric and magnetic fields rippling outward from a single electron 
that fidgets at irregular intervals, but let’s pick a simpler example 
to analyze. 


The simplest wave pattern is a sine wave like the one shown 
in figure f. Let’s assume a magnetic field of this form, and see 
what Maxwell’s equations tell us about it. If the wave is traveling 
through empty space, then there are no charges or currents present, 


f/ A magnetic field in the form of 
a sine wave. 


J 


Se 


J 


g/The wave pattern is curly. 
For example, the circulation 
around this reactangle is nonzero 
and counterclockwise. 
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The equation ® = 0 has already been verified for this type of 
wave pattern in example 35 on page 583. Even if you haven’t learned 
the techniques from that section, it should be visually plausible that 
this field pattern doesn’t diverge or converge on any particular point. 


Geometry of the electric and magnetic fields 


The equation c?I'g = 0®,/0t tells us that there can be no such 
thing as a purely magnetic wave. The wave pattern clearly does have 
a nonvanishing circulation around the edge of the surface suggested 
in figure g, so there must be an electric flux through the surface. 
This magnetic field pattern must be intertwined with an electric 
field pattern that fills the same space. There is also no way that the 
two sides of the equation could stay synchronized with each other 
unless the electric field pattern is also a sine wave, and one that 
has the same wavelength, frequency, and velocity. Since the electric 
field is making a flux through the indicated surface, it’s plausible 
that the electric field vectors lie in a plane perpendicular to that 
of the magnetic field vectors. The resulting geometry is shown in 
figure h. Further justification for this geometry is given later in this 
subsection. 


direction of motion of wave xo 


plane of 
vibration 
of electric 
field 


plane of vibration 
of magnetic field 
yr 


h/ The geometry of an electromagnetic wave. 


One feature of figure h that is easily justified is that the electric 
and magnetic fields are perpendicular not only to each other, but 
also to the direction of propagation of the wave. In other words, the 
vibration is sideways, like people in a stadium “doing the wave,” 
not lengthwise, like the accordion pattern in figure i. (In standard 
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wave terminology, we say that the wave is transverse, not longitudi- 
nal.) The wave pattern in figure i is impossible, because it diverges 
from the middle. For virtually any choice of Gaussian surface, the 
magnetic and electric fluxes would be nonzero, contradicting the 
equations ®g = 0 and ®g = 0. 


Polarization 


Two electromagnetic waves traveling in the same direction through 
space can differ by having their electric and magnetic fields in dif- 
ferent directions, a property of the wave called its polarization. 


The speed of light 


What is the velocity of the waves described by Maxwell’s equa- 
tions? Maxwell convinced himself that light was an electromagnetic 
wave partly because his equations predicted waves moving at the ve- 
locity of light, c. The only velocity that appears in the equations is 
c, so this is fairly plausible, although a real calculation is required in 
order to prove that the velocity of the waves isn’t something like 2c 
or c/a — or zero, which is also c multiplied by a constant! The fol- 
lowing discussion, leading up to a proof that electromagnetic waves 
travel at c, is meant to be understandable even if you’re reading this 
book out of order, and haven’t yet learned much about waves. As 
always with proofs in this book, the reason to read it isn’t to con- 
vince yourself that it’s true, but rather to build your intuition. The 
style will be visual. In all the following figures, the wave patterns 
are moving across the page (let’s say to the right), and it usually 
doesn’t matter whether you imagine them as representing the wave’s 
magnetic field or its electric field, because Maxwell’s equations in 
a vacuum have the same form for both fields. Whichever field we 
imagine the figures as representing, the other field is coming in and 
out of the page. 


The velocity of the waves is not zero. If the wave pattern was 
standing still in space, then the right sides of the I equations would 
be zero, because there would be no change in the field over time 
at a particular point. But the left sides are not zero, so this is 
impossible.!? 


The velocity of the waves is a fixed number for a given wave pat- 
tern. Consider a typical sinusoidal wave of visible light, with a dis- 


ven if the fields can’t be parallel to the direction of propagation, one might 
wonder whether they could form some angle other than 90 degrees with it. No. 
One proof is given on page 657. A alternative argument, which is simpler but 
more esoteric, is that if there was such a pattern, then there would be some other 
frame of reference in which it would look like figure i. 

13 A young Einstein worried about what would happen if you rode a motorcycle 
alongside a light wave, traveling at the speed of light. Would the light wave have 
a zero velocity in this frame of reference? The only solution lies in the theory of 
relativity, one of whose consequences is that a material object like a student or 
a motorcycle cannot move at the speed of light. 
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tance of half a micrometer from one peak to the next peak. Suppose 
this wave pattern provides a valid solution to Maxwell’s equations 
when it is moving with a certain velocity. We then know, for in- 
stance, that there cannot be a valid solution to Maxwell’s equations 
in which the same wave pattern moves with double that velocity. 
The time derivatives on the right sides of Maxwell’s equations for 
Tg and Ig would be twice as big, since an observer at a certain 
point in space would see the wave pattern sweeping past at twice 
the rate. But the left sides would be the same, so the equations 
wouldn’t equate. 


The velocity is the same for all wave patterns. In other words, 
it isn’t 0.878c for one wave pattern, and 1.067c for some other pat- 
tern. This is surprising, since, for example, water waves with dif- 
ferent shapes do travel at different speeds. Similarly, even though 
we speak of “the speed of sound,” sound waves do travel at slightly 
different speeds depending on their pitch and loudness, although the 
differences are small unless you’re talking about cannon blasts or ex- 
tremely high frequency ultrasound. To see how Maxwell’s equations 
give a consistent velocity, consider figure j. Along the right and left 
edges of the same Ampérian surface, the more compressed wave pat- 
tern of blue light has twice as strong a field, so the circulations on 
the left sides of Maxwell’s equations are twice as large.'4 To satisfy 
Maxwell’s equations, the time derivatives of the fields must also be 
twice as large for the blue light. But this is true only if the blue 
light’s wave pattern is moving to the right at the same speed as the 
red light’s: if the blue light pattern is sweeping over an observer 
with a given velocity, then the time between peaks is half as much, 
like the clicking of the wheels on a train whose cars are half the 
length.!° 


We can also check that bright and dim light, as shown in figure 
k, have the same velocity. If you haven’t yet learned much about 
waves, then this might be surprising. A material object with more 
energy goes faster, but that’s not the case for waves. The circulation 
around the edge of the Ampérian surface shown in the figure is twice 
as strong for the light whose fields are doubled in strength, so the 
left sides of Maxwell’s [ equations are doubled. The right sides are 
also doubled, because the derivative of twice a function is twice the 
derivative of the original function. Thus if dim light moving with 
a particular velocity is a solution, then so is bright light, provided 
that it has the same velocity. 


We can now see that all sinusoidal waves have the same velocity. 


4 Actually, this is only exactly true of the rectangular strip is made infinitesi- 
mally thin. 

Vou may know already that different colors of light have different speeds 
when they pass through a material substance, such as the glass or water. This 
is not in contradiction with what I’m saying here, since this whole analysis is for 
light in a vacuum. 
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What about nonsinusoidal waves like the one in figure 1? There 
is a mathematical theorem, due to Fourier, that says any function 
can be made by adding together sinusoidal functions. For instance, 
3 sin x—7 cos 3x can be made by adding together the functions 3 sin x 
and —7cos3z, but Fourier proved that this can be done even for 
functions, like figure 1, that aren’t obviously built out of sines and 
cosines in the first place. Therefore our proof that sinusoidal waves 
all have the same velocity is sufficient to demonstrate that other 
waves also have this same velocity. 


We’re now ready to prove that this universal speed for all electro- 
magnetic waves is indeed c. Since we’ve already convinced ourselves 
that all such waves travel at the same speed, it’s sufficient to find 
the velocity of one wave in particular. Let’s pick the wave whose 
fields have magnitudes 


E = Esin(x + vt) and 
B = Bsin(x + vt) : 


which is about as simple as we can get. The peak electric field 
of this wave has a strength E, and the peak magnetic field is B. 
The sine functions go through one complete cycle as x increases by 
27 = 6.28..., so the distance from one peak of this wave to the 
next — its wavelength — is 6.28...meters. This means that it is 
not a wave of visible light but rather a radio wave (its wavelength 
is on the same order of magnitude as the size of a radio antenna). 
That’s OK. What was glorious about Maxwell’s work was that it 
unified the whole electromagnetic spectrum. Light is simple. Radio 
waves aren’t fundamentally any different than light waves, x-rays, 
or gamma rays. !° 


The justification for putting «+ vt inside the sine functions is as 
follows. As the wave travels through space, the whole pattern just 
shifts over. The fields are zero at x = 0, t = 0, since the sine of 
zero is zero. This zero-point of the wave pattern shifts over as time 
goes by; at any time ¢ its location is given by x + vt = 0. After one 
second, the zero-point is located at x = —(1 s)v. The distance it 
travels in one second is therefore numerically equal to v, and this 
is exactly the concept of velocity: how far something goes per unit 
time. 


The wave has to satisfy Maxwell’s equations for Tg and Ig 
regardless of what Ampérian surfaces we pick, and by applying them 
to any surface, we could determine the speed of the wave. The 
surface shown in figure m turns out to result in an easy calculation: 
a narrow strip of width 2@ and height h, coinciding with the position 
of the zero-point of the field at t = 0. 


16What makes them appear to be unrelated phenomena is that we experience 
them through their interaction with atoms, and atoms are complicated, so they 
respond to various kinds of electromagnetic waves in complicated ways. 


m/The magnetic field of the 
wave. The electric field, not 
shown, is perpendicular to the 
page. 


Section 11.6 Maxwell’s Equations 655 


656 


Chapter 11 


Now let’s apply the equation c?I'g = 0®,/dt at t = 0. Since 
the strip is narrow, we can approximate the magnetic field using 
sin x & x, which is valid for small x. The magnetic field on the right 
edge of the strip, at z = @, is then BZ, so the right edge of the strip 
contributes Bh to the circulation. The left edge contributes the 
same amount, so the left side of Maxwell’s equation is 


CTp=c’- 2BCh 


The other side of the equation is 


Ion A 
Ot a PA) 
OE 

= 2th, 


where we can dispense with the usual sum because the strip is nar- 
row and there is no variation in the field as we go up and down 
the strip. The derivative equals vE cos(a + vt), and evaluating the 
cosine at x = 0, t = 0 gives 


OO: QvEth 
Ot 


Maxwell’s equation for 'g therefore results in 


22 Blh = 2EChv 


e?B=vE 
An application of [zg = —0®,/0t gives a similar result, except 
that there is no factor of c? 
E=vB 


(The minus sign simply represents the right-handed relationship of 
the fields relative to their direction of propagation.) 


Multiplying these last two equations by each other, we get 


CBE =vEB 
C = ye 
VU==XC 


This is the desired result. (The plus or minus sign shows that the 
wave can travel in either direction.) 


As a byproduct of this calculation, we can find the relationship 
between the strengths of the electric and magnetic fields in an elec- 
tromagnetic wave. If, instead of multiplying the equations c?B = vE 
and E = vB, we divide them, we can easily show that E = cB. 
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Figure n shows the complete spectrum of light waves. The wave- 
length A (number of meters per cycle) and frequency f (number of 
cycles per second) are related by the equation c = fr. Maxwell’s 
equations predict that all light waves have the same structure, re- 
gardless of wavelength and frequency, so even though radio and x- 
rays, for example, hadn’t been discovered, Maxwell predicted that 
such waves would have to exist. Maxwell’s 1865 prediction passed 
an important test in 1888, when Heinrich Hertz published the re- 
sults of experiments in which he showed that radio waves could be 
manipulated in the same ways as visible light waves. Hertz showed, 
for example, that radio waves could be reflected from a flat surface, 
and that the directions of the reflected and incoming waves were 
related in the same way as with light waves, forming equal angles 
with the surface. Likewise, light waves can be focused with a curved, 
dish-shaped mirror, and Hertz demonstrated the same thing with a 
dish-shaped radio antenna. 


Momentum of light waves 


A light wave consists of electric and magnetic fields, and fields 
contain energy. Thus a light wave carries energy with it when it trav- 
els from one place to another. If a material object has kinetic energy 
and moves from one place to another, it must also have momentum, 
so it is logical to ask whether light waves have momentum as well. 
It can be proved based on relativity that it does, and that the mo- 
mentum and energy are related by the equation U = p/c, where p 
is the magnitude of the momentum vector, and U = U. + Um is 
the sum of the energy of the electric and magnetic fields. We can 
now demonstrate this without explicitly referring to relativity, and 
connect it to the specific structure of a light wave. 


The energy density of a light wave is related to the magnitudes 
of the fields in a specific way — it depends on the squares of their 
magnitudes, E? and B?, which are the same as the dot products 
E-E and B- B. We argued on page 536 that since energy is a 
scalar, the only possible expressions for the energy densities of the 
fields are dot products like these, multiplied by some constants. This 
is because the dot product is the only mathematically sensible way 
of multiplying two vectors to get a scalar result. (Any other way 
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violates the symmetry of space itself.) 


How does this relate to momentum? Well, we know that if we 
double the strengths of the fields in a light beam, it will have four 
times the energy, because the energy depends on the square of the 
fields. But we then know that this quadruple-energy light beam 
must have quadruple the momentum as well. If there wasn’t this 
kind of consistency between the momentum and the energy, then we 
could violate conservation of momentum by combining light beams 
or splitting them up. We therefore know that the momentum den- 
sity of a light beam must depend on a field multiplied by a field. 
Momentum, however, is a vector, and there is only one physically 
meaningful way of multiplying two vectors to get a vector result, 
which is the cross product (see page 857). The momentum density 
can therefore only depend on the cross products E x E, B x B, and 
E x B. But the first two of these are zero, since the cross product 
vanishes when there is a zero angle between the vectors. Thus the 
momentum per unit volume must equal E x B multiplied by some 
constant, 

dp = (constant)E x Bdv 


This predicts something specific about the direction of propagation 
of a light wave: it must be along the line perpendicular to the electric 
and magnetic fields. We’ve already seen that this is correct, and also 
that the electric and magnetic fields are perpendicular to each other. 
Therefore this cross product has a magnitude 


|E x B| = |E||B] sin 90° 
= |B||B 
E/? 
aol | = ¢|B|? 
Cc 


where in the last step the relation |E| = c|B| has been used. 


We now only need to find one physical example in order to fix the 
constant of proportionality. Indeed, if we didn’t know relativity, it 
would be possible to believe that the constant of proportionality was 
zero! The simplest example of which I know is as follows. Suppose a 
piece of wire of length @ is bathed in electromagnetic waves coming 
in sideways, and let’s say for convenience that this is a radio wave, 
with a wavelength that is large compared to ¢, so that the fields 
don’t change significantly across the length of the wire. Let’s say the 
electric field of the wave happens to be aligned with the wire. Then 
there is an emf between the ends of the wire which equals EF’, and 
since the wire is small compared to the wavelength, we can pretend 
that the field is uniform, not curly, in which case voltage is a well- 
defined concept, and this is equivalent to a voltage difference AV = 
Eé between the ends of the wire. The wire obeys Ohm’s law, and 
a current flows in response to the wave.!7 Equating the expressions 


'7This current will soon come to a grinding halt, because we don’t have a 
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dU/dt and [AV for the power dissipated by ohmic heating, we have 


dU = [Edt 


for the energy the wave transfers to the wire in a time interval dt. 


Note that although some electrons have been set in motion in the 
wire, we haven’t yet seen any momentum transfer, since the protons 
are experiencing the same amount of electric force in the opposite 
direction. However, the electromagnetic wave also has a magnetic 
field, and a magnetic field transfers momentum to (exerts a force 
on) acurrent. This is only a force on the electrons, because they’re 
what make the current. The magnitude of this force equals (JB 
(homework problem 6), and using the definition of force, dp/dt, we 
find for the magnitude of the momentum transferred: 


dp = €IBat 


We now know both the amount of energy and the amount of 
momentum that the wave has lost by interacting with the wire. 
Dividing these two equations, we find 


dp _8B 

dU £E 
<a 
=- 


which is what we expected based on relativity. This can now be 
restated in the form dp = (constant)E x Bdv (homework problem 
40). 


Note that although the equations p = U/c and dp = (constant)Ex 


Bdv are consistent with each other for a sine wave, they are not con- 
sistent with each other in general. The relativistic argument leading 
up to p = U/c assumed that we were only talking about a single 
thing traveling in a single direction, whereas no such assumption 
was made in arguing for the E x B form. For instance, if two light 
beams of equal strength are traveling through one another, going in 
opposite directions, their total momentum is zero, which is consis- 
tent with the E x B form, but not with U/c. 


Some examples were given in chapter 3 of situations where it 
actually matters that light has momentum. 
Angular momentum of light waves 


For completeness, we note that since light carries momentum, it 
must also be possible for it to have angular momentum. If you’ve 
studied chemistry, here’s an example of why this can be important. 


complete circuit, but let’s say we’re talking about the first picosecond during 
which the radio wave encounters the wire. This is why real radio antennas are 
not very short compared to a wavelength! 


0/A_ classical calculation of 
the momentum of a light wave. 
An antenna of length @ is bathed 
in an electromagnetic wave. 
The black arrows represent the 
electric field, the white circles the 
magnetic field coming out of the 
page. The wave is traveling to the 
right. 
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You know that electrons in atoms can exist in states labeled s, p, 
d, f, and so on. What you might not have realized is that these 
are angular momentum labels. The s state, for example, has zero 
angular momentum. If light didn’t have angular momentum, then, 
for example, it wouldn’t be possible for a hydrogen atom in a p state 
to change to the lower-energy s state by emitting light. Conservation 
of angular momentum requires that the light wave carry away all 
the angular momentum originally possessed by the electron in the 
p state, since in the s state it has none. 


660 Chapter 11 Electromagnetism 


11.7 Electromagnetic Properties of Materials 


Different types of matter have a variety of useful electrical and mag- 
netic properties. Some are conductors, and some are insulators. 
Some, like iron and nickel, can be magnetized, while others have 
useful electrical properties, e.g., dielectrics, discussed qualitatively 
in the discussion question on page 546, which allow us to make 
capacitors with much higher values of capacitance than would oth- 
erwise be possible. We need to organize our knowledge about the 
properties that materials can possess, and see whether this knowl- 
edge allows us to calculate anything useful with Maxwell’s equations. 


11.7.1 Conductors 


A perfect conductor, such as a superconductor, has no DC elec- 
trical resistance. It is not possible to have a static electric field 
inside it, because then charges would move in response to that field, 
and the motion of the charges would tend to reduce the field, con- 
trary to the assumption that the field was static. Things are a little 
different at the surface of a perfect conductor than on the interior. 
We expect that any net charges that exist on the conductor will 
spread out under the influence of their mutual repulsion, and settle 
on the surface. As we saw in chapter 10, Gauss’s law requires that 
the fields on the two sides of a sheet of charge have |E, ; — E, 9| 
proportional to the surface charge density, and since the field inside 
the conductor is zero, we infer that there can be a field on or im- 
mediately outside the conductor, with a nonvanishing component 
perpendicular to the surface. The component of the field parallel 
to the surface must vanish, however, since otherwise it would cause 
the charges to move along the surface. 


On a hot summer day, the reason the sun feels warm on your 
skin is that the oscillating fields of the light waves excite currents 
in your skin, and these currents dissipate energy by ohmic heating. 
In a perfect conductor, however, this could never happen, because 
there is no such thing as ohmic heating. Since electric fields can’t 
penetrate a perfect conductor, we also know that an electromag- 
netic wave can never pass into one. By conservation of energy, we 
know that the wave can’t just vanish, and if the energy can’t be 
dissipated as heat, then the only remaining possibility is that all of 
the wave’s energy is reflected. This is why metals, which are good 
electrical conductors, are also highly reflective. They are not perfect 
electrical conductors, however, so they are not perfectly reflective. 
The wave enters the conductor, but immediately excites oscillating 
currents, and these oscillating currents dissipate the energy both by 
ohmic heating and by reradiating the reflected wave. Since the parts 
of Maxwell’s equations describing radiation have time derivatives in 
them, the efficiency of this reradiation process depends strongly on 
frequency. When the frequency is high and the material is a good 
conductor, reflection predominates, and is so efficient that the wave 
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only penetrates to a very small depth, called the skin depth. In 
the limit of poor conduction and low frequencies, absorption pre- 
dominates, and the skin depth becomes much greater. In a high- 
frequency AC circuit, the skin depth in a copper wire is very small, 
and therefore the signals in such a circuit are propagated entirely at 
the surfaces of the wires. In the limit of low frequencies, i.e., DC, 
the skin depth approaches infinity, so currents are carried uniformly 
over the wires’ cross-sections. 


We can quantify how well a particular material conducts elec- 
tricity. We know that the resistance of a wire is proportional to its 
length, and inversely proportional to its cross-sectional area. The 
constant of proportionality is 1/o, where o (not the same a as the 
surface charge density) is called the electrical conductivity. Exposed 
to an electric field E, a conductor responds with a current per unit 
cross-sectional area J = oE. The skin depth is proportional to 
1/\/fo, where f is the frequency of the wave. 


—_— 


11.7.2. Dielectrics 


A material with a very low conductivity is an insulator. Such 
materials are usually composed of atoms or molecules whose elec- 
trons are strongly bound to them; since the atoms or molecules 
have zero total charge, their motion cannot create an electric cur- 
= rent. But even though they have zero charge, they may not have 
88 zero dipole moment. Imagine such a substance filling in the space 
a8 between the plates of a capacitor, as in figure a. For simplicity, 
a8 we assume that the molecules are oriented randomly at first, a/1, 
and then become completely aligned when a field is applied, a/2. 
The effect has been to take all of the negatively charged black ands 
3 of the molecules and shift them upward, and the opposite for the 
++ _+_+ + + + + + + + + positively charged white ends. Where the black and white charges 

overlap, there is still zero net charge, but we have a strip of nega- 

tive charge at the top, and a strip of positive charge at the bottom, 

—_ a/3. The effect has been to cancel out part of the charge that was 
| deposited on the plates of the capacitor. Now this is very subtle, be- 

cause Maxwell’s equations treat these charges on an equal basis, but 
in terms of practical measurements, they are completely different. 
The charge on the plates can be measured be inserting an ammeter 
in the circuit, and integrating the current over time. But the charges 
in the layers at the top and bottom of the dielectric never flowed 
through any wires, and cannot be detected by an ammeter. In other 
words, the total charge, g, appearing in Maxwell’s equartions is ac- 
tualy ¢ = free — Gbound, Where Gfree is the charge that moves freely 
through wires, and can be detected in an ammeter, while gpounq is 
the charge bound onto the individual molecules, which can’t. We 
will, however, detect the presence of the bound charges via their 
electric fields. Since their electric fields partially cancel the fields of 
the free charges, a voltmeter will register a smaller than expected 
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a/A capacitor with a_ dielec- 
tric between the plates. 
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voltage difference between the plates. If we measure dfree/V, we have 
a result that is larger than the capacitance we would have expected. 


Although the relationship E «© q between electric fields and their 
sources is unalterably locked in by Gauss’s law, that’s not what we 
see in practical measurements. In this example, we can measure the 
voltage difference between the plates of the capacitor and divide 
by the distance between them to find E, and then integrate an 
ammeter reading to find dfree, and we will find that Gauss’s law 
appears not to hold. We have E © qfree/(constant), where the 
constant fudge factor is greater than one. This constant is a property 
of the dielectric material, and tells us how many dipoles there are, 
how strong they are, and how easily they can be reoriented. The 
conventional notation is to incorporate this fudge factor into Gauss’s 
law by defining an altered version of the electric field, 


D=cE ; 
and to rewrite Gauss’s law as 
Op = din, free 


The constant € is a property of the material, known as its permittiv- 
ity. In a vacuum, € takes on a value known as €,, defined as 1/(47k). 
In a dielectric, € is greater than €,. When a dielectric is present be- 
tween the plates of a capacitor, its capacitance is proportional to € 
(problem 38). The following table gives some sample values of the 
permittivities of a few substances. 


substance €/€ at zero frequency 
vacuum 1 

air 1.00054 

water 80 


barium titanate 1250 


A capacitor with a very high capacitance is potentially a superior 
replacement for a battery, but until the 1990’s this was impracti- 
cal because capacitors with high enough values couldn’t be made, 
even with dielectrics having the largest known permittivities. Such 
supercapactors, some with values in the kilofarad range, are now 
available. Most of them do not use dielectric at all; the very high 
capacitance values are instead obtained by using electrodes that are 
not parallel metal plates at all, but exotic materials such as aerogels, 
which allows the spacing between the “electrodes” to be very small. 


Although figure a/2 shows the dipoles in the dielectric being 
completely aligned, this is not a situation commonly encountered 
in practice. In such a situation, the material would be as polarized 
as it could possibly be, and if the field was increased further, it 
would not respond. In reality, a capacitor, for example, would nor- 
mally be operated with fields that produced quite a small amount 
of alignment, and it would be under these conditions that the linear 


b/A stud finder is used to 
locate the wooden beams, or 
studs, that form the frame behind 
the wallboard. It is a capacitor 
whose’ capacitance changes 
when it is brought close to a 
substance with a_ particular 
permittivity. Although the wall 
is external to the capacitor, a 
change in capacitance is still 
observed, because the capacitor 
has “fringing fields” that extend 
outside the region between its 
plates. 
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magnetic 
figure a. A magnetically perme- 
able material is placed at the 
center of a solenoid. 


N amt i 


d/Example 23: 
view of a solenoid. 


a cutaway 
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relationship D = €E would actually be a good approximation. Be- 
fore a material’s maximum polarization is reached, it may actually 
spark or burn up. 


self-check | 

Suppose a parallel-plate capacitor is built so that a slab of dielectric 
material can be slid in or out. (This is similar to the way the stud finder 
in figure b works.) We insert the dielectric, hook the capacitor up to 
a battery to charge it, and then use an ammeter and a voltmeter to 
observe what happens when the dielectric is withdrawn. Predict the 
changes observed on the meters, and correlate them with the expected 
change in capacitance. Discuss the energy transformations involved, 
and determine whether positive or negative work is done in removing 
the dielectric. > Answer, p. 872 


11.7.3 Magnetic materials 


Atoms and molecules may have magnetic dipole moments as 
well as electric dipole moments. Just as an electric dipole contains 
bound charges, a magnetic dipole has bound currents, which come 
from the motion of the electrons as they orbit the nucleus, c/1. Such 
a substance, subjected to a magnetic field, tends to align itself, c/2, 
so that a sheet of current circulates around the externally applied 
field. Figure c/3 is closely analogous to figure a/3; in the central 
gray area, the atomic currents cancel out, but the atoms at the 
outer surface form a sheet of bound current. However, whereas like 
charges repel and opposite charges attract, it works the other way 
around for currents: currents in the same direction attract, and 
currents in opposite directions repel. Therefore the bound currents 
in a material inserted inside a solenoid tend to reinforce the free 
currents, and the result is to strengthen the field. The total current 
is I = Igee+Jpound, and we define an altered version of the magnetic 
field, 


B 
H=— , 
Lb 


and rewrite Ampére’s law as 


[y= Ithrough, free 


The constant jz is the permeability, with a vacuum value of wo = 


4rk/c?. Here are the magnetic permeabilities of some substances: 
substance [/ Lo 
vacuum 1 
aluminum 1.00002 
steel 700 
transformer iron 4,000 
mu-metal 20,000 
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An iron-core electromagnet example 23 
> A solenoid has 1000 turns of wire wound along a cylindrical 
core with a length of 10 cm. If a current of 1.0 A is used, find the 
magnetic field inside the solenoid if the core is air, and if the core 
is made of iron with u/U, = 4,000. 


> Air has essentially the same permability as vacuum, so using 
the result of example 13 on page 628, we find that the field is 
0.013 T. 


We now consider the case where the core is filled with iron. The 
original derivation in example 13 started from Ampére’s law, which 
we now rewrite as Ty = |through, free. AS argued previously, the only 
significant contributions to the circulation come from line segment 
AB. This segment lies inside the iron, where H = B/u. The H 
field is the same as in the air-core case, since the new form of 
Ampere’s law only relates H to the current in the wires (the free 
current). This means that B = pH is greater by a factor of 4,000 
than in the air-core case, or 52 T. This is an extremely intense 
field — so intense, in fact, that the iron’s magnetic polarization 
would probably become saturated before we could actually get 
the field that high. 


The electromagnet of example 23 could also be used as an in- 
ductor, and its inductance would be proportional to the permittivity 
of the core. This makes it possible to construct high-value induc- 
tors that are relatively compact. Permeable cores are also used in 
transformers. 


A transformer or inductor with a permeable core does have 
some disadvantages, however, in certain applications. The oscil- 
lating magnetic field induces an electric field, and because the core 
is typically a metal, these currents dissipate energy strongly as heat. 
This behaves like a fairly large resistance in series with the coil. Fig- 
ure f shows a method for reducing this effect. The iron core of this 
transformer has been constructed out of laminated layers, which has 
the effect of blocking the conduction of the eddy currents. 


‘A ferrite bead example 24 
Cables designed to carry audio signals are typically made with 
two adjacent conductors, such that the current flowing out through 
one conductor comes back through the other one. Computer ca- 
bles are similar, but usually have several such pairs bundled in- 
side the insulator. This paired arrangement is known as differen- 
tial mode, and has the advantage of cutting down on the reception 
and transmission of interference. In terms of transmission, the 
magnetic field created by the outgoing current is almost exactly 
canceled by the field from the return current, so electromagnetic 
waves are only weakly induced. In reception, both conductors are 
bathed in the same electric and magnetic fields, so an emf that 
adds current on one side subtracts current from the other side, 


e/Example 23: without the 
iron core, the field is so weak that 
it barely deflects the compass. 
With it, the deflection is nearly 


90°. 


f/A transformer with a_ lami- 
nated iron core. The input and 
output coils are inside the paper 
wrapper. The iron core is the 
black part that passes through 
the coils at the center, and 
also wraps around them on the 
outside. 


g/Example 24: ferrite beads. 
The top panel shows a clip-on 
type, while the bottom shows one 
built into a cable. 
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resulting in cancellation. 


The opposite of differential mode is called common mode. In 
common mode, all conductors have currents flowing in the same 
direction. Even when a circuit is designed to operate in differential 
mode, it may not have exactly equal currents in the two conduc- 
tors with /, + 4 = 0, meaning that current is leaking off to ground at 
one end of the circuit or the other. Although paired cables are rel- 
atively immune to differential-mode interference, they do not have 
any automatic protection from common-mode interference. 


Figure g shows a device for reducing common-mode interference 
called a ferrite bead, which surrounds the cable like a bead on 
a string. Ferrite is a magnetically permeable alloy. In this appli- 
cation, the ohmic properties of the ferrite actually turn put to be 
advantageous. 


Let’s consider common-mode transmission of interference. The 
bare cable has some DC resistance, but is also surrounded by a 
magnetic field, so it has inductance as well. This means that it 
behaves like a series L-R circuit, with an impedance that varies 
as R+iwL, where both R and L are very small. When we add the 
ferrite bead, the inductance is increased by orders of magnitude, 
but so is the resistance. Neither R nor L is actually constant with 
respect to frequency, but both are much greater than for the bare 
cable. 


Suppose, for example, that a signal is being transmitted from a 
digital camera to a computer via a USB cable. The camera has 
an internal impedance that is on the order of 10 OQ, the computer’s 
input also has a ~ 10 O impedance, and in differential mode the 
ferrite bead has no effect, so the cable’s impedance has its low, 
designed value (probably also about 10 ©, for good impedance 
matching). The signal is transmitted unattenuated from the cam- 
era to the computer, and there is almost no radiation from the 
cable. 


But in reality there will be a certain amount of common-mode 
current as well. With respect to common mode, the ferrite bead 
has a large impedance, with the exact value depending on fre- 
quency, but typically on the order of 100 © for frequencies in 
the MHz range. We now have a series circuit consisting of three 
impedances: 10, 100, and 10 ©. For a given emf applied by an 
external radio wave, the current induced in the circuit has been 
attenuated by an order of magnitude, relative to its value without 
the ferrite bead. 


Why is the ferrite is necessary at all? Why not just insert ordinary 
air-core inductors in the circuit? We could, for example, have two 
solenoidal coils, one in the outgoing line and one in the return line, 
interwound with one another with their windings oriented so that 
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their differential-mode fields would cancel. There are two good 
reasons to prefer the ferrite bead design. One is that it allows 
a clip-on device like the one in the top panel of figure g, which 
can be added without breaking the circuit. The other is that our 
circuit will inevitably have some stray capacitance, and will there- 
fore act like an LRC circuit, with a resonance at some frequency. 
At frequencies close to the resonant frequency, the circuit would 
absorb and transmit common-mode interference very strongly, 
which is exactly the opposite of the effect we were hoping to 
produce. The resonance peak could be made low and broad by 
adding resistance in series, but this extra resistance would atten- 
uate the differential-mode signals as well as the common-mode 
ones. The ferrite’s resistance, however, is actually a purely mag- 
netic effect, so it vanishes in differential mode. 


Surprisingly, some materials have magnetic permeabilities less 
than to. This cannot be accounted for in the model above, and 
although there are semiclassical arguments that can explain it to 
some extent, it is fundamentally a quantum mechanical effect. Ma- 
terials with 4 > po are called paramagnetic, while those with u < po 
are referred to as diamagnetic. Diamagnetism is generally a much 
weaker effect than paramagnetism, and is easily masked if there is 
any trace of contamination from a paramagnetic material. Diamag- 
netic materials have the interesting property that they are repelled 
from regions of strong magnetic field, and it is therefore possible to 
levitate a diamagnetic object above a magnet, as in figure h. 


A complete statement of Maxwell’s equations in the presence of 
electric and magnetic materials is as follows: 


Op = free 
®p=0 
d®p 
fea 
= dt 
d® 
Ty = a So Lire 


Comparison with the vacuum case shows that the speed of an 
electromagnetic wave moving through a substance described by per- 
mittivity and permeability € and p is 1/,/ep. For most substances, 
[L & Mo, and € is highly frequency-dependent. 


Suppose we have a boundary between two substances. By con- 
structing a Gaussian or Ampérian surface that extends across the 
boundary, we can arrive at various constraints on how the fields 
must behave as me move from one substance into the other, when 
there are no free currents or charges present, and the fields are 
static. An interesting example is the application of Faraday’s law, 
ly = 0, to the case where one medium — let’s say it’s air — has 


h/A frog is levitated diamagnet- 
ically inside a powerful magnet. 
Evidently frog has pt < po. 


i/At a 
two substances with ps > uy, the 
H field has a continuous com- 
ponent parallel to the surface, 
which implies a discontinuity in 
the parallel component of the 
magnetic field B. 


boundary between 
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a low permeability, while the other one has a very high one. We 
will violate Faraday’s law unless the component of the H field par- 
allel to the boundary is a continuous function, Hj; = Hy 2. This 
means that if 4/~> is very high, the component of B = wH par- 
allel to the surface will have an abrupt discontinuity, being much 
stronger inside the high-permeability material. The result is that 
when a magnetic field enters a high-permeability material, it tends 
to twist abruptly to one side, and the pattern of the field tends to 
be channeled through the material like water through a funnel. In 
a transformer, a permeable core functions to channel more of the 
magnetic flux from the input coil to the output coil. Figure j shows 
another example, in which the effect is to shield the interior of the 
sphere from the externally imposed field. Special high-permeability 
alloys, with trade names like Mu-Metal, are sold for this purpose. 
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j/A hollow sphere with u/u. = 10, is immersed in a uniform, externally imposed magnetic field. The 
interior of the sphere is shielded from the field. The arrows map the magnetic field B. (See homework problem 
47, page 681.) 


The very last magnetic phenomenon we'll discuss is probably the 
very first experience you ever had of magnetism. Ferromagnetism is 
a phenomenon in which a material tends to organize itself so that it 
has a nonvanishing magnetic field. It is exhibited strongly by iron 
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and nickel, which explains the origin of the name. 


Figure k/1 is a simple one-dimensional model of ferromagnetism. 
Each magnetic compass needle represents an atom. The compasses 
in the chain are stable when aligned with one another, because each 
one’s north end is attracted to its neighbor’s south end. The chain 
can be turned around, k/2, without disrupting its organization, and 
the compasses do not realign themselves with the Earth’s field, be- 
cause their torques on one another are stronger than the Earth’s 
torques on them. The system has a memory. For example, if I want 
to remind myself that my friend’s address is 137 Coupling Ct., I can 
align the chain at an angle of 137 degrees. The model fails, how- 
ever, as an explanation of real ferromagnetism, because in two or 
more dimensions, the most stable arrangement of a set of interacting 
magnetic dipoles is something more like k/3, in which alternating 
rows point in opposite directions. In this two-dimensional pattern, 
every compass is aligned in the most stable way with all four of its 
neighbors. This shows that ferromagnetism, like diamagnetism, has 
no purely classical explanation; a full explanation requires quantum 
mechanics. 


Because ferromagnetic substances “remember” the history of 
how they were prepared, they are commonly used to store infor- 
mation in computers. Figure 1 shows 16 bits from an ancient (ca. 
1970) 4-kilobyte random-access memory, in which each doughnut- 
shaped iron “core” can be magnetized in one of two possible direc- 
tions, so that it stores one bit of information. Today, RAM is made 
of transistors rather than magnetic cores, but a remnant of the old 
technology remains in the term “core dump,” meaning “memory 
dump,” as in “my girlfriend gave me a total core dump about her 
mom’s divorce.” Most computer hard drives today do store their in- 
formation on rotating magnetic platters, but the platter technology 
may be obsoleted by flash memory in the near future. 


The memory property of ferromagnets can be depicted on the 
type of graph shown in figure m, known as a hysteresis curve. The 
y axis is the magnetization of a sample of the material — a mea- 
sure of the extent to which its atomic dipoles are aligned with one 
another. If the sample is initially unmagnetized, 1, and a field H 


k / A model of ferromagnetism. 


| / Magnetic core memory. 
Mi 5 3 4 
H 
6 
m/A hysteresis curve. 
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is externally applied, the magnetization increases, 2, but eventu- 
ally becomes saturated, 3, so that higher fields do not result in any 
further magnetization, 4. The external field can then be reduced, 
5, and even eliminated completely, but the material will retain its 
magnetization. It is a permanent magnet. To eliminate its magneti- 
zation completely, a substantial field must be applied in the opposite 
direction. If this reversed field is made stronger, then the substance 
will eventually become magnetized just as strongly in the opposite 
direction. Since the hysteresis curve is nonlinear, and is not a func- 
tion (it has more than one value of M for a particular value of B), a 
ferromagnetic material does not have a single, well-defined value of 
the permeability j; a value like 4,000 for transformer iron represents 
some kind of a rough average. 


‘The fluxgate compass example 25 
The fluxgate compass is a type of magnetic compass without 
moving parts, commonly used on ships and aircraft. An AC cur- 
rent is applied in a coil wound around a ferromagnetic core, driv- 
ing the core repeatedly around a hysteresis loop. Because the 
hysteresis curve is highly nonlinear, the addition of an external 
field such as the Earth’s alters the core’s behavior. Suppose, for 
example, that the axis of the coil is aligned with the magnetic 
north-south. The core will reach saturation more quickly when 
the coil’s field is in the same direction as the Earth’s, but will not 
saturate as early in the next half-cycle, when the two fields are 
in opposite directions. With the use of multiple coils, the com- 
ponents of the Earth’s field can be measured along two or three 
n/A fluxgate compass. axes, permitting the compass’s orientation to be determined in 
two or (for aircraft) three dimensions. 


This chapter is summarized on page 904. Notation and terminology 
are tabulated on pages 885-886. 


670 Chapter 11 Electromagnetism 


Problems 


The symbols Vv, “, etc. are explained on page 684. 


1 A particle with a charge of 1.0 C and a mass of 1.0 kg is 
observed moving past point P with a velocity (1.0 m/s)x. The 
electric field at point P is (1.0 V/m)y, and the magnetic field is 
(2.0 T)y. Find the force experienced by the particle. Vv 


2 For a positively charged particle moving through a magnetic 
field, the directions of the v, B, and F vectors are related by a 
right-hand rule: 


v_ along the fingers, with the hand flat 
B_ along the fingers, with the knuckles bent 
F along the thumb 


Make a three-dimensional model of the three vectors using pencils 
or rolled-up pieces of paper to represent the vectors assembled with 
their tails together. Make all three vectors perpendicular to each 
other. Now write down every possible way in which the rule could be 
rewritten by scrambling up the three symbols v, B, and F. Referring 
to your model, which are correct and which are incorrect? 


3 A charged particle is released from rest. We see it start to 
move, and as it gets going, we notice that its path starts to curve. 
Can we tell whether this region of space has E # 0, or B ¥ 0, or 
both? Assume that no other forces are present besides the possible 
electrical and magnetic ones, and that the fields, if they are present, 
are uniform. 


4 A charged particle is in a region of space in which there is a 
uniform magnetic field B = Bz. There is no electric field, and no 
other forces act on the particle. In each case, describe the future 
motion of the particle, given its initial velocity. 

(a) Vo = 0 

(b) vo = (1 m/s)z 

(c) Vo = (1 m/s)y 

5 (a) A line charge, with charge per unit length \, moves at 
velocity v along its own length. How much charge passes a given 
point in time dt? What is the resulting current? > Answer, p. 875 
(b) Show that the units of your answer in part a work out correctly. 


Remark: This constitutes a physical model of an electric current, and it would 
be a physically realistic model of a beam of particles moving in a vacuum, such 
as the electron beam in a television tube. It is not a physically realistic model 
of the motion of the electrons in a current-carrying wire, or of the ions in your 
nervous system; the motion of the charge carriers in these systems is much more 
complicated and chaotic, and there are charges of both signs, so that the total 
charge is zero. But even when the model is physically unrealistic, it still gives the 
right answers when you use it to compute magnetic effects. This is a remarkable 
fact, which we will not prove. The interested reader is referred to E.M. Purcell, 
Electricity and Magnetism, McGraw Hill, 1963. 
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6 Two parallel wires of length LD carry currents 1; and Ig. They 
are separated by a distance R, and we assume R is much less than 
L, so that our results for long, straight wires are accurate. The goal 
of this problem is to compute the magnetic forces acting between 
the wires. 

(a) Neither wire can make a force on itself. Therefore, our first step 
in computing wire 1’s force on wire 2 is to find the magnetic field 
made only by wire 1, in the space occupied by wire 2. Express this 
field in terms of the given quantities. v 
(b) Let’s model the current in wire 2 by pretending that there is 
a line charge inside it, possessing density per unit length A» and 
moving at velocity vg. Relate A2 and v2 to the current Jo, using the 
result of problem 5a. Now find the magnetic force wire 1 makes on 
wire 2, in terms of Jj, Io, L, and R. > Answer, p. 875 
(c) Show that the units of the answer to part b work out to be 
newtons. x 


7 Suppose a charged particle is moving through a region of 
space in which there is an electric field perpendicular to its velocity 
vector, and also a magnetic field perpendicular to both the particle’s 
velocity vector and the electric field. Show that there will be one 
particular velocity at which the particle can be moving that results 
in a total force of zero on it. Relate this velocity to the magnitudes 
of the electric and magnetic fields. (Such an arrangement, called a 
velocity filter, is one way of determining the speed of an unknown 
particle.) 


8 The following data give the results of two experiments in which 
charged particles were released from the same point in space, and 
the forces on them were measured: 
ga=1pC, vw=(lm/s)x, F, =(-l mN)y 
qg=—2uC, ve=(-lm/s)k, Fo=(-2 mN)y 
The data are insufficient to determine the magnetic field vector; 
demonstrate this by giving two different magnetic field vectors, both 
of which are consistent with the data. a 


9 The following data give the results of two experiments in which 
charged particles were released from the same point in space, and 
the forces on them were measured: 
qg=1nC, vwi=(1m/s)2, Fi = (5 pN)X+ (2 pN)y 
gg=1nC, ve=(3m/s)z, Fo = (10 pN)x+ (4 pN)y 


Is there a nonzero electric field at this point? A nonzero magnetic 
field? az 


10 This problem is a continuation of problem 6. Note that the 
answer to problem 6b is given on page 875. 

(a) Interchanging the 1’s and 2’s in the answer to problem 6b, what 
is the magnitude of the magnetic force from wire 2 acting on wire 
1? Is this consistent with Newton’s third law? 
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(b) Suppose the currents are in the same direction. Make a sketch, 
and use the right-hand rule to determine whether wire 1 pulls wire 
2 towards it, or pushes it away. 

(c) Apply the right-hand rule again to find the direction of wire 2’s 
force on wire 1. Does this agree with Newton’s third law? 

(d) What would happen if wire 1’s current was in the opposite di- 
rection compared to wire 2’s? & 


11 (a) In the photo of the vacuum tube apparatus in figure o on 
page 610, infer the direction of the magnetic field from the motion 
of the electron beam. (The answer is given in the answer to the 
self-check on that page.) 

(b) Based on your answer to part a, find the direction of the currents 
in the coils. 

(c) What direction are the electrons in the coils going? 

(d) Are the currents in the coils repelling the currents consisting of 
the beam inside the tube, or attracting them? Check your answer 
by comparing with the result of problem 10. 


12 A charged particle of mass m and charge q moves in a circle 
due to a uniform magnetic field of magnitude B, which points per- 
pendicular to the plane of the circle. 

(a) Assume the particle is positively charged. Make a sketch show- 
ing the direction of motion and the direction of the field, and show 
that the resulting force is in the right direction to produce circular 
motion. 

(b) Find the radius, r, of the circle, in terms of m, q, v, and B. V 
(c) Show that your result from part b has the right units. 

(d) Discuss all four variables occurring on the right-hand side of your 
answer from part b. Do they make sense? For instance, what should 
happen to the radius when the magnetic field is made stronger? 
Does your equation behave this way? 

(e) Restate your result so that it gives the particle’s angular fre- 
quency, w, in terms of the other variables, and show that v drops 
out. Vv 


Remark: A charged particle can be accelerated in a circular device called a 
cyclotron, in which a magnetic field is what keeps them from going off straight. 
This frequency is therefore known as the cyclotron frequency. The particles are 
accelerated by other forces (electric forces), which are AC. As long as the electric 
field is operated at the correct cyclotron frequency for the type of particles being 
manipulated, it will stay in sync with the particles, giving them a shove in the 
right direction each time they pass by. The particles are speeding up, so this 
only works because the cyclotron frequency is independent of velocity. 


13 Each figure represents the motion of a positively charged 
particle. The dots give the particles’ positions at equal time inter- 
vals. In each case, determine whether the motion was caused by 
an electric force, a magnetic force, or a frictional force, and explain 
your reasoning. If possible, determine the direction of the magnetic 
or electric field. All fields are uniform. In (a), the particle stops for 
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an instant at the upper right, but then comes back down and to the 
left, retracing the same dots. In (b), it stops on the upper right and 
stays there. a 


Problem 13. 


14 One model of the hydrogen atom has the electron circling 
around the proton at a speed of 2.2 x 10° m/s, in an orbit with a 
radius of 0.05 nm. (Although the electron and proton really orbit 
around their common center of mass, the center of mass is very close 
to the proton, since it is 2000 times more massive. For this problem, 
assume the proton is stationary.) 

(a) Treat the circling electron as a current loop, and calculate the 
current. 

(b) Estimate the magnetic field created at the center of the atom 
by the electron. Vv 
(c) Does the proton experience a nonzero force from the electron’s 
magnetic field? Explain. 

(d) Does the electron experience a magnetic field from the proton? 
Explain. 

(e) Does the electron experience a magnetic field created by its own 
current? Explain. 

(f) Is there an electric force acting between the proton and electron? 
If so, calculate it. Vv 
(g) Is there a gravitational force acting between the proton and elec- 
tron? If so, calculate it. 

(h) An inward force is required to keep the electron in its orbit — 
otherwise it would obey Newton’s first law and go straight, leaving 
the atom. Based on your answers to the previous parts, which force 
or forces (electric, magnetic and gravitational) contributes signifi- 
cantly to this inward force? (Based on a problem by Arnold Arons.) 


~ 


15 The equation B, = 8kI.A/c?r?® was found on page 620 for the 
distant field of a dipole. Show, as asserted there, that the constant 
6 must be unitless. 


16 The following data give the results of three experiments in 
which charged particles were released from the same point in space, 
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and the forces on them were measured: 
m=1C, vi=0, Py =(L Ny 
q=H=1C,, v= 7 m/s)x,. Fo=—(1 N 
gg=1C, vg=(1m/s)z, F3= 


Determine the electric and magnetic fields. vo 


17 If you put four times more current through a solenoid, how 
many times more energy is stored in its magnetic field? Vv 


18 A Helmholtz coil is defined as a pair of identical circular 
coils lying in parallel planes and separated by a distance, h, equal 
to their radius, b. (Each coil may have more than one turn of wire.) 
Current circulates in the same direction in each coil, so the fields 
tend to reinforce each other in the interior region. This configuration 
has the advantage of being fairly open, so that other apparatus can 
be easily placed inside and subjected to the field while remaining 
visible from the outside. The choice of h = 6 results in the most 
uniform possible field near the center. A photograph of a Helmholtz 
coil was shown in figure o on page 610. 
(a) Find the percentage drop in the field at the center of one coil, 
compared to the full strength at the center of the whole apparatus. 
Vv 


(b) What value of h (not equal to 6) would make this percentage 
difference equal to zero? vou 


19 The figure shows a nested pair of circular wire loops used 
to create magnetic fields. (The twisting of the leads is a practical 
trick for reducing the magnetic fields they contribute, so the fields 
are very nearly what we would expect for an ideal circular current 
loop.) The coordinate system below is to make it easier to discuss 
directions in space. One loop is in the y — z plane, the other in the 
x—y plane. Each of the loops has a radius of 1.0 cm, and carries 
1.0 A in the direction indicated by the arrow. 

(a) Calculate the magnetic field that would be produced by one such 
loop, at its center. Vv 
(b) Describe the direction of the magnetic field that would be pro- 
duced, at its center, by the loop in the x — y plane alone. 

(c) Do the same for the other loop. 

(d) Calculate the magnitude of the magnetic field produced by the 
two loops in combination, at their common center. Describe its 
direction. vo 
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20 Four long wires are arranged, as shown, so that their cross- 
section forms a square, with connections at the ends so that current 
flows through all four before exiting. Note that the current is to the 
right in the two back wires, but to the left in the front wires. If the 
dimensions of the cross-sectional square (height and front-to-back) 
are b, find the magnetic field (magnitude and direction) along the 
long central axis. Vv o 


21 = Inproblem 16, the three experiments gave enough information 
to determine both fields. Is it possible to design a procedure so that, 
using only two such experiments, we can always find E and B? If 
so, design it. If not, why not? ns 


22 Use the Biot-Savart law to derive the magnetic field of a long, 
straight wire, and show that this reproduces the result of example 
6 on page 612. z 


23 (a) Modify the calculation on page 617 to determine the 
component of the magnetic field of a sheet of charge that is perpen- 
dicular to the sheet. v 
(b) Show that your answer has the right units. 

(c) Show that your answer approaches zero as z approaches infinity. 
(d) What happens to your answer in the case of a = b? Explain 
why this makes sense. ~ 


24 Consider two solenoids, one of which is smaller so that it 
can be put inside the other. Assume they are long enough so that 
each one only contributes significantly to the field inside itself, and 
the interior fields are nearly uniform. Consider the configuration 
where the small one is inside the big one with their currents circu- 
lating in the same direction, and a second configuration in which 
the currents circulate in opposite directions. Compare the energies 
of these configurations with the energy when the solenoids are far 
apart. Based on this reasoning, which configuration is stable, and 
in which configuration will the little solenoid tend to get twisted 
around or spit out? > Hint, p. 865 L 


25 (a) A solenoid can be imagined as a series of circular current 
loops that are spaced along their common axis. Integrate the result 
of example 12 on page 625 to show that the field on the axis of a 
solenoid can be written as B = (27kn/c”)(cos 8 + cosy), where the 
angles 6 and ¥ are defined in the figure. 

(b) Show that in the limit where the solenoid is very long, this 
exact result agrees with the approximate one derived in example 13 
on page 628 using Ampére’s law. 

(c) Note that, unlike the calculation using Ampére’s law, this one 
is valid at points that are near the mouths of the solenoid, or even 
outside it entirely. If the solenoid is long, at what point on the axis 
is the field equal to one half of its value at the center of the solenoid? 
(d) What happens to your result when you apply it to points that 
are very far away from the solenoid? Does this make sense? a 
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26 The first step in the proof of Ampére’s law on page 629 is to 
show that Ampére’s law holds in the case shown in figure f/1, where 
a circular Ampérian loop is centered on a long, straight wire that is 
perpendicular to the plane of the loop. Carry out this calculation, 
using the result for the field of a wire that was established without 
using Ampére’s law. B 


27 A certain region of space has a magnetic field given by B = 
bay. Find the electric current flowing through the square defined 
by z=0,0<2<a,and0<y<a. / Oo 


28 Perform a calculation similar to the one in problem 54, but 
for a logarithmic spiral, defined by r = we”’, and show that the 
field is B = (kI/c?u)(1/a—1/b). Note that the solution to problem 
54 is given in the back of the book. te 


29 (a) For the geometry described in example 8 on page 615, 
find the field at a point the lies in the plane of the wires, but not 
between the wires, at a distance b from the center line. Use the 
same technique as in that example. 

(b) Now redo the calculation using the technique demonstrated on 
page 620. The integrals are nearly the same, but now the reasoning 
is reversed: you already know 6 = 1, and you want to find an 
unknown field. The only difference in the integrals is that you are 
tiling a different region of the plane in order to mock up the currents 
in the two wires. Note that you can’t tile a region that contains 
a point of interest, since the technique uses the field of a distant 
dipole. Vv Oo 


30 (a) A long, skinny solenoid consists of N turns of wire 
wrapped uniformly around a hollow cylinder of length ¢ and cross- 
sectional area A. Find its inductance. Vv 
(b) Show that your answer has the right units to be an inductance. 


~— 


31 Consider two solenoids, one of which is smaller so that it 
can be put inside the other. Assume they are long enough to act 
like ideal solenoids, so that each one only contributes significantly 
to the field inside itself, and the interior fields are nearly uniform. 
Consider the configuration where the small one is partly inside and 
partly hanging out of the big one, with their currents circulating in 
the same direction. Their axes are constrained to coincide. 

(a) Find the difference in the magnetic energy between the configu- 
ration where the solenoids are separate and the configuration where 
the small one is inserted into the big one. Your equation will in- 
clude the length x of the part of the small solenoid that is inside 
the big one, as well as other relevant variables describing the two 


solenoids. Vv 
(b) Based on your answer to part a, find the force acting between 
the solenoids. vo 


A nautilus shell is  approxi- 
mately a logarithmic spiral, of the 
type in problem 28. 
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32 Verify Ampére’s law in the case shown in the figure, as- 
suming the known equation for the field of a wire. A wire carrying 
current J passes perpendicularly through the center of the rectan- 
gular Ampérian surface. The length of the rectangle is infinite, so 
it’s not necessary to compute the contributions of the ends. g 


33 The purpose of this problem is to find how the gain of a 
transformer depends on its construction. 

(a) The number of loops of wire, N, in a solenoid is changed, while 
keeping the length constant. How does the impedance depend on N? 
State your answer as a proportionality, e.g., Z « N° or Z «x N~°. 
(b) For a given AC voltage applied across the inductor, how does 
the magnetic field depend on N? You need to take into account 
both the dependence of a solenoid’s field on N for a given current 
and your answer to part a, which affects the current. 

(c) Now consider a transformer consisting of two solenoids. The 
input side has Nj loops, and the output No. We wish to find how the 
output voltage V2 depends on Nj, No, and the input voltage Vj. The 
text has already established V2 « V;.No, so it only remains to find 
the dependence on N,. Use your result from part b to accomplish 
this. The ratio V2/V; is called the voltage gain. E 


34 ~~ Problem 33 dealt with the dependence of a transformer’s gain 
on the number of loops of wire in the input solenoid. Carry out a 
similar analysis of how the gain depends on the frequency at which 
the circuit is operated. z 


35 A U-shaped wire makes electrical contact with a second, 
straight wire, which rolls along it to the right, as shown in the 
figure. The whole thing is immersed in a uniform magnetic field, 
which is perpendicular to the plane of the circuit. The resistance of 
the rolling wire is much greater than that of the U. 

(a) Find the direction of the force on the wire based on conservation 
of energy. 

(b) Verify the direction of the force using right-hand rules. 

(c) Find magnitude of the force acting on the wire. There is more 
than one way to do this, but please do it using Faraday’s law (which 
works even though it’s the Ampérian surface itself that is changing, 
rather than the field). Vv 
(d) Consider how the answer to part a would have changed if the 
direction of the field had been reversed, and also do the case where 
the direction of the rolling wire’s motion is reversed. Verify that 
this is in agreement with your answer to part c. a 


Problem 35. 


36 A charged particle is in motion at speed v, in a region of 
vacuum through which an electromagnetic wave is passing. In what 

2 direction should the particle be moving in order to minimize the 
ZZ» total force acting on it? Consider both possibilities for the sign of 


the charge. (Based on a problem by David J. Raymond.) a 
Problem 37. 
37 A wire loop of resistance R and area A, lying in the y — z 


678 Chapter 11 Electromagnetism 


plane, falls through a nonuniform magnetic field B = kzx, where k 
is a constant. The z axis is vertical. 

(a) Find the direction of the force on the wire based on conservation 
of energy. 

(b) Verify the direction of the force using right-hand rules. 

(c) Find the magnetic force on the wire. Vv oO 


38 A capacitor has parallel plates of area A, separated by a 
distance h. If there is a vacuum between the plates, then Gauss’s 
law gives E = 4nko = 4rkq/A for the field between the plates, and 
combining this with E = V/h, we find C = q/V = (1/4ak)A/h. 
(a) Generalize this derivation to the case where there is a dielectric 
between the plates. (b) Suppose we have a list of possible materials 
we could choose as dielectrics, and we wish to construct a capacitor 
that will have the highest possible energy density, U-/v, where v is 
the volume. For each dielectric, we know its permittivity €, and also 
the maximum electric field FE it can sustain without breaking down 
and allowing sparks to cross between the plates. Write the maximum 
energy density in terms of these two variables, and determine a figure 
of merit that could be used to decide which material would be the 
best choice. le 


39 (a) For each term appearing on the right side of Maxwell’s 
equations, give an example of an everyday situation it describes. 
(b) Most people doing calculations in the SI system of units don’t 
use k and k/c?. Instead, they express everything in terms of the 
constants 


1 
€0 rear and 
Atk 
Mo = er 


Rewrite Maxwell’s equations in terms of these constants, eliminating 
k and c everywhere. g 


40 (a) Prove that in an electromagnetic wave, half the energy is 
in the electric field and half in the magnetic field. 

(b) Based on your result from part a, find the proportionality con- 
stant in the relation dp « E x Bdv, where dp is the momentum 
of the part of a plane light wave contained in the volume dv. The 
vector E x B is known as the Poynting vector. (To do this problem, 
you need to know the relativistic relationship between the energy 
and momentum of a beam of light.) vo 


41 (a) A beam of light has cross-sectional area A and power 
P, i.e., P is the number of joules per second that enter a window 
through which the beam passes. Find the energy density U/v in 
terms of P, A, and universal constants. 

(b) Find E and B, the amplitudes of the electric and magnetic fields, 
in terms of P, A, and universal constants (i-e., your answer should 
not include U or v). You will need the result of problem 40a. A real 
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beam of light usually consists of many short wavetrains, not one big 
sine wave, but don’t worry about that. Vv p> Hint, p. 865 
(c) A beam of sunlight has an intensity of P/A = 1.35 x 10° W/m’, 
assuming no clouds or atmospheric absorption. This is known as the 
solar constant. Compute E and B, and compare with the strengths 
of static fields you experience in everyday life: E ~ 10° V/m in a 
thunderstorm, and B ~ 107? T for the Earth’s magnetic field. 
Vv 


42 The circular parallel-plate capacitor shown in the figure is 
being charged up over time, with the voltage difference across the 
plates varying as V = st, where s is a constant. The plates have 
radius b, and the distance between them is d. We assume d < }, 
so that the electric field between the plates is uniform, and parallel 
to the axis. Find the induced magnetic field at a point between the 
plates, at a distance R from the axis. > Hint, p. 865 V 


43 A positively charged particle is released from rest at the origin 
at t = 0, in a region of vacuum through which an electromagnetic 
wave is passing. The particle accelerates in response to the wave. 
In this region of space, the wave varies as E = XEsinwt, B = 
yBsinwt, and we assume that the particle has a relatively large 
value of m/q, so that its response to the wave is sluggish, and it 
never ends up moving at any speed comparable to the speed of light. 
Therefore we don’t have to worry about the spatial variation of the 
wave; we can just imagine that these are uniform fields imposed by 
some external mechanism on this region of space. 


(a) Find the particle’s coordinates as functions of time. v 
(b) Show that the motion is confined to —Zmar < z < Zmax, where 
Dingo TOL (?EB/m.?). = 
44 Electromagnetic waves are supposed to have their electric 


and magnetic fields perpendicular to each other. (Throughout this 
problem, assume we’re talking about waves traveling through a vac- 
uum, and that there is only a single sine wave traveling in a single 
direction, not a superposition of sine waves passing through each 
other.) Suppose someone claims they can make an electromagnetic 
wave in which the electric and magnetic fields lie in the same plane. 
Prove that this is impossible based on Maxwell’s equations. a 


45 Repeat the self-check on page 664, but with one change in 
the procedure: after we charge the capacitor, we open the circuit, 
and then continue with the observations. a 


46 On page 667, I proved that Hy; = Hy). at the boundary 
between two substances if there is no free current and the fields are 
static. In fact, each of Maxwell’s four equations implies a constraint 
with a similar structure. Some are constraints on the field compo- 
nents parallel to the boundary, while others are constraints on the 
perpendicular parts. Since some of the fields referred to in Maxwell’s 
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equations are the electric and magnetic fields E and B, while others 
are the auxiliary fields D and H, some of the constraints deal with 
E and B, others with D and H. Find the other three constraints. 


~ 


A7 (a) Figure j on page 668 shows a hollow sphere with / {io = 
x, inner radius a, and outer radius b, which has been subjected 
to an external field B,. Finding the fields on the exterior, in the 
shell, and on the interior requires finding a set of fields that satisfies 
five boundary conditions: (1) far from the sphere, the field must 
approach the constant B.; (2) at the outer surface of the sphere, 
the field must have H) ; = H),2, as discussed on page 667; (3) the 
same constraint applies at the inner surface of the sphere; (4) and 
(5) there is an additional constraint on the fields at the inner and 
outer surfaces, as found in problem 46. The goal of this problem 
is to find the solution for the fields, and from it, to prove that the 
interior field is uniform, and given by 


B= 9x : B, 
Qe+ Dle+2)-26@ 1)? 


This is a very difficult problem to solve from first principles, because 
it’s not obvious what form the fields should have, and if you hadn’t 
been told, you probably wouldn’t have guessed that the interior field 
would be uniform. We could, however, guess that once the sphere 
becomes polarized by the external field, it would become a dipole, 
and at r > b, the field would be a uniform field superimposed on 
the field of a dipole. It turns out that even close to the sphere, the 
solution has exactly this form. In order to complete the solution, 
we need to find the field in the shell (a < r < 6), but the only way 
this field could match up with the detailed angular variation of the 
interior and exterior fields would be if it was also a superposition of 
a uniform field with a dipole field. The final result is that we have 
four unknowns: the strength of the dipole component of the external 
field, the strength of the uniform and dipole components of the field 
within the shell, and the strength of the uniform interior field. These 
four unknowns are to be determined by imposing constraints (2) 
through (5) above. 

(b) Show that the expression from part a has physically reasonable 
behavior in its dependence on x and a/b. ba 


48 ‘Two long, parallel strips of thin metal foil form a configuration 
like a long, narrow sandwich. The air gap between them has height 
h, the width of each strip is w, and their length is @. Each strip 
carries current J, and we assume for concreteness that the currents 
are in opposite directions, so that the magnetic force, F', between 
the strips is repulsive. 

(a) Find the force in the limit of w > h. Vv 
(b) Find the force in the limit of w < h, which is like two ordinary 
wires. 
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(c) Discuss the relationship between the two results. is 


49 Suppose we are given a permanent magnet with a compli- 
cated, asymmetric shape. Describe how a series of measurements 
with a magnetic compass could be used to determine the strength 
and direction of its magnetic field at some point of interest. Assume 
that you are only able to see the direction to which the compass 
needle settles; you cannot measure the torque acting on it. = 


50 On page 634, the curl of ry was computed. Now consider 
the fields xx and yy. 

(a) Sketch these fields. 

(b) Using the same technique of explicitly constructing a small 
square, prove that their curls are both zero. Do not use the compo- 
nent form of the curl; this was one step in deriving the component 
form of the curl. a 


51 If you watch a movie played backwards, some vectors reverse 
their direction. For instance, people walk backwards, with their 
velocity vectors flipped around. Other vectors, such as forces, keep 
the same direction, e.g., gravity still pulls down. An electric field 
is another example of a vector that doesn’t turn around: positive 
charges are still positive in the time-reversed universe, so they still 
make diverging electric fields, and likewise for the converging fields 
around negative charges. 

(a) How does the momentum of a material object behave under 
time-reversal? > Solution, p. 882 
(b) The laws of physics are still valid in the time-reversed universe. 
For example, show that if two material objects are interacting, and 
momentum is conserved, then momentum is still conserved in the 


time-reversed universe. > Solution, p. 882 
(c) Discuss how currents and magnetic fields would behave under 
time reversal. > Hint, p. 865 
(d) Similarly, show that the equation dp «x E x B is still valid under 
time reversal. z 

52 This problem is a more advanced exploration of the time- 


reversal ideas introduced in problem 51. 

(a) In that problem, we assumed that charge did not flip its sign un- 
der time reversal. Suppose we make the opposite assumption, that 
charge does change its sign. This is an idea introduced by Richard 
Feynman: that antimatter is really matter traveling backward in 
time! Determine the time-reversal properties of E and B under this 
new assumption, and show that dp « E x B is still valid under 
time-reversal. 

(b) Show that Maxwell’s equations are time-reversal symmetric, i.e., 
that if the fields E(z, y, z,t) and B(z, y, z,t) satisfy Maxwell’s equa- 
tions, then so do E(z, y, z, -t) and B(z, y, z, —t). Demonstrate this 
under both possible assumptions about charge, gq > q and q > —q. 

Bes 


Electromagnetism 


53 The purpose of this problem is to prove that the constant 
of proportionality a in the equation dU,, = aB?dv, for the energy 
density of the magnetic field, is given by a = c?/87k as asserted on 
page 619. The geometry we'll use consists of two sheets of current, 
like a sandwich with nothing in between but some vacuum in which 
there is a magnetic field. The currents are in opposite directions, 
and we can imagine them as being joined together at the ends to 
form a complete circuit, like a tube made of paper that has been 
squashed almost flat. The sheets have lengths L in the direction 
parallel to the current, and widths w. They are separated by a dis- 
tance d, which, for convenience, we assume is small compared to LD 
and w. Thus each sheet’s contribution to the field is uniform, and 
can be approximated by the expression 27k7/c?. 

(a) Make a drawing similar to the one in figure 11.2.1 on page 618, 
and show that in this opposite-current configuration, the magnetic 
fields of the two sheets reinforce in the region between them, pro- 
ducing double the field, but cancel on the outside. 

(b) By analogy with the case of a single strand of wire, one sheet’s 
force on the other is JLB, were I = nw is the total current in one 
sheet, and B, = B/2 is the field contributed by only one of the 
sheets, since the sheet can’t make any net force on itself. Based on 
your drawing and the right-hand rule, show that this force is repul- 
sive. 

For the rest of the problem, consider a process in which the sheets 
start out touching, and are then separated to a distance d. Since 
the force between the sheets is repulsive, they do mechanical work 
on the outside world as they are separated, in much the same way 
that the piston in an engine does work as the gases inside the cylin- 
der expand. At the same time, however, there is an induced emf 
which would tend to extinguish the current, so in order to maintain 
a constant current, energy will have to be drained from a battery. 
There are three types of energy involved: the increase in the mag- 
netic field energy, the increase in the energy of the outside world, 
and the decrease in energy as the battery is drained. (We assume 
the sheets have very little resistance, so there is no ohmic heating 
involved.) v 
(c) Find the mechanical work done by the sheets, which equals the 
increase in the energy of the outside world. Show that your result 
can be stated in terms of 7, the final volume v = wld, and nothing 
else but numerical and physical constants. v 
(d) The power supplied by the battery is P = IT g (like P = IAV, 
but with an emf instead of a voltage difference), and the circulation 
is given by . = —d®pg/dt. The negative sign indicates that the bat- 
tery is being drained. Calculate the energy supplied by the battery, 
and, as in part c, show that the result can be stated in terms of 7, 
v, and universal constants. Vv 
(e) Find the increase in the magnetic-field energy, in terms of 7, v, 
and the unknown constant a. 
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(f) Use conservation of energy to relate your answers from parts c, 
d, and e, and solve for a. Vv 


54 Magnet coils are often wrapped in multiple layers. The 
figure shows the special case where the layers are all confined to a 
single plane, forming a spiral. Since the thickness of the wires (plus 
their insulation) is fixed, the spiral that results is a mathematical 
type known as an Archimedean spiral, in which the turns are evenly 
spaced. The equation of the spiral is r = w6, where w is a constant. 
For a spiral that starts from r = a and ends at r = b, show that the 
field at the center is given by (kI/c?w) Inb/a. 
> Solution, p. 883 © 


Key to symbols: 
easy © typical challenging M&difficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 
Exercise 11B: Polarization 
Apparatus: 
calcite (Iceland spar) crystal 
polaroid film 


1. Lay the crystal on a piece of paper that has print on it. You will observe a double image. 
See what happens if you rotate the crystal. 


Evidently the crystal does something to the light that passes through it on the way from the 
page to your eye. One beam of light enters the crystal from underneath, but two emerge from 
the top; by conservation of energy the energy of the original beam must be shared between 
them. Consider the following three possible interpretations of what you have observed: 


(a) The two new beams differ from each other, and from the original beam, only in energy. 
Their other properties are the same. 


(b) The crystal adds to the light some mysterious new property (not energy), which comes in 
two flavors, X and Y. Ordinary light doesn’t have any of either. One beam that emerges from 
the crystal has some X added to it, and the other beam has Y. 


(c) There is some mysterious new property that is possessed by all light. It comes in two flavors, 
X and Y, and most ordinary light sources make an equal mixture of type X and type Y light. 
The original beam is an even mixture of both types, and this mixture is then split up by the 
crystal into the two purified forms. 


In parts 2 and 3 you'll make observations that will allow you to figure out which of these is 
correct. 


2. Now place a polaroid film over the crystal and see what you observe. What happens when 
you rotate the film in the horizontal plane? Does this observation allow you to rule out any of 
the three interpretations? 


3. Now put the polaroid film under the crystal and try the same thing. Putting together all 
your observations, which interpretation do you think is correct? 


4. Look at an overhead light fixture through the polaroid, and try rotating it. What do you 
observe? What does this tell you about the light emitted by the lightbulb? 
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5. Now position yourself with your head under a light fixture and directly over a shiny surface, 
such as a glossy tabletop. You’ll see the lamp’s reflection, and the light coming from the lamp 
to your eye will have undergone a reflection through roughly a 180-degree angle (i.e., it very 
nearly reversed its direction). Observe this reflection through the polaroid, and try rotating it. 
Finally, position yourself so that you are seeing glancing reflections, and try the same thing. 
Summarize what happens to light with properties X and Y when it is reflected. (This is the 
principle behind polarizing sunglasses.) 
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Chapter 12 
Optics 


12.1 The Ray Model of Light 


Ads for one Macintosh computer bragged that it could do an arith- 
metic calculation in less time than it took for the light to get from the 
screen to your eye. We find this impressive because of the contrast 
between the speed of light and the speeds at which we interact with 
physical objects in our environment. Perhaps it shouldn’t surprise 
us, then, that Newton succeeded so well in explaining the motion of 
objects, but was far less successful with the study of light. 


The climax of our study of electricity and magnetism was discov- 
ery that light is an electromagnetic wave. Knowing this, however, is 
not the same as knowing everything about eyes and telescopes. In 
fact, the full description of light as a wave can be rather cumber- 
some. We will instead spend most of our treatment of optics making 
use of a simpler model of light, the ray model, which does a fine job 
in most practical situations. Not only that, but we will even back- 
track a little and start with a discussion of basic ideas about light 
and vision that predated the discovery of electromagnetic waves. 
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12.1.1 The nature of light 
The cause and effect relationship in vision 


Despite its title, this chapter is far from your first look at light. 
That familiarity might seem like an advantage, but most people have 
never thought carefully about light and vision. Even smart people 
who have thought hard about vision have come up with incorrect 
ideas. The ancient Greeks, Arabs and Chinese had theories of light 
and vision, all of which were mostly wrong, and all of which were 
accepted for thousands of years. 


One thing the ancients did get right is that there is a distinction 
between objects that emit light and objects that don’t. When you 
see a leaf in the forest, it’s because three different objects are doing 
their jobs: the leaf, the eye, and the sun. But luminous objects 
like the sun, a flame, or the filament of a light bulb can be seen by 
the eye without the presence of a third object. Emission of light 
is often, but not always, associated with heat. In modern times, 
we are familiar with a variety of objects that glow without being 
heated, including fluorescent lights and glow-in-the-dark toys. 


How do we see luminous objects? The Greek philosophers Pythago- 
ras (b. ca. 560 BC) and Empedocles of Acragas (b. ca. 492 
BC), who unfortunately were very influential, claimed that when 
you looked at a candle flame, the flame and your eye were both 
sending out some kind of mysterious stuff, and when your eye’s stuff 
collided with the candle’s stuff, the candle would become evident to 
your sense of sight. 


Bizarre as the Greek “collision of stuff theory” might seem, it 
had a couple of good features. It explained why both the candle 
and your eye had to be present for your sense of sight to function. 
The theory could also easily be expanded to explain how we see 
nonluminous objects. If a leaf, for instance, happened to be present 
at the site of the collision between your eye’s stuff and the candle’s 
stuff, then the leaf would be stimulated to express its green nature, 
allowing you to perceive it as green. 


Modern people might feel uneasy about this theory, since it sug- 
gests that greenness exists only for our seeing convenience, implying 
a human precedence over natural phenomena. Nowadays, people 
would expect the cause and effect relationship in vision to be the 
other way around, with the leaf doing something to our eye rather 
than our eye doing something to the leaf. But how can you tell? 
The most common way of distinguishing cause from effect is to de- 
termine which happened first, but the process of seeing seems to 
occur too quickly to determine the order in which things happened. 
Certainly there is no obvious time lag between the moment when 
you move your head and the moment when your reflection in the 
mirror moves. 
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Today, photography provides the simplest experimental evidence 
that nothing has to be emitted from your eye and hit the leaf in order 
to make it “greenify.”. A camera can take a picture of a leaf even 
if there are no eyes anywhere nearby. Since the leaf appears green 
regardless of whether it is being sensed by a camera, your eye, or 
an insect’s eye, it seems to make more sense to say that the leaf’s 
greenness is the cause, and something happening in the camera or 
eye is the effect. 


Light is a thing, and it travels from one point to another. 


Another issue that few people have considered is whether a can- 
dle’s flame simply affects your eye directly, or whether it sends out 
light which then gets into your eye. Again, the rapidity of the effect 
makes it difficult to tell what’s happening. If someone throws a rock 
at you, you can see the rock on its way to your body, and you can 
tell that the person affected you by sending a material substance 
your way, rather than just harming you directly with an arm mo- 
tion, which would be known as “action at a distance.” It is not easy 
to do a similar observation to see whether there is some “stuff” that 
travels from the candle to your eye, or whether it is a case of action 
at a distance. 


Newtonian physics includes both action at a distance (e.g., the 
earth’s gravitational force on a falling object) and contact forces 
such as the normal force, which only allow distant objects to exert 
forces on each other by shooting some substance across the space 
between them (e.g., a garden hose spraying out water that exerts a 
force on a bush). 


One piece of evidence that the candle sends out stuff that travels 
to your eye is that as in figure a, intervening transparent substances 
can make the candle appear to be in the wrong location, suggesting 
that light is a thing that can be bumped off course. Many peo- 
ple would dismiss this kind of observation as an optical illusion, 
however. (Some optical illusions are purely neurological or psycho- 
logical effects, although some others, including this one, turn out to 
be caused by the behavior of light itself.) 


A more convincing way to decide in which category light belongs 
is to find out if it takes time to get from the candle to your eye; in 
Newtonian physics, action at a distance is supposed to be instan- 
taneous. The fact that we speak casually today of “the speed of 
light” implies that at some point in history, somebody succeeded in 
showing that light did not travel infinitely fast. Galileo tried, and 
failed, to detect a finite speed for light, by arranging with a person 
in a distant tower to signal back and forth with lanterns. Galileo 
uncovered his lantern, and when the other person saw the light, he 
uncovered his lantern. Galileo was unable to measure any time lag 
that was significant compared to the limitations of human reflexes. 


Section 12.1 


df 


glass 


a/Light from a candle is bumped 
off course by a piece of glass. 
Inserting the glass causes the 
apparent location of the candle 
to shift. The same effect can 
be produced by taking off your 
eyeglasses and looking at which 
you see near the edge of the 
lens, but a flat piece of glass 
works just as well as a lens for 
this purpose. 
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b/An image of Jupiter and 
its moon lo (left) from the Cassini 
probe. 


Jupiter 


Om 


earth 


c/The earth is moving to- 
ward Jupiter and lo. Since the 
distance is shrinking, it is taking 
less and less time for the light to 
get to us from lo, and lo appears 
to circle Jupiter more quickly than 
normal. Six months later, the 
earth will be on the opposite side 
of the sun, and receding from 
Jupiter and lo, so lo will appear 
to revolve around Jupiter more 
slowly. 


The first person to prove that light’s speed was finite, and to 
determine it numerically, was Ole Roemer, in a series of measure- 
ments around the year 1675. Roemer observed Io, one of Jupiter’s 
moons, over a period of several years. Since Io presumably took the 
same amount of time to complete each orbit of Jupiter, it could be 
thought of as a very distant, very accurate clock. A practical and ac- 
curate pendulum clock had recently been invented, so Roemer could 
check whether the ratio of the two clocks’ cycles, about 42.5 hours 
to 1 orbit, stayed exactly constant or changed a little. If the process 
of seeing the distant moon was instantaneous, there would be no 
reason for the two to get out of step. Even if the speed of light was 
finite, you might expect that the result would be only to offset one 
cycle relative to the other. The earth does not, however, stay at a 
constant distance from Jupiter and its moons. Since the distance is 
changing gradually due to the two planets’ orbital motions, a finite 
speed of light would make the “Io clock” appear to run faster as the 
planets drew near each other, and more slowly as their separation 
increased. Roemer did find a variation in the apparent speed of Io’s 
orbits, which caused Io’s eclipses by Jupiter (the moments when Io 
passed in front of or behind Jupiter) to occur about 7 minutes early 
when the earth was closest to Jupiter, and 7 minutes late when it 
was farthest. Based on these measurements, Roemer estimated the 
speed of light to be approximately 2 x 10° m/s, which is in the right 
ballpark compared to modern measurements of 3x 10° m/s. (I’m not 
sure whether the fairly large experimental error was mainly due to 
imprecise knowledge of the radius of the earth’s orbit or limitations 
in the reliability of pendulum clocks.) 


Light can travel through a vacuum. 


Many people are confused by the relationship between sound 
and light. Although we use different organs to sense them, there are 
some similarities. For instance, both light and sound are typically 
emitted in all directions by their sources. Musicians even use visual 
metaphors like “tone color,” or “a bright timbre” to describe sound. 
One way to see that they are clearly different phenomena is to note 
their very different velocities. Sure, both are pretty fast compared to 
a flying arrow or a galloping horse, but as we have seen, the speed of 
light is so great as to appear instantaneous in most situations. The 
speed of sound, however, can easily be observed just by watching a 
group of schoolchildren a hundred feet away as they clap their hands 
to a song. There is an obvious delay between when you see their 
palms come together and when you hear the clap. 


The fundamental distinction between sound and light is that 
sound is an oscillation in air pressure, so it requires air (or some 
other medium such as water) in which to travel. Today, we know 
that outer space is a vacuum, so the fact that we get light from the 
sun, moon and stars clearly shows that air is not necessary for the 
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propagation of light. 
Discussion Questions 


A If you observe thunder and lightning, you can tell how far away the 
storm is. Do you need to know the speed of sound, of light, or of both? 


B When phenomena like X-rays and cosmic rays were first discovered, 
suggest a way one could have tested whether they were forms of light. 


Cc Why did Roemer only need to know the radius of the earth’s orbit, 
not Jupiter’s, in order to find the speed of light? 


12.1.2 Interaction of light with matter 
Absorption of light 


The reason why the sun feels warm on your skin is that the 
sunlight is being absorbed, and the light energy is being transformed 
into heat energy. The same happens with artificial light, so the net 
result of leaving a light turned on is to heat the room. It doesn’t 
matter whether the source of the light is hot, like the sun, a flame, 
or an incandescent light bulb, or cool, like a fluorescent bulb. (If 
your house has electric heat, then there is absolutely no point in 
fastidiously turning off lights in the winter; the lights will help to 
heat the house at the same dollar rate as the electric heater.) 


This process of heating by absorption is entirely different from 
heating by thermal conduction, as when an electric stove heats 
spaghetti sauce through a pan. Heat can only be conducted through 
matter, but there is vacuum between us and the sun, or between us 
and the filament of an incandescent bulb. Also, heat conduction can 
only transfer heat energy from a hotter object to a colder one, but a 
cool fluorescent bulb is perfectly capable of heating something that 
had already started out being warmer than the bulb itself. 


How we see nonluminous objects 


Not all the light energy that hits an object is transformed into 
heat. Some is reflected, and this leads us to the question of how 
we see nonluminous objects. If you ask the average person how we 
see a light bulb, the most likely answer is “The light bulb makes 
light, which hits our eyes.” But if you ask how we see a book, they 
are likely to say “The bulb lights up the room, and that lets me 
see the book.” All mention of light actually entering our eyes has 
mysteriously disappeared. 


Most people would disagree if you told them that light was re- 
flected from the book to the eye, because they think of reflection as 
something that mirrors do, not something that a book does. They 
associate reflection with the formation of a reflected image, which 
does not seem to appear in a piece of paper. 


Imagine that you are looking at your reflection in a nice smooth 
piece of aluminum foil, fresh off the roll. You perceive a face, not a 
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piece of metal. Perhaps you also see the bright reflection of a lamp 
over your shoulder behind you. Now imagine that the foil is just 
a little bit less smooth. The different parts of the image are now 
a little bit out of alignment with each other. Your brain can still 
recognize a face and a lamp, but it’s a little scrambled, like a Picasso 
painting. Now suppose you use a piece of aluminum foil that has 
been crumpled up and then flattened out again. The parts of the 
image are so scrambled that you cannot recognize an image. Instead, 
your brain tells you you’re looking at a rough, silvery surface. 


Mirror-like reflection at a specific angle is known as specular 
reflection, and random reflection in many directions is called diffuse 
reflection. Diffuse reflection is how we see nonluminous objects. 
Specular reflection only allows us to see images of objects other 
than the one doing the reflecting. In top part of figure d, imagine 
that the rays of light are coming from the sun. If you are looking 
down at the reflecting surface, there is no way for your eye-brain 
system to tell that the rays are not really coming from a sun down 
below you. 


Figure f shows another example of how we can’t avoid the con- 
clusion that light bounces off of things other than mirrors. The 
lamp is one I have in my house. It has a bright bulb, housed in a 
completely opaque bowl-shaped metal shade. The only way light 
can get out of the lamp is by going up out of the top of the bowl. 
The fact that I can read a book in the position shown in the figure 
means that light must be bouncing off of the ceiling, then bouncing 
off of the book, then finally getting to my eye. 


This is where the shortcomings of the Greek theory of vision 
become glaringly obvious. In the Greek theory, the light from the 
bulb and my mysterious “eye rays” are both supposed to go to the 
book, where they collide, allowing me to see the book. But we now 
have a total of four objects: lamp, eye, book, and ceiling. Where 
does the ceiling come in? Does it also send out its own mysterious 
“ceiling rays,” contributing to a three-way collision at the book? 
That would just be too bizarre to believe! 


The differences among white, black, and the various shades of 
gray in between is a matter of what percentage of the light they 
absorb and what percentage they reflect. That’s why light-colored 
clothing is more comfortable in the summer, and light-colored up- 
holstery in a car stays cooler that dark upholstery. 
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Numerical measurement of the brightness of light 


We have already seen that the physiological sensation of loudness 
relates to the sound’s intensity (power per unit area), but is not 
directly proportional to it. If sound A has an intensity of 1 nW/m?, 
sound B is 10 nW/m?, and sound C is 100 nW/m?, then the increase 
in loudness from B to C is perceived to be the same as the increase 
from A to B, not ten times greater. That is, the sensation of loudness 
is logarithmic. 


The same is true for the brightness of light. Brightness is re- 
lated to power per unit area, but the psychological relationship is 
a logarithmic one rather than a proportionality. For doing physics, 
it’s the power per unit area that we’re interested in. The relevant 
unit is W/m?. One way to determine the brightness of light is to 
measure the increase in temperature of a black object exposed to 
the light. The light energy is being converted to heat energy, and 
the amount of heat energy absorbed in a given amount of time can 
be related to the power absorbed, using the known heat capacity 
of the object. More practical devices for measuring light intensity, 
such as the light meters built into some cameras, are based on the 
conversion of light into electrical energy, but these meters have to 
be calibrated somehow against heat measurements. 


Discussion Questions 


A __ The curtains in a room are drawn, but a small gap lets light through, 
illuminating a spot on the floor. It may or may not also be possible to see 
the beam of sunshine crossing the room, depending on the conditions. 
What's going on? 

B Laser beams are made of light. In science fiction movies, laser 


beams are often shown as bright lines shooting out of a laser gun on a 
spaceship. Why is this scientifically incorrect? 


C <A documentary film-maker went to Harvard’s 1987 graduation cer- 
emony and asked the graduates, on camera, to explain the cause of the 
seasons. Only two out of 23 were able to give a correct explanation, but 
you now have all the information needed to figure it out for yourself, as- 
suming you didn’t already know. The figure shows the earth in its winter 
and summer positions relative to the sun. Hint: Consider the units used 
to measure the brightness of light, and recall that the sun is lower in the 
sky in winter, so its rays are coming in at a shallower angle. 


a 


12.1.3 The ray model of light 
Models of light 


Note how I’ve been casually diagramming the motion of light 
with pictures showing light rays as lines on the page. More formally, 
this is known as the ray model of light. The ray model of light 
seems natural once we convince ourselves that light travels through 
space, and observe phenomena like sunbeams coming through holes 
in clouds. Having already been introduced to the concept of light 
as an electromagnetic wave, you know that the ray model is not the 


f/Light bounces off of the 
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h/ Three models of light. 


ultimate truth about light, but the ray model is simpler, and in any 
case science always deals with models of reality, not the ultimate 
nature of reality. The following table summarizes three models of 
light. 


ray model ~_——» Advantage: Simplicity. 


Advantage: Color is described 


naturally in terms of wavelength. 
wave model Required in order to explain the 
interaction of light with material 


objects of sizes comparable to or 
smaller than a wavelength of light. 


° Required in order to explain the 
particle model . _ interaction of light with individual 
° atoms. At the atomic level, it 
becomes apparent that a beam of 
: light has a certain graininess to it. 


The ray model is a generic one. By using it we can discuss the 
path taken by the light, without committing ourselves to any specific 
description of what it is that is moving along that path. We will 
use the nice simple ray model for most of our treatment of optics, 
and with it we can analyze a great many devices and phenomena. 
Not until section 12.5 will we concern ourselves specifically with 
wave optics, although in the intervening chapters I will sometimes 
analyze the same phenomenon using both the ray model and the 
wave model. 


Note that the statements about the applicability of the various 
models are only rough guides. For instance, wave interference effects 
are often detectable, if small, when light passes around an obstacle 
that is quite a bit bigger than a wavelength. Also, the criterion for 
when we need the particle model really has more to do with energy 
scales than distance scales, although the two turn out to be related. 


The alert reader may have noticed that the wave model is re- 
quired at scales smaller than a wavelength of light (on the order of a 
micrometer for visible light), and the particle model is demanded on 
the atomic scale or lower (a typical atom being a nanometer or so in 
size). This implies that at the smallest scales we need both the wave 
model and the particle model. They appear incompatible, so how 
can we simultaneously use both? The answer is that they are not 
as incompatible as they seem. Light is both a wave and a particle, 
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but a full understanding of this apparently nonsensical statement is 
a topic for section 13.2. 


Y 
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Ray diagrams 


Without even knowing how to use the ray model to calculate 
anything numerically, we can learn a great deal by drawing ray 
diagrams. For instance, if you want to understand how eyeglasses 
help you to see in focus, a ray diagram is the right place to start. 
Many students under-utilize ray diagrams in optics and instead rely 
on rote memorization or plugging into formulas. The trouble with 
memorization and plug-ins is that they can obscure what’s really 
going on, and it is easy to get them wrong. Often the best plan is to 
do a ray diagram first, then do a numerical calculation, then check 
that your numerical results are in reasonable agreement with what 
you expected from the ray diagram. 


Figure j shows some guidelines for using ray diagrams effectively. 
The light rays bend when they pass out through the surface of the 
water (a phenomenon that we’ll discuss in more detail later). The 
rays appear to have come from a point above the goldfish’s actual 
location, an effect that is familiar to people who have tried spear- 
fishing. 


e A stream of light is not really confined to a finite number of 
narrow lines. We just draw it that way. In j/1, it has been 
necessary to choose a finite number of rays to draw (five), 
rather than the theoretically infinite number of rays that will 
diverge from that point. 


i/ Examples of ray diagrams. 


j/1. Correct. 2. Incorrect: im- 
plies that diffuse reflection only 
gives one ray from each reflecting 
point. 3. Correct, but unneces- 
sarily complicated 
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e There is a tendency to conceptualize rays incorrectly as ob- 
jects. In his Optics, Newton goes out of his way to caution 
the reader against this, saying that some people “consider ... 
the refraction of ... rays to be the bending or breaking of them 
in their passing out of one medium into another.” But a ray 
is a record of the path traveled by light, not a physical thing 
that can be bent or broken. 


e In theory, rays may continue infinitely far into the past and 
future, but we need to draw lines of finite length. In j/1, a 
judicious choice has been made as to where to begin and end 
the rays. There is no point in continuing the rays any farther 
than shown, because nothing new and exciting is going to 
happen to them. There is also no good reason to start them 
earlier, before being reflected by the fish, because the direction 
of the diffusely reflected rays is random anyway, and unrelated 
to the direction of the original, incoming ray. 


e When representing diffuse reflection in a ray diagram, many 
students have a mental block against drawing many rays fan- 
ning out from the same point. Often, as in example j/2, the 
problem is the misconception that light can only be reflected 
in one direction from one point. 


Another difficulty associated with diffuse reflection, example 
j/3, is the tendency to think that in addition to drawing many 
rays coming out of one point, we should also be drawing many 
rays coming from many points. In j/1, drawing many rays 
coming out of one point gives useful information, telling us, 
for instance, that the fish can be seen from any angle. Drawing 
many sets of rays, as in j/3, does not give us any more useful 
information, and just clutters up the picture in this example. 
The only reason to draw sets of rays fanning out from more 
than one point would be if different things were happening to 
the different sets. 


Discussion Question 


A Suppose an intelligent tool-using fish is spear-hunting for humans. 
Draw a ray diagram to show how the fish has to correct its aim. Note 
that although the rays are now passing from the air to the water, the same 
rules apply: the rays are closer to being perpendicular to the surface when 
they are in the water, and rays that hit the air-water interface at a shallow 
angle are bent the most. 


12.1.4 Geometry of specular reflection 


To change the motion of a material object, we use a force. Is 
there any way to exert a force on a beam of light? Experiments 
show that electric and magnetic fields do not deflect light beams, so 
apparently light has no electric charge. Light also has no mass, so 
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until the twentieth century it was believed to be immune to gravity 
as well. Einstein predicted that light beams would be very slightly 
deflected by strong gravitational fields, and he was proved correct 
by observations of rays of starlight that came close to the sun, but 
obviously that’s not what makes mirrors and lenses work! 


If we investigate how light is reflected by a mirror, we will find 
that the process is horrifically complex, but the final result is sur- 
prisingly simple. What actually happens is that the light is made 
of electric and magnetic fields, and these fields accelerate the elec- 
trons in the mirror. Energy from the light beam is momentarily 
transformed into extra kinetic energy of the electrons, but because 
the electrons are accelerating they re-radiate more light, convert- 
ing their kinetic energy back into light energy. We might expect 
this to result in a very chaotic situation, but amazingly enough, the 
electrons move together to produce a new, reflected beam of light, 
which obeys two simple rules: 


e The angle of the reflected ray is the same as that of the incident 
ray. 


e The reflected ray lies in the plane containing the incident ray 
and the normal (perpendicular) line. This plane is known as 
the plane of incidence. 


The two angles can be defined either with respect to the normal, 
like angles B and C in the figure, or with respect to the reflecting 
surface, like angles A and D. There is a convention of several hundred 
years’ standing that one measures the angles with respect to the 
normal, but the rule about equal angles can logically be stated either 
as B=C or as A=D. 


The phenomenon of reflection occurs only at the boundary be- 
tween two media, just like the change in the speed of light that 
passes from one medium to another. As we have seen in section 6.2, 
this is the way all waves behave. 


Most people are surprised by the fact that light can be reflected 
back from a less dense medium. For instance, if you are diving and 
you look up at the surface of the water, you will see a reflection of 
yourself. 
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self-check A 

Each of these diagrams is supposed to show two different rays being 
reflected from the same point on the same mirror. Which are correct, 
and which are incorrect? 
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Tae > Answer, p. 872 


Reversibility of light rays 


The fact that specular reflection displays equal angles of inci- 
dence and reflection means that there is a symmetry: if the ray had 
come in from the right instead of the left in the figure above, the an- 
gles would have looked exactly the same. This is not just a pointless 
detail about specular reflection. It’s a manifestation of a very deep 
and important fact about nature, which is that the laws of physics 
do not distinguish between past and future. Cannonballs and plan- 
ets have trajectories that are equally natural in reverse, and so do 
light rays. This type of symmetry is called time-reversal symmetry. 


Typically, time-reversal symmetry is a characteristic of any pro- 
cess that does not involve heat. For instance, the planets do not 
experience any friction as they travel through empty space, so there 
is no frictional heating. We should thus expect the time-reversed 
versions of their orbits to obey the laws of physics, which they do. 
In contrast, a book sliding across a table does generate heat from 
friction as it slows down, and it is therefore not surprising that this 
type of motion does not appear to obey time-reversal symmetry. A 
book lying still on a flat table is never observed to spontaneously 
start sliding, sucking up heat energy and transforming it into kinetic 
energy. 


Similarly, the only situation we’ve observed so far where light 
does not obey time-reversal symmetry is absorption, which involves 
heat. Your skin absorbs visible light from the sun and heats up, 
but we never observe people’s skin to glow, converting heat energy 
into visible light. People’s skin does glow in infrared light, but 
that doesn’t mean the situation is symmetric. Even if you absorb 
infrared, you don’t emit visible light, because your skin isn’t hot 
enough to glow in the visible spectrum. 


These apparent heat-related asymmetries are not actual asym- 
metries in the laws of physics. The interested reader may wish to 
learn more about this from optional chapter 5 on thermodynamics. 


Ray tracing on a computer example 1 
A number of techniques can be used for creating artificial visual 
scenes in computer graphics. Figure | shows such a scene, which 
was created by the brute-force technique of simply constructing 
a very detailed ray diagram on a computer. This technique re- 
quires a great deal of computation, and is therefore too slow to 
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be used for video games and computer-animated movies. One 
trick for soeeding up the computation is to exploit the reversibility 
of light rays. If one was to trace every ray emitted by every illu- 
minated surface, only a tiny fraction of those would actually end 
up passing into the virtual “camera,” and therefore almost all of 
the computational effort would be wasted. One can instead start 
a ray at the camera, trace it backward in time, and see where it 
would have come from. With this technique, there is no wasted 
effort. 
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|/ This photorealistic image of a nonexistent countertop was _ pro- 
duced completely on a computer, by computing a complicated ray 
diagram. 
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n/ Discussion question C. 


o/The solid lines are physi- 
cally possible paths for light rays 
traveling from A to B and from 
A to C. They obey the principle 
of least time. The dashed lines 
do not obey the principle of 
least time, and are not physically 
possible. 


Discussion Questions 


A Ifa light ray has a velocity vector with components c, and c,, what 
will happen when it is reflected from a surface that lies along the y axis? 
Make sure your answer does not imply a change in the ray’s speed. 


B Generalizing your reasoning from discussion question A, what will 
happen to the velocity components of a light ray that hits a corner, as 
shown in the figure, and undergoes two reflections? 


C__ Three pieces of sheet metal arranged perpendicularly as shown in 
the figure form what is known as a radar corner. Let’s assume that the 
radar corner is large compared to the wavelength of the radar waves, so 
that the ray model makes sense. If the radar corner is bathed in radar 
rays, at least some of them will undergo three reflections. Making a fur- 
ther generalization of your reasoning from the two preceding discussion 
questions, what will happen to the three velocity components of such a 
ray? What would the radar corner be useful for? 


12.1.5 The principle of least time for reflection 


We had to choose between an unwieldy explanation of reflection 
at the atomic level and a simpler geometric description that was 
not as fundamental. There is a third approach to describing the 
interaction of light and matter which is very deep and beautiful. 
Emphasized by the twentieth-century physicist Richard Feynman, 
it is called the principle of least time, or Fermat’s principle. 


Let’s start with the motion of light that is not interacting with 
matter at all. In a vacuum, a light ray moves in a straight line. This 
can be rephrased as follows: of all the conceivable paths light could 
follow from P to Q, the only one that is physically possible is the 
path that takes the least time. 


What about reflection? If light is going to go from one point to 
another, being reflected on the way, the quickest path is indeed the 
one with equal angles of incidence and reflection. If the starting and 
ending points are equally far from the reflecting surface, 0, it’s not 
hard to convince yourself that this is true, just based on symmetry. 
There is also a tricky and simple proof, shown in figure p, for the 
more general case where the points are at different distances from 
the surface. 


Not only does the principle of least time work for light in a 
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vacuum and light undergoing reflection, we will also see in a later 
chapter that it works for the bending of light when it passes from 
one medium into another. 


Although it is beautiful that the entire ray model of light can 
be reduced to one simple rule, the principle of least time, it may 
seem a little spooky to speak as if the ray of light is intelligent, 
and has carefully planned ahead to find the shortest route to its 
destination. How does it know in advance where it’s going? What 
if we moved the mirror while the light was en route, so conditions 
along its planned path were not what it “expected?” The answer 
is that the principle of least time is really a shortcut for finding 
certain results of the wave model of light, which is the topic of the 
last chapter of this book. 


There are a couple of subtle points about the principle of least 
time. First, the path does not have to be the quickest of all pos- 
sible paths; it only needs to be quicker than any path that differs 
infinitesimally from it. In figure p, for instance, light could get from 
A to B either by the reflected path AQB or simply by going straight 
from A to B. Although AQB is not the shortest possible path, it 
cannot be shortened by changing it infinitesimally, e.g., by moving 
Q a little to the right or left. On the other hand, path APB is phys- 
ically impossible, because it is possible to improve on it by moving 
point P infinitesimally to the right. 


It’s not quite right to call this the principle of least time. In fig- 
ure q, for example, the four physically possible paths by which a ray 
can return to the center consist of two shortest-time paths and two 
longest-time paths. Strictly speaking, we should refer to the prin- 
ciple of least or greatest time, but most physicists omit the niceties, 
and assume that other physicists understand that both maxima and 
minima are possible. 
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p/Paths AQB and APB are 
two conceivable paths that a ray 
could follow to get from A to B 
with one reflection, but only AQB 
is physically possible. We wish 
to prove that the path AQB, with 
equal angles of incidence and 
reflection, is shorter than any 
other path, such as APB. The 
trick is to construct a third point, 
C, lying as far below the surface 
as B lies above it. Then path 
AQC is a straight line whose 
length is the same as AQB’s, and 
path APC has the same length as 
path APB. Since AQC is straight, 
it must be shorter than any other 
path such as APC that connects 
A and C, and therefore AQB must 
be shorter than any path such as 
APB. 


q/Light is emitted at the center 
of an elliptical mirror. There are 
four physically possible paths by 
which a ray can be reflected and 
return to the center. 
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12.2 Images by Reflection 


Infants are always fascinated by the antics of the Baby in the Mirror. 
Now if you want to know something about mirror images that most 
people don’t understand, try this. First bring this page closer and 
closer to your eyes, until you can no longer focus on it without 
straining. Then go in the bathroom and see how close you can 
get your face to the surface of the mirror before you can no longer 
easily focus on the image of your own eyes. You will find that 
the shortest comfortable eye-mirror distance is much less than the 
shortest comfortable eye-paper distance. This demonstrates that 
the image of your face in the mirror acts as if it had depth and 
existed in the space behind the mirror. If the image was like a flat 
picture in a book, then you wouldn’t be able to focus on it from 
such a short distance. 


In this chapter we will study the images formed by flat and 
curved mirrors on a qualitative, conceptual basis. Although this 
type of image is not as commonly encountered in everyday life as 
images formed by lenses, images formed by reflection are simpler to 
understand, so we discuss them first. In section 12.3 we will turn 
to a more mathematical treatment of images made by reflection. 
Surprisingly, the same equations can also be applied to lenses, which 
are the topic of section 12.4. 


12.2.1 A virtual image 


We can understand a mirror image using a ray diagram. Figure 
a shows several light rays, 1, that originated by diffuse reflection at 
the person’s nose. They bounce off the mirror, producing new rays, 
2. To anyone whose eye is in the right position to get one of these 
rays, they appear to have come from a behind the mirror, 3, where 
they would have originated from a single point. This point is where 
the tip of the image-person’s nose appears to be. A similar analysis 
applies to every other point on the person’s face, so it looks as 
though there was an entire face behind the mirror. The customary 
way of describing the situation requires some explanation: 


Customary description in physics: There is an image of the face 
behind the mirror. 


Translation: The pattern of rays coming from the mirror is exactly 
the same as it would be if there were a face behind the mirror. 
Nothing is really behind the mirror. 


This is referred to as a virtual image, because the rays do not 
actually cross at the point behind the mirror. They only appear to 
have originated there. 


self-check B 


Optics 


Imagine that the person in figure a moves his face down quite a bit — 
a couple of feet in real life, or a few inches on this scale drawing. The 
mirror stays where it is. Draw a new ray diagram. Will there still be an 
image? If so, where is it visible from? > Answer, p. 872 


The geometry of specular reflection tells us that rays 1 and 2 
are at equal angles to the normal (the imaginary perpendicular line 
piercing the mirror at the point of reflection). This means that 
ray 2’s imaginary continuation, 3, forms the same angle with the 
mirror as ray 3. Since each ray of type 3 forms the same angles with 
the mirror as its partner of type 1, we see that the distance of the 
image from the mirror is the same as that of the actual face from 
the mirror, and it lies directly across from it. The image therefore 
appears to be the same size as the actual face. 


image of b/ Example 2. 
eyeichart el eye chart 
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Figure b shows a typical setup in an optometrist’s examination 
room. The patient’s vision is supposed to be tested at a distance 
of 6 meters (20 feet in the U.S.), but this distance is larger than 
the amount of space available in the room. Therefore a mirror is 
used to create an image of the eye chart behind the wall. 


The Praxinoscope example 3 
Figure c shows an old-fashioned device called a praxinoscope, 
which displays an animated picture when spun. The removable 
strip of paper with the pictures printed on it has twice the radius 
of the inner circle made of flat mirrors, so each picture’s virtual 
image is at the center. As the wheel spins, each picture’s image 
is replaced by the next. 


c/ The praxinoscope. 


Section 12.2 Images by Reflection 703 


Discussion Question 


A __ The figure shows an object that is off to one side of a mirror. Draw 
a ray diagram. Is an image formed? If so, where is it, and from which 
directions would it be visible? 
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12.2.2 Curved mirrors 


An image in a flat mirror is a pretechnological example: even 
animals can look at their reflections in a calm pond. We now pass 
to our first nontrivial example of the manipulation of an image by 
technology: an image in a curved mirror. Before we dive in, let’s 
consider why this is an important example. If it was just a ques- 
tion of memorizing a bunch of facts about curved mirrors, then you 
would rightly rebel against an effort to spoil the beauty of your lib- 
erally educated brain by force-feeding you technological trivia. The 
reason this is an important example is not that curved mirrors are 
so important in and of themselves, but that the results we derive for 
curved bowl-shaped mirrors turn out to be true for a large class of 
other optical devices, including mirrors that bulge outward rather 
than inward, and lenses as well. A microscope or a telescope is sim- 
ply a combination of lenses or mirrors or both. What you’re really 
learning about here is the basic building block of all optical devices 
from movie projectors to octopus eyes. 


Because the mirror in figure d is curved, it bends the rays back 
closer together than a flat mirror would: we describe it as converging. 
Note that the term refers to what it does to the light rays, not to the 
physical shape of the mirror’s surface . (The surface itself would be 
described as concave. The term is not all that hard to remember, 
because the hollowed-out interior of the mirror is like a cave.) It 
is surprising but true that all the rays like 3 really do converge on 
a point, forming a good image. We will not prove this fact, but it 
is true for any mirror whose curvature is gentle enough and that 
is symmetric with respect to rotation about the perpendicular line 
passing through its center (not asymmetric like a potato chip). The 
old-fashioned method of making mirrors and lenses is by grinding 
them in grit by hand, and this automatically tends to produce an 
almost perfect spherical surface. 


Bending a ray like 2 inward implies bending its imaginary contin- 
uation 3 outward, in the same way that raising one end of a seesaw 
causes the other end to go down. The image therefore forms deeper 
behind the mirror. This doesn’t just show that there is extra dis- 
tance between the image-nose and the mirror; it also implies that 
the image itself is bigger from front to back. It has been magnified 
in the front-to-back direction. 


It is easy to prove that the same magnification also applies to the 
image’s other dimensions. Consider a point like E in figure e. The 
trick is that out of all the rays diffusely reflected by E, we pick the 
one that happens to head for the mirror’s center, C. The equal-angle 
property of specular reflection plus a little straightforward geometry 
easily leads us to the conclusion that triangles ABC and CDE are 
the same shape, with ABC being simply a scaled-up version of CDE. 
The magnification of depth equals the ratio BC/CD, and the up- 


d/An image formed by a 
curved mirror. 


y c f 


2 


| 
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e/The image is magnified 
by the same factor in depth and 
in its other dimensions. 


f / Increased magnification 
always comes at the expense of 
decreased field of view. 
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g/1. A virtual image. 2. A real 
image. As you'll verify in home- 
work problem 12, the image is 
upside-down 


down magnification is AB/DE. A repetition of the same proof shows 
that the magnification in the third dimension (out of the page) is 
also the same. This means that the image-head is simply a larger 
version of the real one, without any distortion. The scaling factor 
is called the magnification, M. The image in the figure is magnified 
by a factor M = 1.9. 


Note that we did not explicitly specify whether the mirror was 
a sphere, a paraboloid, or some other shape. However, we assumed 
that a focused image would be formed, which would not necessarily 
be true, for instance, for a mirror that was asymmetric or very deeply 
curved. 


12.2.3 Areal image 


If we start by placing an object very close to the mirror, g/1, 
and then move it farther and farther away, the image at first behaves 
as we would expect from our everyday experience with flat mirrors, 
receding deeper and deeper behind the mirror. At a certain point, 
however, a dramatic change occurs. When the object is more than 
a certain distance from the mirror, g/2, the image appears upside- 
down and in front of the mirror. 


N 


Here’s what’s happened. The mirror bends light rays inward, but 
when the object is very close to it, as in g/1, the rays coming from a 
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given point on the object are too strongly diverging (spreading) for 
the mirror to bring them back together. On reflection, the rays are 
still diverging, just not as strongly diverging. But when the object 
is sufficiently far away, g/2, the mirror is only intercepting the rays 
that came out in a narrow cone, and it is able to bend these enough 
so that they will reconverge. 


Note that the rays shown in the figure, which both originated at 
the same point on the object, reunite when they cross. The point 
where they cross is the image of the point on the original object. 
This type of image is called a real image, in contradistinction to the 
virtual images we’ve studied before. The use of the word “real” is 
perhaps unfortunate. It sounds as though we are saying the image 
was an actual material object, which of course it is not. 


The distinction between a real image and a virtual image is an 
important one, because a real image can projected onto a screen or 
photographic film. If a piece of paper is inserted in figure g/2 at 
the location of the image, the image will be visible on the paper 
(provided the object is bright and the room is dark). Your eye uses 
a lens to make a real image on the retina. 


self-check C 

Sketch another copy of the face in figure g/1, even farther from the 
mirror, and draw a ray diagram. What has happened to the location of 
the image? > Answer, p. 872 


12.2.4 Images of images 


If you are wearing glasses right now, then the light rays from the 
page are being manipulated first by your glasses and then by the lens 
of your eye. You might think that it would be extremely difficult 
to analyze this, but in fact it is quite easy. In any series of optical 
elements (mirrors or lenses or both), each element works on the rays 
furnished by the previous element in exactly the same manner as if 
the image formed by the previous element was an actual object. 


Figure h shows an example involving only mirrors. The Newto- 
nian telescope, invented by Isaac Newton, consists of a large curved 
mirror, plus a second, flat mirror that brings the light out of the 
tube. (In very large telescopes, there may be enough room to put 
a camera or even a person inside the tube, in which case the sec- 
ond mirror is not needed.) The tube of the telescope is not vital; it 
is mainly a structural element, although it can also be helpful for 
blocking out stray light. The lens has been removed from the front 
of the camera body, and is not needed for this setup. Note that the 
two sample rays have been drawn parallel, because an astronomical 
telescope is used for viewing objects that are extremely far away. 
These two “parallel” lines actually meet at a certain point, say a 
crater on the moon, so they can’t actually be perfectly parallel, but 
they are parallel for all practical purposes since we would have to 


h/A Newtonian telescope 


being used with a camera. 
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i/A Newtonian telescope 
being used for visual rather than 
photographic observing. In real 
life, an eyepiece lens is normally 
used for additional magnification, 
but this simpler setup will also 
work. 


j/ The angular size of the flower 
depends on its distance from the 
eye. 


follow them upward for a quarter of a million miles to get to the 
point where they intersect. 


The large curved mirror by itself would form an image I, but the 
small flat: mirror creates an image of the image, I’. The relationship 
between I and I’ is exactly the same as it would be if I was an actual 
object rather than an image: I and I’ are at equal distances from 
the plane of the mirror, and the line between them is perpendicular 
to the plane of the mirror. 


One surprising wrinkle is that whereas a flat mirror used by itself 
forms a virtual image of an object that is real, here the mirror is 
forming a real image of virtual image I. This shows how pointless it 
would be to try to memorize lists of facts about what kinds of images 
are formed by various optical elements under various circumstances. 
You are better off simply drawing a ray diagram. 


Although the main point here was to give an example of an image 
of an image, figure i also shows an interesting case where we need 
to make the distinction between magnification and angular mag- 
nification. If you are looking at the moon through this telescope, 
then the images I and I’ are much smaller than the actual moon. 
Otherwise, for example, image I would not fit inside the telescope! 
However, these images are very close to your eye compared to the 
actual moon. The small size of the image has been more than com- 
pensated for by the shorter distance. The important thing here is 
the amount of angle within your field of view that the image covers, 
and it is this angle that has been increased. The factor by which it 
is increased is called the angular magnification, Mg. 
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Discussion Questions 


A __ Locate the images of you that will be formed if you stand between 
two parallel mirrors. 


B Locate the images formed by two perpendicular mirrors, as in the 
figure. What happens if the mirrors are not perfectly perpendicular? 
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C Locate the images formed by the periscope. 
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12.3 Images, Quantitatively 


It sounds a bit odd when a scientist refers to a theory as “beauti- 
ful,” but to those in the know it makes perfect sense. One mark 
of a beautiful theory is that it surprises us by being simple. The 
mathematical theory of lenses and curved mirrors gives us just such 
a surprise. We expect the subject to be complex because there are 
sO many cases: a converging mirror forming a real image, a diverg- 
ing lens that makes a virtual image, and so on for a total of six 
possibilities. If we want to predict the location of the images in all 
these situations, we might expect to need six different equations, 
and six more for predicting magnifications. Instead, it turns out 
that we can use just one equation for the location of the image and 
one equation for its magnification, and these two equations work 
in all the different cases with no changes except for plus and minus 
signs. This is the kind of thing the physicist Eugene Wigner referred 
to as “the unreasonable effectiveness of mathematics.” Sometimes 
we can find a deeper reason for this kind of unexpected simplicity, 
but sometimes it almost seems as if God went out of Her way to 
make the secrets of universe susceptible to attack by the human 
thought-tool called math. 


12.3.1 Areal image formed by a converging mirror 
Location of the image 


We will now derive the equation for the location of a real image 
formed by a converging mirror. We assume for simplicity that the 
mirror is spherical, but actually this isn’t a restrictive assumption, 
because any shallow, symmetric curve can be approximated by a 
sphere. The shape of the mirror can be specified by giving the 
location of its center, C. A deeply curved mirror is a sphere with a 
small radius, so C is close to it, while a weakly curved mirror has 
C farther away. Given the point O where the object is, we wish to 
find the point I where the image will be formed. 


To locate an image, we need to track a minimum of two rays 
coming from the same point. Since we have proved in the previous 
chapter that this type of image is not distorted, we can use an on-axis 
point, O, on the object, as in figure a/1. The results we derive will 
also hold for off-axis points, since otherwise the image would have 
to be distorted, which we know is not true. We let one of the rays be 
the one that is emitted along the axis; this ray is especially easy to 
trace, because it bounces straight back along the axis again. As our 
second ray, we choose one that strikes the mirror at a distance of 1 
from the axis. “One what?” asks the astute reader. The answer is 
that it doesn’t really matter. When a mirror has shallow curvature, 
all the reflected rays hit the same point, so 1 could be expressed 
in any units you like. It could, for instance, be 1 cm, unless your 
mirror is smaller than 1 cm! 


get bigger 


gets 
smaller 


gets | O 
bigger Moves moves 
closer away 


a/The relationship between 
the object's position and the 
image’s can be expressed in 
terms of the angles 0, and 0. 
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b/The geometrical  interpre- 
tation of the focal angle. 


c/Example 4, an_ alternative 
test for finding the focal angle. 
The mirror is the same as in 
figure b. 


The only way to find out anything mathematical about the rays 
is to use the sole mathematical fact we possess concerning specular 
reflection: the incident and reflected rays form equal angles with 
respect to the normal, which is shown as a dashed line. Therefore 
the two angles shown in figure a/2 are the same, and skipping some 
straightforward geometry, this leads to the visually reasonable result 
that the two angles in figure a/3 are related as follows: 


6; + 65 = constant 


(Note that @; and 6,, which are measured from the image and the 
object, not from the eye like the angles we referred to in discussing 
angular magnification on page 708.) For example, move O farther 
from the mirror. The top angle in figure a/2 is increased, so the 
bottom angle must increase by the same amount, causing the image 
point, I, to move closer to the mirror. In terms of the angles shown in 
figure a/3, the more distant object has resulted in a smaller angle 05, 
while the closer image corresponds to a larger 6;; One angle increases 
by the same amount that the other decreases, so their sum remains 
constant. These changes are summarized in figure a/4. 


The sum 6; + 4, is a constant. What does this constant repre- 
sent? Geometrically, we interpret it as double the angle made by 
the dashed radius line. Optically, it is a measure of the strength of 
the mirror, i.e., how strongly the mirror focuses light, and so we call 
it the focal angle, 0;, 


0; + Oo = OF 


Suppose, for example, that we wish to use a quick and dirty optical 
test to determine how strong a particular mirror is. We can lay 
it on the floor as shown in figure c, and use it to make an image 
of a lamp mounted on the ceiling overhead, which we assume is 
very far away compared to the radius of curvature of the mirror, 
so that the mirror intercepts only a very narrow cone of rays from 
the lamp. This cone is so narrow that its rays are nearly parallel, 
and 6, is nearly zero. The real image can be observed on a piece of 
paper. By moving the paper nearer and farther, we can bring the 
image into focus, at which point we know the paper is located at 
the image point. Since 6) © 0, we have 6; © 67, and we can then 
determine this mirror’s focal angle either by measuring 6; directly 
with a protractor, or indirectly via trigonometry. A strong mirror 
will bring the rays together to form an image close to the mirror, 
and these rays will form a blunt-angled cone with a large 6; and 6,. 


An alternative optical test example 4 
> Figure c shows an alternative optical test. Rather than placing 
the object at infinity as in figure b, we adjust it so that the image 
is right on top of the object. Points O and | coincide, and the rays 
are reflected right back on top of themselves. If we measure the 
angle 8 shown in figure c, how can we find the focal angle? 
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> The object and image angles are the same; the angle labeled 
8 in the figure equals both of them. We therefore have 0; + 05 = 
8 = 0s. Comparing figures b and ¢, it is indeed plausible that the 
angles are related by a factor of two. 


At this point, we could consider our work to be done. Typically, 
we know the strength of the mirror, and we want to find the image 
location for a given object location. Given the mirror’s focal angle 
and the object location, we can determine 6, by trigonometry, sub- 
tract to find 6; = @f — @, and then do more trig to find the image 
location. 


There is, however, a shortcut that can save us from doing so 
much work. Figure a/3 shows two right triangles whose legs of 
length 1 coincide and whose acute angles are 0, and 6;. These can 
be related by trigonometry to the object and image distances shown 
in figure d: 


tan 6, = 1/d, tan 6; = 1/d; 


Ever since chapter 2, we’ve been assuming small angles. For small 
angles, we can use the small-angle approximation tanxz = «x (for x 
in radians), giving simply 
6, 17d Oc = 17d; 

We likewise define a distance called the focal length, f according to 
6 =1/f. In figure b, f is the distance from the mirror to the place 
where the rays cross. We can now reexpress the equation relating 
the object and image positions as 


Ey de 
fod do 


Figure e summarizes the interpretation of the focal length and focal 
1 
angle. 


Which form is better, Of = 0; + @, or 1/f = 1/d; + 1/d,? The 
angular form has in its favor its simplicity and its straightforward 
visual interpretation, but there are two reasons why we might prefer 
the second version. First, the numerical values of the angles depend 
on what we mean by “one unit” for the distance shown as 1 in 


'There is a standard piece of terminology which is that the “focal point” is 
the point lying on the optical axis at a distance from the mirror equal to the focal 
length. This term isn’t particularly helpful, because it names a location where 
nothing normally happens. In particular, it is not normally the place where the 
rays come to a focus! — that would be the image point. In other words, we 
don’t normally have d; = f, unless perhaps do = oo. A recent online discussion 
among some physics teachers (https://carnot.physics.buffalo.edu/archives, Feb. 
2006) showed that many disliked the terminology, felt it was misleading, or didn’t 
know it and would have misinterpreted it if they had come across it. That is, it 
appears to be what grammarians call a “skunked term” — a word that bothers 
half the population when it’s used incorrectly, and the other half when it’s used 
correctly. 


Section 12.3 


d/The object and image dis- 
tances 


AL 
2 
e/Mirror 1 is weaker than 
mirror 2. It has a shallower 


curvature, a longer focal length, 
and a smaller focal angle. It 
reflects rays at angles not much 
different than those that would be 
produced with a flat mirror. 
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figure a/1. Second, it is usually easier to measure distances rather 
than angles, so the distance form is more convenient for number 
crunching. Neither form is superior overall, and we will often need 
to use both to solve any given problem.” 


A searchlight example 5 
Suppose we need to create a parallel beam of light, as in a search- 
light. Where should we place the lightbulb? A parallel beam has 
zero angle between its rays, so 0; = 0. To place the lightbulb 
correctly, however, we need to know a distance, not an angle: 
the distance d) between the bulb and the mirror. The problem 
involves a mixture of distances and angles, so we need to get 
everything in terms of one or the other in order to solve it. Since 
the goal is to find a distance, let’s figure out the image distance 
corresponding to the given angle 0; = 0. These are related by 
d; = 1/0;, so we have dj = oo. (Yes, dividing by zero gives infin- 
ity. Don’t be afraid of infinity. Infinity is a useful problem-solving 
device.) Solving the distance equation for d,, we have 


/f—1/d))~" 


do = ( 
= (1/f—0)7' 


1 
1 


f 


The bulb has to be placed at a distance from the mirror equal to 
its focal point. 


Diopters example 6 
An equation like dj = 1/0; really doesn’t make sense in terms of 
units. Angles are unitless, since radians aren't really units, so 
the right-hand side is unitless. We can’t have a left-hand side 
with units of distance if the right-hand side of the same equation 
is unitless. This is an artifact of my cavalier statement that the 
conical bundles of rays spread out to a distance of 1 from the axis 
where they strike the mirror, without specifying the units used to 
measure this 1. In real life, optometrists define the thing we're 
calling 0; = 1/d; as the “dioptric strength” of a lens or mirror, 
and measure it in units of inverse meters (m~'), also known as 
diopters (1 D=1 m~’). 


Magnification 


We have already discussed in the previous chapter how to find 
the magnification of a virtual image made by a curved mirror. The 
result is the same for a real image, and we omit the proof, which 
is very similar. In our new notation, the result is M = d;/do. A 
numerical example is given in subsection 12.3.2. 


?1 would like to thank Fouad Ajami for pointing out the pedagogical advan- 
tages of using both equations side by side. 
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.2 Other cases with curved mirrors 


The equation d; = (1/f —1/d,)~! can easily produce a negative 
result, but we have been thinking of d; as a distance, and distances 
can’t be negative. A similar problem occurs with 6; = 6, — 6, for 
6, > Of. What’s going on here? 


The interpretation of the angular equation is straightforward. 
As we bring the object closer and closer to the image, 9, gets bigger 
and bigger, and eventually we reach a point where 6, = 6, and 
6; = 0. This large object angle represents a bundle of rays forming 
a cone that is very broad, so broad that the mirror can no longer 
bend them back so that they reconverge on the axis. The image 
angle 6; = 0 represents an outgoing bundle of rays that are parallel. 
The outgoing rays never cross, so this is not a real image, unless we 
want to be charitable and say that the rays cross at infinity. If we 
go on bringing the object even closer, we get a virtual image. 


real image 


unphysical — negative d, 
can be interpreted as 
placing the object on the nat 
other side of the mirror, but =": 
then no image is produced oS. 
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virtual image — negative d; 
can be interpreted as an 
image on the far side of 

the mirror 


To analyze the distance equation, let’s look at a graph of d; as 
a function of dy. The branch on the upper right corresponds to the 
case of a real image. Strictly speaking, this is the only part of the 
graph that we’ve proven corresponds to reality, since we never did 
any geometry for other cases, such as virtual images. As discussed in 
the previous section, making d, bigger causes d; to become smaller, 
and vice-versa. 


f/ A graph of the image distance 
d; as a function of the object dis- 


tance db. 
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Letting d, be less than f is equivalent to #, > 6+ : a virtual image 
is produced on the far side of the mirror. This is the first example 
of Wigner’s “unreasonable effectiveness of mathematics” that we 
have encountered in optics. Even though our proof depended on 
the assumption that the image was real, the equation we derived 
turns out to be applicable to virtual images, provided that we either 
interpret the positive and negative signs in a certain way, or else 
modify the equation to have different positive and negative signs. 


self-check D 

Interpret the three places where, in physically realistic parts of the graph, 
the graph approaches one of the dashed lines. [This will come more 
naturally if you have learned the concept of limits ina math class.] > 
Answer, p. 872 


A flat mirror example 7 
We can even apply the equation to a flat mirror. As a sphere gets 
bigger and bigger, its surface is more and more gently curved. 
The planet Earth is so large, for example, that we cannot even 
perceive the curvature of its surface. To represent a flat mirror, we 
let the mirror’s radius of curvature, and its focal length, become 
infinite. Dividing by infinity gives zero, so we have 


1/do = —1/d) 5 


or 


If we interpret the minus sign as indicating a virtual image on the 
far side of the mirror from the object, this makes sense. 


It turns out that for any of the six possible combinations of 
real or virtual images formed by converging or diverging lenses or 
mirrors, we can apply equations of the form 


0, =O; + A 
and 
eae 
fd; do , 


with only a modification of plus or minus signs. There are two pos- 
sible approaches here. The approach we have been using so far is 
the more popular approach in American textbooks: leave the equa- 
tion the same, but attach interpretations to the resulting negative 
or positive values of the variables. The trouble with this approach 
is that one is then forced to memorize tables of sign conventions, 
e.g., that the value of d; should be negative when the image is a 
virtual image formed by a converging mirror. Positive and negative 
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signs also have to be memorized for focal lengths. Ugh! It’s highly 
unlikely that any student has ever retained these lengthy tables in 
his or her mind for more than five minutes after handing in the final 
exam in a physics course. Of course one can always look such things 
up when they are needed, but the effect is to turn the whole thing 
into an exercise in blindly plugging numbers into formulas. 


As you have gathered by now, there is another method which I 
think is better, and which I’ll use throughout the rest of this book. 
In this method, all distances and angles are positive by definition, 
and we put in positive and negative signs in the equations depending 
on the situation. (I thought I was the first to invent this method, but 
I’ve been told that this is known as the European sign convention, 
and that it’s fairly common in Europe.) Rather than memorizing 
these signs, we start with the generic equations 


0, = +0; +0, 
1 1 1 
f . d; do 


and then determine the signs by a two-step method that depends on 
ray diagrams. There are really only two signs to determine, not four; 
the signs in the two equations match up in the way you’d expect. 
The method is as follows: 


1. Use ray diagrams to decide whether @, and 6; vary in the same 
way or in opposite ways. (In other words, decide whether making 6, 
greater results in a greater value of 6; or a smaller one.) Based on 
this, decide whether the two signs in the angle equation are the same 
or opposite. If the signs are opposite, go on to step 2 to determine 
which is positive and which is negative. 


2. If the signs are opposite, we need to decide which is the 
positive one and which is the negative. Since the focal angle is never 
negative, the smaller angle must be the one with a minus sign. 


In step 1, many students have trouble drawing the ray diagram 
correctly. For simplicity, you should always do your diagram for a 
point on the object that is on the axis of the mirror, and let one 
of your rays be the one that is emitted along the axis and reflected 
straight back on itself, as in the figures in subsection 12.3.1. As 
shown in figure a/4 in subsection 12.3.1, there are four angles in- 
volved: two at the mirror, one at the object (0.), and one at the 
image (0;). Make sure to draw in the normal to the mirror so that 
you can see the two angles at the mirror. These two angles are 
equal, so as you change the object position, they fan out or fan in, 
like opening or closing a book. Once you’ve drawn this effect, you 
should easily be able to tell whether 6, and 6; change in the same 
way or in opposite ways. 


Although focal lengths are always positive in the method used 
in this book, you should be aware that diverging mirrors and lenses 
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are assigned negative focal lengths in the other method, so if you 
see a lens labeled f = —30 cm, you’ll know what it means. 


An anti-shoplifting mirror example 8 
> Convenience stores often install a diverging mirror so that the 
clerk has a view of the whole store and can catch shoplifters. Use 
a ray diagram to show that the image is reduced, bringing more 
into the clerk’s field of view. If the focal length of the mirror is 3.0 
m, and the mirror is 7.0 m from the farthest wall, how deep is the 
image of the store? 


> As shown in ray diagram g/1, a; is less than dp. The magnifica- 
tion, M = dj/do, will be less than one, i.e., the image is actually 
reduced rather than magnified. 


Apply the method outlined above for determining the plus and 
minus signs. Step 1: The object is the point on the opposite 
wall. As an experiment, g/2, move the object closer. | did these 
drawings using illustration software, but if you were doing them 
by hand, you’d want to make the scale much larger for greater 
accuracy. Also, although | split figure g into two separate drawings 
in order to make them easier to understand, you’re less likely to 
make a mistake if you do them on top of each other. 


The two angles at the mirror fan out from the normal. Increasing 
8 has clearly made 9; larger as well. (All four angles got big- 
ger.) There must be a cancellation of the effects of changing the 
two terms on the right in the same way, and the only way to get 
such a cancellation is if the two terms in the angle equation have 
opposite signs: 

Of = +0; — Oo 
or 

0+ = —0; + Qo 
Step 2: Now which is the positive term and which is negative? 
Since the image angle is bigger than the object angle, the angle 
equation must be 

O,=O8;-O0 , 
in order to give a positive result for the focal angle. The signs of 
the distance equation behave the same way: 

1 1 1 


Solving for d;, we find 


The image of the store is reduced by a factor of 2.1/7.0 = 0.3, 
i.e., itis smaller by 70%. 


‘A shortcut for real images example 9 
In the case of a real image, there is a shortcut for step 1, the 
determination of the signs. In a real image, the rays cross at 
both the object and the image. We can therefore time-reverse the 
ray diagram, so that all the rays are coming from the image and 
reconverging at the object. Object and image swap roles. Due 
to this time-reversal symmetry, the object and image cannot be 
treated differently in any of the equations, and they must therefore 
have the same signs. They are both positive, since they must add 
up to a positive result. 


12.3.3 Aberrations 


An imperfection or distortion in an image is called an aberra- 
tion. An aberration can be produced by a flaw in a lens or mirror, 
but even with a perfect optical surface some degree of aberration is 
unavoidable. To see why, consider the mathematical approximation 
we’ve been making, which is that the depth of the mirror’s curve 
is small compared to d, and d;. Since only a flat mirror can sat- 
isfy this shallow-mirror condition perfectly, any curved mirror will 
deviate somewhat from the mathematical behavior we derived by 
assuming that condition. There are two main types of aberration in 
curved mirrors, and these also occur with lenses. 


(1) An object on the axis of the lens or mirror may be imaged 
correctly, but off-axis objects may be out of focus or distorted. In 
a camera, this type of aberration would show up as a fuzziness or 
warping near the sides of the picture when the center was perfectly 
focused. An example of this is shown in figure i, and in that partic- 
ular example, the aberration is not a sign that the equipment was 
of low quality or wasn’t right for the job but rather an inevitable 
result of trying to flatten a panoramic view; in the limit of a 360- 


h/ A diverging mirror in the shape 
of a sphere. The image is re- 
duced (M < 1). This is similar 
to example 8, but here the image 
is distorted because the mirror’s 
curve is not shallow. 


Section 12.3 Images, Quantitatively 719 


i/ This photo was taken using a 
“fish-eye lens,” which gives an ex- 
tremely large field of view. 


degree panorama, the problem would be similar to the problem of 
representing the Earth’s surface on a flat map, which can’t be ac- 
complished without distortion. 


(2) The image may be sharp when the object is at certain dis- 
tances and blurry when it is at other distances. The blurriness 
occurs because the rays do not all cross at exactly the same point. 
If we know in advance the distance of the objects with which the 
mirror or lens will be used, then we can optimize the shape of the 
optical surface to make in-focus images in that situation. For in- 
stance, a spherical mirror will produce a perfect image of an object 
that is at the center of the sphere, because each ray is reflected di- 
rectly onto the radius along which it was emitted. For objects at 
greater distances, however, the focus will be somewhat blurry. In 
astronomy the objects being used are always at infinity, so a spher- 
ical mirror is a poor choice for a telescope. A different shape (a 
parabola) is better specialized for astronomy. 


One way of decreasing aberration is to use a small-diameter mir- 
ror or lens, or block most of the light with an opaque screen with a 
hole in it, so that only light that comes in close to the axis can get 
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parabolic 


ie 


spherical 


through. Either way, we are using a smaller portion of the lens or 
mirror whose curvature will be more shallow, thereby making the 
shallow-mirror (or thin-lens) approximation more accurate. Your 
eye does this by narrowing down the pupil to a smaller hole. In 
a camera, there is either an automatic or manual adjustment, and 
narrowing the opening is called “stopping down.” The disadvantage 
of stopping down is that light is wasted, so the image will be dimmer 
or a longer exposure must be used. 


What I would suggest you take away from this discussion for the 
sake of your general scientific education is simply an understanding 
of what an aberration is, why it occurs, and how it can be reduced, 
not detailed facts about specific types of aberrations. 


j/Spherical mirrors are the 
cheapest to make, but parabolic 
mirrors are better for making 
images of objects at_ infinity. 
A sphere has equal curvature 
everywhere, but a parabola has 
tighter curvature at its center and 
gentler curvature at the sides. 


k / Even though the spherical mir- 
ror (solid line) is not well adapted 
for viewing an object at infinity, 
we can improve its performance 
greatly by stopping it down. Now 
the only part of the mirror be- 
ing used is the central portion, 
where its shape is virtually in- 
distinguishable from a parabola 
(dashed line). 
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1/The Hubble Space Telescope 
was placed into orbit with faulty 
optics in 1990. Its main mir- 
ror was supposed to have been 
nearly parabolic, since it is an as- 
tronomical telescope, meant for 
producing images of objects at in- 
finity. However, contractor Per- 
kin Elmer had delivered a faulty 
mirror, which produced aberra- 
tions. The large photo shows as- 
tronauts putting correcting mirrors 
in place in 1993. The two small 
photos show images produced by 
the telescope before and after the 
fix. 


12.4 Refraction 


Economists normally consider free markets to be the natural way of 
judging the monetary value of something, but social scientists also 
use questionnaires to gauge the relative value of privileges, disad- 
vantages, or possessions that cannot be bought or sold. They ask 
people to imagine that they could trade one thing for another and 
ask which they would choose. One interesting result is that the av- 
erage light-skinned person in the U.S. would rather lose an arm than 
suffer the racist treatment routinely endured by African-Americans. 
Even more impressive is the value of sight. Many prospective par- 
ents can imagine without too much fear having a deaf child, but 
would have a far more difficult time coping with raising a blind one. 


So great is the value attached to sight that some have imbued 
it with mystical aspects. Moses “had vision,” George Bush did not. 
Christian fundamentalists who perceive a conflict between evolution 
and their religion have claimed that the eye is such a perfect device 
that it could never have arisen through a process as helter-skelter as 
evolution, or that it could not have evolved because half of an eye 
would be useless. In fact, the structure of an eye is fundamentally 
dictated by physics, and it has arisen separately by evolution some- 
where between eight and 40 times, depending on which biologist you 
ask. We humans have a version of the eye that can be traced back 
to the evolution of a light-sensitive “eye spot” on the head of an 
ancient invertebrate. A sunken pit then developed so that the eye 
would only receive light from one direction, allowing the organism 
to tell where the light was coming from. (Modern flatworms have 
this type of eye.) The top of the pit then became partially covered, 
leaving a hole, for even greater directionality (as in the nautilus). 
At some point the cavity became filled with jelly, and this jelly fi- 
nally became a lens, resulting in the general type of eye that we 
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share with the bony fishes and other vertebrates. Far from being 
a perfect device, the vertebrate eye is marred by a serious design 
flaw due to the lack of planning or intelligent design in evolution: 
the nerve cells of the retina and the blood vessels that serve them 
are all in front of the light-sensitive cells, blocking part of the light. 
Squids and other molluscs, whose eyes evolved on a separate branch 
of the evolutionary tree, have a more sensible arrangement, with the 
light-sensitive cells out in front. 


12.4.1 Refraction 


Refraction 


The fundamental physical phenomenon at work in the eye is 
that when light crosses a boundary between two media (such as air 
and the eye’s jelly), part of its energy is reflected, but part passes 
into the new medium. In the ray model of light, we describe the 
original ray as splitting into a reflected ray and a transmitted one 
(the one that gets through the boundary). Of course the reflected 
ray goes in a direction that is different from that of the original one, 
according to the rules of reflection we have already studied. More 
surprisingly — and this is the crucial point for making your eye 
focus light — the transmitted ray is bent somewhat as well. This 
bending phenomenon is called refraction. The origin of the word 
is the same as that of the word “fracture,” i.e., the ray is bent or 
“broken.” (Keep in mind, however, that light rays are not physical 
objects that can really be “broken.”) Refraction occurs with all 
waves, not just light waves. 


The actual anatomy of the eye, b, is quite complex, but in essence 
it is very much like every other optical device based on refraction. 
The rays are bent when they pass through the front surface of the 
eye, c. Rays that enter farther from the central axis are bent more, 
with the result that an image is formed on the retina. There is 
only one slightly novel aspect of the situation. In most human-built 
optical devices, such as a movie projector, the light is bent as it 
passes into a lens, bent again as it reemerges, and then reaches a 
focus beyond the lens. In the eye, however, the “screen” is inside 
the eye, so the rays are only refracted once, on entering the jelly, 
and never emerge again. 


A common misconception is that the “lens” of the eye is what 
does the focusing. All the transparent parts of the eye are made 
of fairly similar stuff, so the dramatic change in medium is when a 
ray crosses from the air into the eye (at the outside surface of the 
cornea). This is where nearly all the refraction takes place. The lens 
medium differs only slightly in its optical properties from the rest 
of the eye, so very little refraction occurs as light enters and exits 
the lens. The lens, whose shape is adjusted by muscles attached to 
it, is only meant for fine-tuning the focus to form images of near or 
far objects. 


a/A human eye. 


optic 
nerve 


b/The anatomy of the eye. 


c/A_ simplified optical  dia- 
gram of the eye. Light rays are 
bent when they cross from the 
air into the eye. (A little of the 
incident rays’ energy goes into 
the reflected rays rather than the 
ones transmitted into the eye.) 
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incident ray reflected ray 


medium 1 


d/The _ incident, reflected, 
and transmitted (refracted) rays 
all lie in a plane that includes the 
normal (dashed line). 


medium 1 


e/The angles 6; and Q. are 
related to each other, and also 
depend on the properties of the 
two media. Because refraction 
is time-reversal symmetric, there 
is no need to label the rays with 
arrowheads. 


f/ Refraction has_ time-reversal 
symmetry. Regardless of whether 
the light is going into or out of the 
water, the relationship between 
the two angles is the same, and 
the ray is closer to the normal 
while in the water. 


Refractive properties of media 


What are the rules governing refraction? The first thing to ob- 
serve is that just as with reflection, the new, bent part of the ray lies 
in the same plane as the normal (perpendicular) and the incident 
ray, d. 


If you try shooting a beam of light at the boundary between 
two substances, say water and air, you’ll find that regardless of the 
angle at which you send in the beam, the part of the beam in the 
water is always closer to the normal line, e. It doesn’t matter if the 
ray is entering the water or leaving, so refraction is symmetric with 
respect to time-reversal, f. 


If, instead of water and air, you try another combination of sub- 
stances, say plastic and gasoline, again you’ll find that the ray’s 
angle with respect to the normal is consistently smaller in one and 
larger in the other. Also, we find that if substance A has rays closer 
to normal than in B, and B has rays closer to normal than in C, then 
A has rays closer to normal than C. This means that we can rank- 
order all materials according to their refractive properties. Isaac 
Newton did so, including in his list many amusing substances, such 
as “Danzig vitriol” and “a pseudo-topazius, being a natural, pellu- 
cid, brittle, hairy stone, of a yellow color.” Several general rules can 
be inferred from such a list: 


e Vacuum lies at one end of the list. In refraction across the 
interface between vacuum and any other medium, the other 
medium has rays closer to the normal. 


e Among gases, the ray gets closer to the normal if you increase 
the density of the gas by pressurizing it more. 


e The refractive properties of liquid mixtures and solutions vary 
in a smooth and systematic manner as the proportions of the 
mixture are changed. 


e Denser substances usually, but not always, have rays closer to 
the normal. 


The second and third rules provide us with a method for measur- 
ing the density of an unknown sample of gas, or the concentration 
of a solution. The latter technique is very commonly used, and the 
CRC Handbook of Physics and Chemistry, for instance, contains 
extensive tables of the refractive properties of sugar solutions, cat 
urine, and so on. 


Snell’s law 


The numerical rule governing refraction was discovered by Snell, 
who must have collected experimental data something like what is 
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90 
shown on this graph and then attempted by trial and error to find ee 
the right equation. The equation he came up with was Ww 704 
vo — 
sin 0; oH = | 
; = constant gS 50 5009 
sin 99 404 00° 
ge 60° 
The value of the constant would depend on the combination of media 2 i | 40° 
used. For instance, any one of the data points in the graph would ral Role 
have sufficed to show that the constant was 1.3 for an air-water 6 goo ee eee eran eee 
interface (taking air to be substance 1 and water to be substance Dd 0520.00 40) 20°00" 70289690 
2); Oair (degrees) 
Snell further found that if media A and B gave a constant K 4p g/The relationship between 


and media B and C gave a constant K gc, then refraction at an inter- the angles in refraction. 


face between A and C would be described by a constant equal to the 
product, Kac = KapkKgc. This is exactly what one would expect 
if the constant depended on the ratio of some number characteriz- 
ing one medium to the number characteristic of the second medium. 
This number is called the index of refraction of the medium, written 
as n in equations. Since measuring the angles would only allow him 
to determine the ratio of the indices of refraction of two media, Snell 
had to pick some medium and define it as having n = 1. He chose 
to define vacuum as having n = 1. (The index of refraction of air 
at normal atmospheric pressure is 1.0003, so for most purposes it is 
a good approximation to assume that air has n = 1.) He also had 
to decide which way to define the ratio, and he chose to define it so 
that media with their rays closer to the normal would have larger in- 
dices of refraction. This had the advantage that denser media would 
typically have higher indices of refraction, and for this reason the 
index of refraction is also referred to as the optical density. Written 
in terms of indices of refraction, Snell’s equation becomes 


sin 0, = ng 


sin Ag Ny 


but rewriting it in the form 


NY sin Oy = n2 sin A 


[relationship between angles of rays at the interface be- 
tween media with indices of refraction n; and ng; angles 
are defined with respect to the normal] 


makes us less likely to get the 1’s and 2’s mixed up, so this the way 
most people remember Snell’s law. A few indices of refraction are 
given in the back of the book. 


self-check E 
(1) What would the graph look like for two substances with the same 
index of refraction? 


(2) Based on the graph, when does refraction at an air-water interface 
change the direction of a ray most strongly? > Answer, p. 873 
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60°) 


h/ Example 10. 


medium 2 
medium 1 


i/A mechanical model of re- 
fraction. 


‘Finding an angle using Snell's law example 10 
> A submarine shines its searchlight up toward the surface of the 
water. What is the angle « shown in the figure? 


> The tricky part is that Snell’s law refers to the angles with re- 
spect to the normal. Forgetting this is a very common mistake. 
The beam is at an angle of 30° with respect to the normal in the 
water. Let’s refer to the air as medium 1 and the water as 2. 
Solving Snell’s law for 8;, we find 


. 4 [Ne_, 
= sin —sin 
0, =SIl (7 Sl 2) 
As mentioned above, air has an index of refraction very close to 
1, and water’s is about 1.3, so we find 0; = 40°. The angle a is 
therefore 50°. 


The index of refraction is related to the speed of light. 


What neither Snell nor Newton knew was that there is a very 
simple interpretation of the index of refraction. This may come as 
a relief to the reader who is taken aback by the complex reasoning 
involving proportionalities that led to its definition. Later experi- 
ments showed that the index of refraction of a medium was inversely 
proportional to the speed of light in that medium. Since c is defined 
as the speed of light in vacuum, and n = 1 is defined as the index 
of refraction of vacuum, we have 


C 
a 
v 

[2 = medium’s index of refraction, v = speed of light 


in that medium, c = speed of light in a vacuum] 


Many textbooks start with this as the definition of the index 
of refraction, although that approach makes the quantity’s name 
somewhat of a mystery, and leaves students wondering why c/v was 
used rather than v/c. It should also be noted that measuring angles 
of refraction is a far more practical method for determining n than 
direct measurement of the speed of light in the substance of interest. 


A mechanical model of Snell’s law 


Why should refraction be related to the speed of light? The 
mechanical model shown in the figure may help to make this more 
plausible. Suppose medium 2 is thick, sticky mud, which slows down 
the car. The car’s right wheel hits the mud first, causing the right 
side of the car to slow down. This will cause the car to turn to the 
right until is moves far enough forward for the left wheel to cross 
into the mud. After that, the two sides of the car will once again be 
moving at the same speed, and the car will go straight. 


Of course, light isn’t a car. Why should a beam of light have 
anything resembling a “left wheel” and “right wheel?” After all, 
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the mechanical model would predict that a motorcycle would go 
straight, and a motorcycle seems like a better approximation to a 
ray of light than a car. The whole thing is just a model, not a 
description of physical reality. 


yr 


motion of 
wave 2 


j/A derivation of Snell’s law. 


A derivation of Snell’s law 


However intuitively appealing the mechanical model may be, 
light is a wave, and we should be using wave models to describe 
refraction. In fact Snell’s law can be derived quite simply from 
wave concepts. Figure j shows the refraction of a water wave. The 
water in the upper left part of the tank is shallower, so the speed 
of the waves is slower there, and their wavelengths is shorter. The 
reflected part of the wave is also very faintly visible. 


In the close-up view on the right, the dashed lines are normals 
to the interface. The two marked angles on the right side are both 
equal to 6;, and the two on the left to 69. 


Trigonometry gives 
sin, = Ay/h and 
sin A = dQ f h 
Eliminating h by dividing the equations, we find 


sin 6; Ay 


sind) aS. 


The frequencies of the two waves must be equal or else they would 
get out of step, so by v = fA we know that their wavelengths are 
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k/ Total internal reflection in 
a fiber-optic cable. 


© 


1/A_ simplified drawing of a 
surgical endoscope. The first 
lens forms a real image at 
one end of a bundle of optical 
fibers. The light is transmitted 
through the bundle, and is finally 
magnified by the eyepiece. 


m/Endoscopic images of a 
duodenal ulcer. 


proportional to their velocities. Combining \ x v with v « 1/n 
gives \ x 1/n, so we find 


sin 0; ng 


sin Ap Ny 
which is one form of Snell’s law. 


Ocean waves near and far from shore example 11 
Ocean waves are formed by winds, typically on the open sea, and 
the wavefronts are perpendicular to the direction of the wind that 
formed them. At the beach, however, you have undoubtedly ob- 
served that waves tend come in with their wavefronts very nearly 
(out not exactly) parallel to the shoreline. This is because the 
speed of water waves in shallow water depends on depth: the 
shallower the water, the slower the wave. Although the change 
from the fast-wave region to the slow-wave region is gradual rather 
than abrupt, there is still refraction, and the wave motion is nearly 
perpendicular to the normal in the slow region. 


Color and refraction 


In general, the speed of light in a medium depends both on the 
medium and on the wavelength of the light. Another way of saying it 
is that a medium’s index of refraction varies with wavelength. This 
is why a prism can be used to split up a beam of white light into a 
rainbow. Each wavelength of light is refracted through a different 
angle. 


How much light is reflected, and how much is transmitted? 


In section 6.2 we developed an equation for the percentage of 
the wave energy that is transmitted and the percentage reflected at 
a boundary between media. This was only done in the case of waves 
in one dimension, however, and rather than discuss the full three di- 
mensional generalization it will be more useful to go into some qual- 
itative observations about what happens. First, reflection happens 
only at the interface between two media, and two media with the 
same index of refraction act as if they were a single medium. Thus, 
at the interface between media with the same index of refraction, 
there is no reflection, and the ray keeps going straight. Continuing 
this line of thought, it is not surprising that we observe very lit- 
tle reflection at an interface between media with similar indices of 
refraction. 


The next thing to note is that it is possible to have situations 
where no possible angle for the refracted ray can satisfy Snell’s law. 
Solving Snell’s law for 02, we find 


— myo. 
62 = sin! (= sins) F 
n2 


and if n; is greater than nz, then there will be large values of 0; 
for which the quantity (n;/n2)sin@ is greater than one, meaning 
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that your calculator will flash an error message at you when you 
try to take the inverse sine. What can happen physically in such 
a situation? The answer is that all the light is reflected, so there 
is no refracted ray. This phenomenon is known as total internal 
reflection, and is used in the fiber-optic cables that nowadays carry 
almost all long-distance telephone calls. The electrical signals from 
your phone travel to a switching center, where they are converted 
from electricity into light. From there, the light is sent across the 
country in a thin transparent fiber. The light is aimed straight into 
the end of the fiber, and as long as the fiber never goes through any 
turns that are too sharp, the light will always encounter the edge 
of the fiber at an angle sufficiently oblique to give total internal 
reflection. If the fiber-optic cable is thick enough, one can see an 
image at one end of whatever the other end is pointed at. 


Alternatively, a bundle of cables can be used, since a single thick 
cable is too hard to bend. This technique for seeing around corners 
is useful for making surgery less traumatic. Instead of cutting a 
person wide open, a surgeon can make a small “keyhole” incision 
and insert a bundle of fiber-optic cable (known as an endoscope) 
into the body. 


Since rays at sufficiently large angles with respect to the normal 
may be completely reflected, it is not surprising that the relative 
amount of reflection changes depending on the angle of incidence, 
and is greatest for large angles of incidence. 


Discussion Questions 


A What index of refraction should a fish have in order to be invisible to 
other fish? 


B Does a Surgeon using an endoscope need a source of light inside 
the body cavity? If so, how could this be done without inserting a light 
bulb through the incision? 


Cc A denser sample of a gas has a higher index of refraction than a 
less dense sample (i.e., a sample under lower pressure), but why would 
it not make sense for the index of refraction of a gas to be proportional to 
density? 

D_ The earth’s atmosphere gets thinner and thinner as you go higher in 
altitude. If a ray of light comes from a star that is below the zenith, what 
will happen to it as it comes into the earth’s atmosphere? 


E Does total internal reflection occur when light in a denser medium 
encounters a less dense medium, or the other way around? Or can it 
occur in either case? 


12.4.2 Lenses 


Figures n/1 and n/2 show examples of lenses forming images. 
There is essentially nothing for you to learn about imaging with 
lenses that is truly new. You already know how to construct and 
use ray diagrams, and you know about real and virtual images. The 
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n/1. A converging lens forms an 
image of a candle flame. 2. A di- 
verging lens. 


concept of the focal length of a lens is the same as for a curved 
mirror. The equations for locating images and determining magni- 
fications are of the same form. It’s really just a question of flexing 
your mental muscles on a few examples. The following self-checks 
and discussion questions will get you started. 


self-check F 
(1) In figures n/1 and n/2, classify the images as real or virtual. 


(2) Glass has an index of refraction that is greater than that of air. Con- 
sider the topmost ray in figure n/1. Explain why the ray makes a slight 
left turn upon entering the lens, and another left turn when it exits. 


(3) If the flame in figure n/2 was moved closer to the lens, what would 
happen to the location of the image? > Answer, p. 873 


Discussion Questions 


A_Infigures n/1 and n/2, the front and back surfaces are parallel to each 
other at the center of the lens. What will happen to a ray that enters near 
the center, but not necessarily along the axis of the lens? Draw a BIG ray 
diagram, and show a ray that comes from off axis. 


In discussion questions B-F, don’t draw ultra-detatled ray dia- 
grams as in A. 


B Suppose you wanted to change the setup in figure n/1 so that the 
location of the actual flame in the figure would instead be occupied by an 
image of a flame. Where would you have to move the candle to achieve 
this? What about in n/2? 


C_ There are three qualitatively different types of image formation that 
can occur with lenses, of which figures n/1 and n/2 exhaust only two. 
Figure out what the third possibility is. Which of the three possibilities can 
result in a magnification greater than one? 
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D Classify the examples shown in figure o according to the types of 
images delineated in discussion question C. 


E = _Infigures n/1 and n/2, the only rays drawn were those that happened 
to enter the lenses. Discuss this in relation to figure o. 


F Inthe right-hand side of figure 0, the image viewed through the lens 
is in focus, but the side of the rose that sticks out from behind the lens is 
not. Why? 


: 
ouis i ate aed a el 


o / Two images of a rose created by the same lens and recorded with the same camera. 


12.4.3 The lensmaker’s equation 


The focal length of a spherical mirror is simply r/2, but we can- 
not expect the focal length of a lens to be given by pure geometry, 
since it also depends on the index of refraction of the lens. Suppose 
we have a lens whose front and back surfaces are both spherical. aa 
(This is no great loss of generality, since any surface with a suffi- } 
ciently shallow curvature can be approximated with a sphere.) Then | 
if the lens is immersed in a medium with an index of refraction of | 


1, its focal length is given approximately by cn © eas 
-1 
i 1 i) 
p=|in-y[4—] ; 
Ty r2 
p/The radii of curvature ap- 

where n is the index of refraction and 7; and rg are the radii of pearing in the lensmaker’s 
curvature of the two surfaces of the lens. This is known as the equation. 


lensmaker’s equation. In my opinion it is not particularly worthy 
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q/The principle of least time 
applied to refraction. 


of memorization. The positive sign is used when both surfaces are 
curved outward or both are curved inward; otherwise a negative 
sign applies. The proof of this equation is left as an exercise to 
those readers who are sufficiently brave and motivated. 


12.4.4 The principle of least time for refraction 


We have seen previously how the rules governing straight-line 
motion of light and reflection of light can be derived from the prin- 
ciple of least time. What about refraction? In the figure, it is indeed 
plausible that the bending of the ray serves to minimize the time 
required to get from a point A to point B. If the ray followed the un- 
bent path shown with a dashed line, it would have to travel a longer 
distance in the medium in which its speed is slower. By bending 
the correct amount, it can reduce the distance it has to cover in the 
slower medium without going too far out of its way. It is true that 
Snell’s law gives exactly the set of angles that minimizes the time 
required for light to get from one point to another. The proof of 
this fact is left as an exercise (problem 38, p. 753). 
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12.5 Wave Optics 


Electron microscopes can make images of individual atoms, but why 
will a visible-light microscope never be able to? Stereo speakers 
create the illusion of music that comes from a band arranged in 
your living room, but why doesn’t the stereo illusion work with bass 
notes? Why are computer chip manufacturers investing billions of 
dollars in equipment to etch chips with x-rays instead of visible 
light? 


The answers to all of these questions have to do with the subject 
of wave optics. So far this book has discussed the interaction of 
light waves with matter, and its practical applications to optical 
devices like mirrors, but we have used the ray model of light almost 
exclusively. Hardly ever have we explicitly made use of the fact that 
light is an electromagnetic wave. We were able to get away with the 
simple ray model because the chunks of matter we were discussing, 
such as lenses and mirrors, were thousands of times larger than a 
wavelength of light. We now turn to phenomena and devices that 
can only be understood using the wave model of light. 


12.5.1 Diffraction 


Figure a shows a typical problem in wave optics, enacted with 
water waves. It may seem surprising that we don’t get a simple 
pattern like figure b, but the pattern would only be that simple 
if the wavelength was hundreds of times shorter than the distance 
between the gaps in the barrier and the widths of the gaps. 


Wave optics is a broad subject, but this example will help us 
to pick out a reasonable set of restrictions to make things more 
manageable: 


(1) We restrict ourselves to cases in which a wave travels through 
a uniform medium, encounters a certain area in which the medium 
has different properties, and then emerges on the other side into a 
second uniform region. 


(2) We assume that the incoming wave is a nice tidy sine-wave 
pattern with wavefronts that are lines (or, in three dimensions, 
planes). 


(3) In figure a we can see that the wave pattern immediately 
beyond the barrier is rather complex, but farther on it sorts itself 
out into a set of wedges separated by gaps in which the water is 
still. We will restrict ourselves to studying the simpler wave patterns 
that occur farther away, so that the main question of interest is how 
intense the outgoing wave is at a given angle. 


The kind of phenomenon described by restriction (1) is called 
diffraction. Diffraction can be defined as the behavior of a wave 
when it encounters an obstacle or a nonuniformity in its medium. 
In general, diffraction causes a wave to bend around obstacles and 


a/In this view from overhead, a 
straight, sinusoidal water wave 
encounters a barrier with two 
gaps in it. Strong wave vibration 
occurs at angles X and Z, but 
there is none at all at angle Y. 
(The figure has been retouched 
from a real photo of water waves. 
In reality, the waves beyond the 
barrier would be much weaker 
than the ones before it, and they 
would therefore be difficult to 
see.) 


b/ This doesn’t happen. 
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c/A practical, low-tech setup for 
observing diffraction of light. 


d/The bottom figure is sim- 
ply a copy of the middle portion 
of the top one, scaled up by a 
factor of two. All the angles are 
the same. Physically, the angular 
pattern of the diffraction fringes 
can't be any different if we scale 
both A and d by the same factor, 
leaving A/d unchanged. 


make patterns of strong and weak waves radiating out beyond the 
obstacle. Understanding diffraction is the central problem of wave 
optics. If you understand diffraction, even the subset of diffraction 
problems that fall within restrictions (2) and (3), the rest of wave 
optics is icing on the cake. 


Diffraction can be used to find the structure of an unknown 
diffracting object: even if the object is too small to study with 
ordinary imaging, it may be possible to work backward from the 
diffraction pattern to learn about the object. The structure of a 
crystal, for example, can be determined from its x-ray diffraction 
pattern. 


Diffraction can also be a bad thing. In a telescope, for example, 
light waves are diffracted by all the parts of the instrument. This will 
cause the image of a star to appear fuzzy even when the focus has 
been adjusted correctly. By understanding diffraction, one can learn 
how a telescope must be designed in order to reduce this problem 
— essentially, it should have the biggest possible diameter. 


There are two ways in which restriction (2) might commonly be 
violated. First, the light might be a mixture of wavelengths. If we 
simply want to observe a diffraction pattern or to use diffraction as 
a technique for studying the object doing the diffracting (e.g., if the 
object is too small to see with a microscope), then we can pass the 
light through a colored filter before diffracting it. 


A second issue is that light from sources such as the sun or a 
lightbulb does not consist of a nice neat plane wave, except over 
very small regions of space. Different parts of the wave are out of 
step with each other, and the wave is referred to as incoherent. One 
way of dealing with this is shown in figure c. After filtering to select 
a certain wavelength of red light, we pass the light through a small 
pinhole. The region of the light that is intercepted by the pinhole is 
so small that one part of it is not out of step with another. Beyond 
the pinhole, light spreads out in a spherical wave; this is analogous 
to what happens when you speak into one end of a paper towel roll 
and the sound waves spread out in all directions from the other end. 
By the time the spherical wave gets to the double slit it has spread 
out and reduced its curvature, so that we can now think of it as a 
simple plane wave. 


If this seems laborious, you may be relieved to know that modern 
technology gives us an easier way to produce a single-wavelength, 
coherent beam of light: the laser. 


The parts of the final image on the screen in c are called diffrac- 
tion fringes. The center of each fringe is a point of maximum bright- 
ness, and halfway between two fringes is a minimum. 


Discussion Question 


A’ Why would x-rays rather than visible light be used to find the structure 
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of a crystal? Sound waves are used to make images of fetuses in the 
womb. What would influence the choice of wavelength? 


12.5.2 Scaling of diffraction 


This chapter has “optics” in its title, so it is nominally about 
light, but we started out with an example involving water waves. 
Water waves are certainly easier to visualize, but is this a legitimate 
comparison? In fact the analogy works quite well, despite the fact 
that a light wave has a wavelength about a million times shorter. 
This is because diffraction effects scale uniformly. That is, if we 
enlarge or reduce the whole diffraction situation by the same factor, 
including both the wavelengths and the sizes of the obstacles the 
wave encounters, the result is still a valid solution. 


This is unusually simple behavior! In subsection 0.2.2 we saw 
many examples of more complex scaling, such as the impossibility 
of bacteria the size of dogs, or the need for an elephant to eliminate 
heat through its ears because of its small surface-to-volume ratio, 
whereas a tiny shrew’s life-style centers around conserving its body 
heat. 


Of course water waves and light waves differ in many ways, not 
just in scale, but the general facts you will learn about diffraction are 
applicable to all waves. In some ways it might have been more ap- 
propriate to insert this chapter after section 6.2 on bounded waves, 
but many of the important applications are to light waves, and you 
would probably have found these much more difficult without any 
background in optics. 


Another way of stating the simple scaling behavior of diffraction 
is that the diffraction angles we get depend only on the unitless ratio 
A/d, where \ is the wavelength of the wave and d is some dimension 
of the diffracting objects, e.g., the center-to-center spacing between 
the slits in figure a. If, for instance, we scale up both A and d by a 
factor of 37, the ratio A/d will be unchanged. 


12.5.3 The correspondence principle 


The only reason we don’t usually notice diffraction of light in 
everyday life is that we don’t normally deal with objects that are 
comparable in size to a wavelength of visible light, which is about a 
millionth of a meter. Does this mean that wave optics contradicts 
ray optics, or that wave optics sometimes gives wrong results? No. 
If you hold three fingers out in the sunlight and cast a shadow 
with them, either wave optics or ray optics can be used to predict 
the straightforward result: a shadow pattern with two bright lines 
where the light has gone through the gaps between your fingers. 
Wave optics is a more general theory than ray optics, so in any case 
where ray optics is valid, the two theories will agree. This is an e/Christiaan Huygens (1629- 
example of a general idea enunciated by the physicist Niels Bohr, 1695). 
called the correspondence principle: when flaws in a physical theory 
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g/A wavefront can be analyzed 
by the principle of superposition, 
breaking it down into many small 
parts. 


h/l\f it was by itself, each of 
the parts would spread out as a 
circular ripple. 


i/Adding up the ripples pro- 
duces a new wavefront. 


lead to the creation of a new and more general theory, the new 
theory must still agree with the old theory within its more restricted 
area of applicability. After all, a theory is only created as a way of 
describing experimental observations. If the original theory had not 
worked in any cases at all, it would never have become accepted. 


In the case of optics, the correspondence principle tells us that 
when A/d is small, both the ray and the wave model of light must 
give approximately the same result. Suppose you spread your fingers 
and cast a shadow with them using a coherent light source. The 
quantity A/d is about 10 -4, so the two models will agree very closely. 
(To be specific, the shadows of your fingers will be outlined by a 
series of light and dark fringes, but the angle subtended by a fringe 
will be on the order of 10~* radians, so they will be invisible and 
washed out by the natural fuzziness of the edges of sun-shadows, 
caused by the finite size of the sun.) 


self-check G 

What kind of wavelength would an electromagnetic wave have to have 
in order to diffract dramatically around your body? Does this contradict 
the correspondence principle? > Answer, p. 873 


12.5.4 Huygens’ principle 


Returning to the example of double-slit diffraction, f, note the 
strong visual impression of two overlapping sets of concentric semi- 
circles. This is an example of Huygens’ principle, named after a 
Dutch physicist and astronomer. (The first syllable rhymes with 
“boy.” ) Huygens’ principle states that any wavefront can be broken 
down into many small side-by-side wave peaks, g, which then spread 
out as circular ripples, h, and by the principle of superposition, the 
result of adding up these sets of ripples must give the same result 
as allowing the wave to propagate forward, i. In the case of sound 
or light waves, which propagate in three dimensions, the “ripples” 
are actually spherical rather than circular, but we can often imagine 
things in two dimensions for simplicity. 


In double-slit diffraction the application of Huygens’ principle is 
visually convincing: it is as though all the sets of ripples have been 
blocked except for two. It is a rather surprising mathematical fact, 
however, that Huygens’ principle gives the right result in the case of 
an unobstructed linear wave, h andi. A theoretically infinite number 
of circular wave patterns somehow conspire to add together and 
produce the simple linear wave motion with which we are familiar. 


Since Huygens’ principle is equivalent to the principle of super- 
position, and superposition is a property of waves, what Huygens 
had created was essentially the first wave theory of light. However, 
he imagined light as a series of pulses, like hand claps, rather than 
as a sinusoidal wave. 


The history is interesting. Isaac Newton loved the atomic theory 
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of matter so much that he searched enthusiastically for evidence that 
light was also made of tiny particles. The paths of his light particles 
would correspond to rays in our description; the only significant 
difference between a ray model and a particle model of light would 
occur if one could isolate individual particles and show that light 
had a “graininess” to it. Newton never did this, so although he 
thought of his model as a particle model, it is more accurate to say 
he was one of the builders of the ray model. 


Almost all that was known about reflection and refraction of 
light could be interpreted equally well in terms of a particle model 
or a wave model, but Newton had one reason for strongly opposing 
Huygens’ wave theory. Newton knew that waves exhibited diffrac- 
tion, but diffraction of light is difficult to observe, so Newton be- 
lieved that light did not exhibit diffraction, and therefore must not 
be a wave. Although Newton’s criticisms were fair enough, the de- 
bate also took on the overtones of a nationalistic dispute between 
England and continental Europe, fueled by English resentment over 
Leibniz’s supposed plagiarism of Newton’s calculus. Newton wrote 
a book on optics, and his prestige and political prominence tended 
to discourage questioning of his model. 


Thomas Young (1773-1829) was the person who finally, a hun- 
dred years later, did a careful search for wave interference effects 
with light and analyzed the results correctly. He observed double- 
slit diffraction of light as well as a variety of other diffraction ef- 
fects, all of which showed that light exhibited wave interference ef- 
fects, and that the wavelengths of visible light waves were extremely 
short. The crowning achievement was the demonstration by the ex- 
perimentalist Heinrich Hertz and the theorist James Clerk Maxwell 
that light was an electromagnetic wave. Maxwell is said to have re- 
lated his discovery to his wife one starry evening and told her that 
she was the only other person in the world who knew what starlight 
was. 


12.5.5 Double-slit diffraction 


Let’s now analyze double-slit diffraction, k, using Huygens’ prin- 
ciple. The most interesting question is how to compute the angles 
such as X and Z where the wave intensity is at a maximum, and 
the in-between angles like Y where it is minimized. Let’s measure 
all our angles with respect to the vertical center line of the figure, 
which was the original direction of propagation of the wave. 


If we assume that the width of the slits is small (on the order 
of the wavelength of the wave or less), then we can imagine only a 
single set of Huygens ripples spreading out from each one, 1. White 
lines represent peaks, black ones troughs. The only dimension of the 
diffracting slits that has any effect on the geometric pattern of the 
overlapping ripples is then the center-to-center distance, d, between 
the slits. 


j/ Thomas Young 


k / Double-slit diffraction. 


XYZ 


4 


|/Use of Huygens’ principle. 


interference 


m / Constructive 
along the center-line. 
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n/The waves travel distances L 
and L’ from the two slits to get 
to the same point in space, at an 
angle 0 from the center line. 


L-L' 


d 


0/A close-up view of figure 
n, showing how the path length 
difference L — L’ is related to d 
and to the angle 0. 


We know from our discussion of the scaling of diffraction that 
there must be some equation that relates an angle like @z to the 
ratio A/d, 

r 

— 696 

q Z 

If the equation for 6z depended on some other expression such as 
+d or \?/d, then it would change when we scaled \ and d by the 
same factor, which would violate what we know about the scaling 
of diffraction. 


Along the central maximum line, X, we always have positive 
waves coinciding with positive ones and negative waves coinciding 
with negative ones. (I have arbitrarily chosen to take a snapshot of 
the pattern at a moment when the waves emerging from the slit are 
experiencing a positive peak.) The superposition of the two sets of 
ripples therefore results in a doubling of the wave amplitude along 
this line. There is constructive interference. This is easy to explain, 
because by symmetry, each wave has had to travel an equal number 
of wavelengths to get from its slit to the center line, m: Because 
both sets of ripples have ten wavelengths to cover in order to reach 
the point along direction X, they will be in step when they get there. 


At the point along direction Y shown in the same figure, one 
wave has traveled ten wavelengths, and is therefore at a positive 
extreme, but the other has traveled only nine and a half wavelengths, 
so it at a negative extreme. There is perfect cancellation, so points 
along this line experience no wave motion. 


But the distance traveled does not have to be equal in order to 
get constructive interference. At the point along direction Z, one 
wave has gone nine wavelengths and the other ten. They are both 
at a positive extreme. 


self-check H 
At a point half a wavelength below the point marked along direction X, 
carry out a similar analysis. > Answer, p. 873 


To summarize, we will have perfect constructive interference at 
any point where the distance to one slit differs from the distance to 
the other slit by an integer number of wavelengths. Perfect destruc- 
tive interference will occur when the number of wavelengths of path 
length difference equals an integer plus a half. 


Now we are ready to find the equation that predicts the angles 
of the maxima and minima. The waves travel different distances 
to get to the same point in space, n. We need to find whether the 
waves are in phase (in step) or out of phase at this point in order to 
predict whether there will be constructive interference, destructive 
interference, or something in between. 


One of our basic assumptions in this chapter is that we will only 
be dealing with the diffracted wave in regions very far away from the 
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object that diffracts it, so the triangle is long and skinny. Most real- 
world examples with diffraction of light, in fact, would have triangles 
with even skinner proportions than this one. The two long sides are 
therefore very nearly parallel, and we are justified in drawing the 
right triangle shown in figure o, labeling one leg of the right triangle 
as the difference in path length , L—L’, and labeling the acute angle 
as 9. (In reality this angle is a tiny bit greater than the one labeled 
6 in figure n.) 


The difference in path length is related to d and 0 by the equation 


L-L' 


= sind 


Constructive interference will result in a maximum at angles for 
which L — L’ is an integer number of wavelengths, 
L-L'=m\ [condition for a maximum; 


m is an integer] 


Here m equals 0 for the central maximum, —1 for the first maximum 
to its left, +2 for the second maximum on the right, etc. Putting 
all the ingredients together, we find mA/d = sin@, or 


A sind a . 
= : [condition for a maximum; 
d m 


m is an integer] 
Similarly, the condition for a minimum is 


A sind - _ 
== : [condition for a minimum; 
d m 


m is an integer plus 1/2] 
That is, the minima are about halfway between the maxima. 


As expected based on scaling, this equation relates angles to the 
unitless ratio \/d. Alternatively, we could say that we have proven 
the scaling property in the special case of double-slit diffraction. It 
was inevitable that the result would have these scaling properties, 
since the whole proof was geometric, and would have been equally 
valid when enlarged or reduced on a photocopying machine! 


Counterintuitively, this means that a diffracting object with 
smaller dimensions produces a bigger diffraction pattern, p. 


p/Cutting d in half doubles 
the angles of the diffraction 
fringes. 


q/Double-slit diffraction _ pat- 
terns of long-wavelength red light 
(top) and short-wavelength blue 
light (bottom). 
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Double-slit diffraction of blue and red light example 12 
Blue light has a shorter wavelength than red. For a given double- 
slit spacing d, the smaller value of A/d for leads to smaller values 
of sin8, and therefore to a more closely spaced set of diffraction 
fringes, (g) 


The correspondence principle example 13 
Let’s also consider how the equations for double-slit diffraction 
relate to the correspondence principle. When the ratio A/d is very 
small, we should recover the case of simple ray optics. Now if A/d 
is small, sin6@ must be small as well, and the spacing between 
the diffraction fringes will be small as well. Although we have not 
proven it, the central fringe is always the brightest, and the fringes 
get dimmer and dimmer as we go farther from it. For small values 
of A/d, the part of the diffraction pattern that is bright enough to 
be detectable covers only a small range of angles. This is exactly 
what we would expect from ray optics: the rays passing through 
the two slits would remain parallel, and would continue moving 
in the 8 = 0 direction. (In fact there would be images of the two 
separate slits on the screen, but our analysis was all in terms of 
angles, so we should not expect it to address the issue of whether 
there is structure within a set of rays that are all traveling in the 
8 = 0 direction.) 


Spacing of the fringes at small angles example 14 
At small angles, we can use the approximation sin @ ~ 9, which 
is valid if 8 is measured in radians. The equation for double-slit 
diffraction becomes simply 


which can be solved for 6 to give 


_m 
are) 


The difference in angle between successive fringes is the change 
in 8 that results from changing m by plus or minus one, 


S) 


A@ = — 
d 

For example, if we write @7 for the angle of the seventh bright 
fringe on one side of the central maximum and 8g for the neigh- 
boring one, we have 


8A 7A 

ae ae ane 
feu 

-2 


and similarly for any other neighboring pair of fringes. 


Although the equation A/d = sin@/m is only valid for a double 
slit, it is can still be a guide to our thinking even if we are observing 
diffraction of light by a virus or a flea’s leg: it is always true that 


(1) large values of \/d lead to a broad diffraction pattern, and 
(2) diffraction patterns are repetitive. 


In many cases the equation looks just like A/d = sin@/m but 
with an extra numerical factor thrown in, and with d interpreted as 
some other dimension of the object, e.g., the diameter of a piece of 
wire. 


12.5.6 Repetition 


Suppose we replace a double slit with a triple slit, r. We can 
think of this as a third repetition of the structures that were present 
in the double slit. Will this device be an improvement over the 
double slit for any practical reasons? 


The answer is yes, as can be shown using figure t. For ease 
of visualization, I have violated our usual rule of only considering 
points very far from the diffracting object. The scale of the drawing 
is such that a wavelengths is one cm. In t/1, all three waves travel 
an integer number of wavelengths to reach the same point, so there 
is a bright central spot, as we would expect from our experience 
with the double slit. In figure t/2, we show the path lengths to 
a new point. This point is farther from slit A by a quarter of a 
wavelength, and correspondingly closer to slit C. The distance from 
slit B has hardly changed at all. Because the paths lengths traveled 
from slits A and C differ by half a wavelength, there will be perfect 
destructive interference between these two waves. There is still some 
uncanceled wave intensity because of slit B, but the amplitude will 
be three times less than in figure t/1, resulting in a factor of 9 
decrease in brightness. Thus, by moving off to the right a little, we 
have gone from the bright central maximum to a point that is quite 
dark. 


Now let’s compare with what would have happened if slit C had 
been covered, creating a plain old double slit. The waves coming 
from slits A and B would have been out of phase by 0.23 wavelengths, 
but this would not have caused very severe interference. The point 
in figure t/2 would have been quite brightly lit up. 


To summarize, we have found that adding a third slit narrows 
down the central fringe dramatically. The same is true for all the 
other fringes as well, and since the same amount of energy is con- 


r/A triple slit. 


diffraction 


s/A double-slit 
pattern (top), and a pattern made 
by five slits (bottom). 
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5.25A 4.75A 
4.02A 


Cc 


> 
o 
fo) 


t/1. There is a bright central maximum. 2. At this point just off the central maximum, the path lengths 
traveled by the three waves have changed. 


u / Single-slit diffraction of 


water waves. 


v / Single-slit diffraction of 
red light. Note the double width 
of the central maximum. 


w/A_ pretty good simulation 
of the single-slit pattern of figure 
u, made by using three motors to 
produce overlapping ripples from 
three neighboring points in the 
water. 


centrated in narrower diffraction fringes, each fringe is brighter and 
easier to see, Ss. 


This is an example of a more general fact about diffraction: if 
some feature of the diffracting object is repeated, the locations of 
the maxima and minima are unchanged, but they become narrower. 


Taking this reasoning to its logical conclusion, a diffracting ob- 
ject with thousands of slits would produce extremely narrow fringes. 
Such an object is called a diffraction grating. 


12.5.7 Single-slit diffraction 


If we use only a single slit, is there diffraction? If the slit is not 
wide compared to a wavelength of light, then we can approximate 
its behavior by using only a single set of Huygens ripples. There 
are no other sets of ripples to add to it, so there are no constructive 
or destructive interference effects, and no maxima or minima. The 
result will be a uniform spherical wave of light spreading out in all 
directions, like what we would expect from a tiny lightbulb. We 
could call this a diffraction pattern, but it is a completely feature- 
less one, and it could not be used, for instance, to determine the 
wavelength of the light, as other diffraction patterns could. 


All of this, however, assumes that the slit is narrow compared to 
a wavelength of light. If, on the other hand, the slit is broader, there 
will indeed be interference among the sets of ripples spreading out 
from various points along the opening. Figure u shows an example 
with water waves, and figure v with light. 


self-check | 
How does the wavelength of the waves compare with the width of the 
slit in figure u? > Answer, p. 873 


We will not go into the details of the analysis of single-slit diffrac- 
tion, but let us see how its properties can be related to the general 
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things we’ve learned about diffraction. We know based on scaling 
arguments that the angular sizes of features in the diffraction pat- 
tern must be related to the wavelength and the width, a, of the slit 
by some relationship of the form 


a + 6 

a 
This is indeed true, and for instance the angle between the maximum 
of the central fringe and the maximum of the next fringe on one side 
equals 1.5\/a. Scaling arguments will never produce factors such as 
the 1.5, but they tell us that the answer must involve \/a, so all the 
familiar qualitative facts are true. For instance, shorter-wavelength 
light will produce a more closely spaced diffraction pattern. 


An important scientific example of single-slit diffraction is in 
telescopes. Images of individual stars, as in figure x, are a good way 
to examine diffraction effects, because all stars except the sun are so 
far away that no telescope, even at the highest magnification, can 
image their disks or surface features. Thus any features of a star’s 
image must be due purely to optical effects such as diffraction. A 
prominent cross appears around the brightest star, and dimmer ones 
surround the dimmer stars. Something like this is seen in most tele- 
scope photos, and indicates that inside the tube of the telescope 
there were two perpendicular struts or supports. Light diffracted 
around these struts. You might think that diffraction could be elim- 
inated entirely by getting rid of all obstructions in the tube, but the 
circles around the stars are diffraction effects arising from single- 
slit diffraction at the mouth of the telescope’s tube! (Actually we 
have not even talked about diffraction through a circular opening, 
but the idea is the same.) Since the angular sizes of the diffracted 
images depend on A/a, the only way to improve the resolution of 
the images is to increase the diameter, a, of the tube. This is one 
of the main reasons (in addition to light-gathering power) why the 
best telescopes must be very large in diameter. 


self-check J 

What would this imply about radio telescopes as compared with visible- 
light telescopes? > Answer, p. 
873 


Double-slit diffraction is easier to understand conceptually than 
single-slit diffraction, but if you do a double-slit diffraction experi- 
ment in real life, you are likely to encounter a complicated pattern 
like figure z/1, rather than the simpler one, 2, you were expecting. 
This is because the slits are fairly big compared to the wavelength 
of the light being used. We really have two different distances in 
our pair of slits: d, the distance between the slits, and w, the width 
of each slit. Remember that smaller distances on the object the 
light diffracts around correspond to larger features of the diffraction 
pattern. The pattern 1 thus has two spacings in it: a short spac- 


x/An image of the Pleiades 
star cluster. The circular rings 
around the bright stars are due to 
single-slit diffraction at the mouth 
of the telescope’s tube. 


y / A radio telescope. 
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ing corresponding to the large distance d, and a long spacing that 
relates to the small dimension w. 


z/1. A diffraction pattern formed by a real double slit. The width of each slit is fairly big compared to 


Mme. This is not a real photo. 3. A real photo of a single-slit diffraction pattern caused by 


; A\same as the widths of the slits used to make the top pattern. 


aa/Light could take 
different paths from A to B. 


many 


Discussion Question 


A Why is it optically impossible for bacteria to evolve eyes that use 
visible light to form images? 


The principle of least time 


In subsection 12.1.5 and 12.4.4, we saw how in the ray model 
of light, both refraction and reflection can be described in an ele- 
gant and beautiful way by a single principle, the principle of least 
time. We can now justify the principle of least time based on the 
wave model of light. Consider an example involving reflection, aa. 
Starting at point A, Huygens’ principle for waves tells us that we 
can think of the wave as spreading out in all directions. Suppose we 
imagine all the possible ways that a ray could travel from A to B. 
We show this by drawing 25 possible paths, of which the central one 
is the shortest. Since the principle of least time connects the wave 
model to the ray model, we should expect to get the most accurate 
results when the wavelength is much shorter than the distances in- 
volved — for the sake of this numerical example, let’s say that a 
wavelength is 1/10 of the shortest reflected path from A to B. The 
table, 2, shows the distances traveled by the 25 rays. 


Note how similar are the distances traveled by the group of 7 
rays, indicated with a bracket, that come closest to obeying the 
principle of least time. If we think of each one as a wave, then 
all 7 are again nearly in phase at point B. However, the rays that 
are farther from satisfying the principle of least time show more 
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rapidly changing distances; on reuniting at point B, their phases 
are a random jumble, and they will very nearly cancel each other 
out. Thus, almost none of the wave energy delivered to point B 
goes by these longer paths. Physically we find, for instance, that 
a wave pulse emitted at A is observed at B after a time interval 
corresponding very nearly to the shortest possible path, and the 
pulse is not very “smeared out” when it gets there. The shorter 
the wavelength compared to the dimensions of the figure, the more 
accurate these approximate statements become. 


Instead of drawing a finite number of rays, such 25, what hap- 
pens if we think of the angle, 6, of emission of the ray as a continu- 
ously varying variable? Minimizing the distance L requires 


aL 
dé 

Because L is changing slowly in the vicinity of the angle that 
satisfies the principle of least time, all the rays that come out close 
to this angle have very nearly the same L, and remain very nearly in 
phase when they reach B. This is the basic reason why the discrete 
table, aa/2, turned out to have a group of rays that all traveled 
nearly the same distance. 


As discussed in subsection 12.1.5, the principle of least time 
is really a principle of least or greatest time. This makes perfect 
sense, since dL/d@ = 0 can in general describe either a minimum or 
a maximum 


The principle of least time is very general. It does not apply just 
to refraction and reflection — it can even be used to prove that light 
rays travel in a straight line through empty space, without taking 
detours! This general approach to wave motion was used by Richard 
Feynman, one of the pioneers who in the 1950’s reconciled quantum 
mechanics with relativity. A very readable explanation is given in 
a book Feynman wrote for laypeople, QED: The Strange Theory of 
Light and Matter. 


Section 12.5 Wave Optics 


745 


746 


Problems 
The symbols v, “, etc. are explained on page 760. 


1 Draw a ray diagram showing why a small light source (a 
candle, say) produces sharper shadows than a large one (e.g., a long 
fluorescent bulb). a 


2 A Global Positioning System (GPS) receiver is a device that 
lets you figure out where you are by receiving timed radio signals 
from satellites. It works by measuring the travel time for the signals, 
which is related to the distance between you and the satellite. By 
finding the ranges to several different satellites in this way, it can 
pin down your location in three dimensions to within a few meters. 
How accurate does the measurement of the time delay have to be to 
determine your position to this accuracy? E 


3 Estimate the frequency of an electromagnetic wave whose 
wavelength is similar in size to an atom (about a nm). Referring 
back to figure n on p. 657, in what part of the electromagnetic 
spectrum would such a wave lie (infrared, gamma-rays,...)? = © 


4 The Stealth bomber is designed with flat, smooth surfaces. 
Why would this make it difficult to detect via radar? E 


5 The natives of planet Wumpus play pool using light rays on 
an eleven-sided table with mirrors for bumpers, shown in the figure 
on the next page. Trace this shot accurately with a ruler to reveal 
the hidden message. To get good enough accuracy, you’ll need to 
photocopy the page (or download the book and print the page) and 
construct each reflection using a protractor. E 


Problem 5. 


6 The figure on the next page shows a curved (parabolic) mirror, 
with three parallel light rays coming toward it. One ray is approach- 
ing along the mirror’s center line. (a) Trace the drawing accurately, 
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and continue the light rays until they are about to undergo their 
second reflection. To get good enough accuracy, you'll need to pho- 
tocopy the page (or download the book and print the page) and 
draw in the normal at each place where a ray is reflected. What 
do you notice? (b) Make up an example of a practical use for this 
device. (c) How could you use this mirror with a small lightbulb to 
produce a parallel beam of light rays going off to the right? E 


Problem 6. 


7 A man is walking at 1.0 m/s directly towards a flat mirror. At 
what speed is his separation from his image decreasing? vo 


8 If a mirror on a wall is only big enough for you to see your- 
self from your head down to your waist, can you see your entire 
body by backing up? Test this experimentally and come up with an 
explanation for your observations, including a ray diagram. 


Note that when you do the experiment, it’s easy to confuse your- 
self if the mirror is even a tiny bit off of vertical. One way to check 
yourself is to artificially lower the top of the mirror by putting a 
piece of tape or a post-it note where it blocks your view of the top 
of your head. You can then check whether you are able to see more 
of yourself both above and below by backing up. & 


9 In this chapter we’ve only done examples of mirrors with 
hollowed-out shapes (called concave mirrors). Now draw a ray dia- 
gram for a curved mirror that has a bulging outward shape (called a 
convex mirror). (a) How does the image’s distance from the mirror 
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compare with the actual object’s distance from the mirror? From 
this comparison, determine whether the magnification is greater 
than or less than one. (b) Is the image real, or virtual? Could 
this mirror ever make the other type of image? = 


10 As discussed in question 9, there are two types of curved 
mirrors, concave and convex. Make a list of all the possible com- 
binations of types of images (virtual or real) with types of mirrors 
(concave and convex). (Not all of the four combinations are phys- 
ically possible.) Now for each one, use ray diagrams to determine 
whether increasing the distance of the object from the mirror leads 
to an increase or a decrease in the distance of the image from the 
mirror. 


Draw BIG ray diagrams! Each diagram should use up about half 
a page of paper. 


Some tips: To draw a ray diagram, you need two rays. For one 
of these, pick the ray that comes straight along the mirror’s axis, 
since its reflection is easy to draw. After you draw the two rays and 
locate the image for the original object position, pick a new object 
position that results in the same type of image, and start a new ray 
diagram, in a different color of pen, right on top of the first one. For 
the two new rays, pick the ones that just happen to hit the mirror at 
the same two places; this makes it much easier to get the result right 
without depending on extreme accuracy in your ability to draw the 
reflected rays. z 


11 If the user of an astronomical telescope moves her head 
closer to or farther away from the image she is looking at, does 
the magnification change? Does the angular magnification change? 
Explain. (For simplicity, assume that no eyepiece is being used.) 


~~ 


12 In figure g/2 in on page 706, only the image of my fore- 
head was located by drawing rays. Either photocopy the figure or 
download the book and print out the relevant page. On this copy 
of the figure, make a new set of rays coming from my chin, and 
locate its image. To make it easier to judge the angles accurately, 
draw rays from the chin that happen to hit the mirror at the same 
points where the two rays from the forehead were shown hitting it. 
By comparing the locations of the chin’s image and the forehead’s 
image, verify that the image is actually upside-down, as shown in 


the original figure. & 
13 The figure shows four points where rays cross. Of these, 
which are image points? Explain. z 


14 Here’s a game my kids like to play. I sit next to a sunny 
window, and the sun reflects from the glass on my watch, making a 
disk of light on the wall or floor, which they pretend to chase as I 
move it around. Is the spot a disk because that’s the shape of the 
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sun, or because it’s the shape of my watch? In other words, would 
a square watch make a square spot, or do we just have a circular 
image of the circular sun, which will be circular no matter what? 


~ 


15 Apply the equation M = d;/d, to the case of a flat mirror. 


~— 


16 Use the method described in the text to derive the equation 
relating object distance to image distance for the case of a virtual 
image produced by a converging mirror. > Solution, p. 883 © 


17 Find the focal length of the mirror in problem 6 in chapter 
2, ae 


18 ~~ Rank the focal lengths of the mirrors, from shortest to longest. 


~ 


19 (a) A converging mirror with a focal length of 20 cm is used 
to create an image, using an object at a distance of 10 cm. Is the 
image real, or is it virtual? (b) How about f = 20 cm and d, = 30 
cm? (c) What if it was a diverging mirror with f = 20 cm and 
do = 10 cm? (d) A diverging mirror with f = 20 cm and dy = 30 
cm? > Solution, p. 883 & 


20 (a) Make up a numerical example of a virtual image formed by 
a converging mirror with a certain focal length, and determine the 
magnification. (You will need the result of problem 2.) Make sure 
to choose values of d, and f that would actually produce a virtual 
image, not a real one. Now change the location of the object a little 
bit and redetermine the magnification, showing that it changes. At 
my local department store, the cosmetics department sells mirrors 
advertised as giving a magnification of 5 times. How would you 
interpret this? 


(b) Suppose a Newtonian telescope is being used for astronom- 
ical observing. Assume for simplicity that no eyepiece is used, and 
assume a value for the focal length of the mirror that would be 
reasonable for an amateur instrument that is to fit in a closet. Is 
the angular magnification different for objects at different distances? 
For example, you could consider two planets, one of which is twice 
as far as the other. E 


21 (a) Find a case where the magnification of a curved mirror 
is infinite. Is the angular magnification infinite from any realistic 
viewing position? (b) Explain why an arbitrarily large magnification 
can’t be achieved by having a sufficiently small value of do. g 


22 A concave surface that reflects sound waves can act just like 
a converging mirror. Suppose that, standing near such a surface, 
you are able to find a point where you can place your head so that 
your own whispers are focused back on your head, so that they 
sound loud to you. Given your distance to the surface, what is the 


Problem 18. 
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surface’s focal length? E 


23 The figure shows a device for constructing a realistic optical 
illusion. Two mirrors of equal focal length are put against each 
other with their silvered surfaces facing inward. A small object 
placed in the bottom of the cavity will have its image projected in 
the air above. The way it works is that the top mirror produces a 
virtual image, and the bottom mirror then creates a real image of 
the virtual image. (a) Show that if the image is to be positioned 
as shown, at the mouth of the cavity, then the focal length of the 
mirrors is related to the dimension h via the equation 


1 1 1 


ae ey 


(b) Restate the equation in terms of a single variable x = h/f, and 
show that there are two solutions for z. Which solution is physically 
consistent with the assumptions of the calculation? > 


24 (a) A converging mirror is being used to create a virtual 
image. What is the range of possible magnifications? (b) Do the 
same for the other types of images that can be formed by curved 
mirrors (both converging and diverging). a 


25 A diverging mirror of focal length f is fixed, and faces down. 
An object is dropped from the surface of the mirror, and falls away 
from it with acceleration g. The goal of the problem is to find the 
maximum velocity of the image. 

(a) Describe the motion of the image verbally, and explain why we 
should expect there to be a maximum velocity. 

(b) Use arguments based on units to determine the form of the 
solution, up to an unknown unitless multiplicative constant. 

(c) Complete the solution by determining the unitless constant. 


~~ 


26 Diamond has an index of refraction of 2.42, and part of 
the reason diamonds sparkle is that this encourages a light ray to 
undergo many total internal reflections before it emerges. (a) Cal- 
culate the critical angle at which total internal reflection occurs in 
diamond. (b) Explain the interpretation of your result: Is it mea- 
sured from the normal, or from the surface? Is it a minimum, or a 
maximum? How would the critical angle have been different for a 


substance such as glass or plastic, with a lower index of refraction? 
vo 


27 Suppose a converging lens is constructed of a type of plastic 
whose index of refraction is less than that of water. How will the 
lens’s behavior be different if it is placed underwater? z 


28 There are two main types of telescopes, refracting (using 
lenses) and reflecting (using mirrors). (Some telescopes use a mix- 
ture of the two types of elements: the light first encounters a large 
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curved mirror, and then goes through an eyepiece that is a lens.) 
What implications would the color-dependence of focal length have 
for the relative merits of the two types of telescopes? Describe the 
case where an image is formed of a white star. You may find it 
helpful to draw a ray diagram. g 


29 Based on Snell’s law, explain why rays of light passing through 
the edges of a converging lens are bent more than rays passing 
through parts closer to the center. It might seem like it should 
be the other way around, since the rays at the edge pass through 
less glass — shouldn’t they be affected less? In your answer: 


e Include a ray diagram showing a huge close-up view of the 
relevant part of the lens. 


e Make use of the fact that the front and back surfaces aren’t 
always parallel; a lens in which the front and back surfaces are 
always parallel doesn’t focus light at all, so if your explanation 
doesn’t make use of this fact, your argument must be incorrect. 


e Make sure your argument still works even if the rays don’t 
come in parallel to the axis. 


— 


30 When you take pictures with a camera, the distance between 
the lens and the film has to be adjusted, depending on the distance 
at which you want to focus. This is done by moving the lens. If 
you want to change your focus so that you can take a picture of 
something farther away, which way do you have to move the lens? 
Explain using ray diagrams. [Based on a problem by Eric Mazur.| 


— 


31 When swimming underwater, why is your vision made much 
clearer by wearing goggles with flat pieces of glass that trap air 
behind them? [Hint: You can simplify your reasoning by considering 
the special case where you are looking at an object far away, and 
along the optic axis of the eye.] = 


32 An object is more than one focal length from a converging 
lens. (a) Draw a ray diagram. (b) Using reasoning like that devel- 
oped in section 12.3, determine the positive and negative signs in 
the equation 1/f = +1/d;+1/d,. (c) The images of the rose in 
section 4.2 were made using a lens with a focal length of 23 cm. If 
the lens is placed 80 cm from the rose, locate the image. vo 


33 The figure shows four lenses. Lens 1 has two spherical sur- 
faces. Lens 2 is the same as lens 1 but turned around. Lens 3 is 
made by cutting through lens 1 and turning the bottom around. 
Lens 4 is made by cutting a central circle out of lens 1 and recessing 
it. 
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(a) A parallel beam of light enters lens 1 from the left, parallel 
to its axis. Reasoning based on Snell’s law, will the beam emerging 
from the lens be bent inward, or outward, or will it remain parallel 
to the axis? Explain your reasoning. As part of your answer, make 
a huge drawing of one small part of the lens, and apply Snell’s law 
at both interfaces. Recall that rays are bent more if they come to 
the interface at a larger angle with respect to the normal. 


(b) What will happen with lenses 2, 3, and 4? Explain. Drawings 
are not necessary. R 


34 The drawing shows the anatomy of the human eye, at twice 
life size. Find the radius of curvature of the outer surface of the 
cornea by measurements on the figure, and then derive the focal 
length of the air-cornea interface, where almost all the focusing of 
light occurs. You will need to use physical reasoning to modify 
the lensmaker’s equation for the case where there is only a single 
refracting surface. Assume that the index of refraction of the cornea 
is essentially that of water. = 


35 An object is less than one focal length from a converging lens. 
(a) Draw a ray diagram. (b) Using reasoning like that developed 
in section 12.3, determine the positive and negative signs in the 
equation 1/f = +1/d;+1/do. (c) The images of the rose in section 
4.2 were made using a lens with a focal length of 23 cm. If the lens 
is placed 10 cm from the rose, locate the image. Vv 


36 Nearsighted people wear glasses whose lenses are diverging. 
(a) Draw a ray diagram. For simplicity pretend that there is no 
eye behind the glasses. (b) Using reasoning like that developed 
in section 12.3, determine the positive and negative signs in the 
equation 1/f = +1/d;+1/d,. (c) If the focal length of the lens is 
50.0 cm, and the person is looking at an object at a distance of 80.0 
cm, locate the image. Vv eo 


37 (a) Light is being reflected diffusely from an object 1.000 m 
underwater. The light that comes up to the surface is refracted at 
the water-air interface. If the refracted rays all appear to come from 
the same point, then there will be a virtual image of the object in 
the water, above the object’s actual position, which will be visible 
to an observer above the water. Consider three rays, A, B and C, 
whose angles in the water with respect to the normal are 0; = 0.000°, 
1.000° and 20.000° respectively. Find the depth of the point at which 
the refracted parts of A and B appear to have intersected, and do 
the same for A and C. Show that the intersections are at nearly the 
same depth, but not quite. [Check: The difference in depth should 
be about 4 cm.] 


(b) Since all the refracted rays do not quite appear to have come 
from the same point, this is technically not a virtual image. In 
practical terms, what effect would this have on what you see? 
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(c) In the case where the angles are all small, use algebra and 
trig to show that the refracted rays do appear to come from the 
same point, and find an equation for the depth of the virtual im- 
age. Do not put in any numerical values for the angles or for the 
indices of refraction — just keep them as symbols. You will need 
the approximation sin @ + tan @ = @, which is valid for small angles 
measured in radians. = 


38 Prove that the principle of least time leads to Snell’s law. 


39 Two standard focal lengths for camera lenses are 50 mm 
(standard) and 28 mm (wide-angle). To see how the focal lengths 
relate to the angular size of the field of view, it is helpful to visualize 
things as represented in the figure. Instead of showing many rays 
coming from the same point on the same object, as we normally do, 
the figure shows two rays from two different objects. Although the 
lens will intercept infinitely many rays from each of these points, we 
have shown only the ones that pass through the center of the lens, 
so that they suffer no angular deflection. (Any angular deflection at 
the front surface of the lens is canceled by an opposite deflection at 
the back, since the front and back surfaces are parallel at the lens’s 
center.) What is special about these two rays is that they are aimed 
at the edges of one 35-mm-wide frame of film; that is, they show 
the limits of the field of view. Throughout this problem, we assume 
that d, is much greater than d;. (a) Compute the angular width 
of the camera’s field of view when these two lenses are used. (b) 
Use small-angle approximations to find a simplified equation for the 
angular width of the field of view, 0, in terms of the focal length, 
f, and the width of the film, w. Your equation should not have 
any trig functions in it. Compare the results of this approximation 
with your answers from part a. (c) Suppose that we are holding 
constant the aperture (amount of surface area of the lens being 
used to collect light). When switching from a 50-mm lens to a 28- 
mm lens, how many times longer or shorter must the exposure be 
in order to make a properly developed picture, i.e., one that is not 
under- or overexposed? [Based on a problem by Arnold Arons.| 
> Solution, p. 883 © 


40 A nearsighted person is one whose eyes focus light too 
strongly, and who is therefore unable to relax the lens inside her 
eye sufficiently to form an image on her retina of an object that is 
too far away. 


(a) Draw a ray diagram showing what happens when the person 
tries, with uncorrected vision, to focus at infinity. 


(b) What type of lenses do her glasses have? Explain. 


(c) Draw a ray diagram showing what happens when she wears 
glasses. Locate both the image formed by the glasses and the final 
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image. 


(d) Suppose she sometimes uses contact lenses instead of her 
glasses. Does the focal length of her contacts have to be less than, 
equal to, or greater than that of her glasses? Explain. x 


Al Fred’s eyes are able to focus on things as close as 5.0 cm. 
Fred holds a magnifying glass with a focal length of 3.0 cm at a 
height of 2.0 cm above a flatworm. (a) Locate the image, and find 
the magnification. (b) Without the magnifying glass, from what 
distance would Fred want to view the flatworm to see its details 
as well as possible? With the magnifying glass? (c) Compute the 
angular magnification. Eg 


42 Panel 1 of the figure shows the optics inside a pair of binoc- 
ulars. They are essentially a pair of telescopes, one for each eye. 
But to make them more compact, and allow the eyepieces to be the 
right distance apart for a human face, they incorporate a set of eight 
prisms, which fold the light path. In addition, the prisms make the 
image upright. Panel 2 shows one of these prisms, known as a Porro 
prism. The light enters along a normal, undergoes two total internal 
reflections at angles of 45 degrees with respect to the back surfaces, 
and exits along a normal. The image of the letter R has been flipped 
across the horizontal. Panel 3 shows a pair of these prisms glued 
together. The image will be flipped across both the horizontal and 
the vertical, which makes it oriented the right way for the user of 
the binoculars. 

(a) Find the minimum possible index of refraction for the glass used 
in the prisms. 

(b) For a material of this minimal index of refraction, find the frac- 
tion of the incoming light that will be lost to reflection in the four 
Porro prisms on a each side of a pair of binoculars. (See section 6.2.) 
In real, high-quality binoculars, the optical surfaces of the prisms 
have antireflective coatings, but carry out your calculation for the 
case where there is no such coating. 
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(c) Discuss the reasons why a designer of binoculars might or might 
not want to use a material with exactly the index of refraction found 
in part a. = 


43 It would be annoying if your eyeglasses produced a magnified 
or reduced image. Prove that when the eye is very close to a lens, 
and the lens produces a virtual image, the angular magnification is 
always approximately equal to 1 (regardless of whether the lens is 
diverging or converging). & 


44 The figure shows a diffraction pattern made by a double 
slit, along with an image of a meter stick to show the scale. Sketch 
the diffraction pattern from the figure on your paper. Now consider 
the four variables in the equation A/d = sin@/m. Which of these 
are the same for all five fringes, and which are different for each 
fringe? Which variable would you naturally use in order to label 
which fringe was which? Label the fringes on your sketch using the 
values of that variable. g 


45 Match gratings A-C with the diffraction patterns 1-3 that 
they produce. Explain. 


A i 

BEST T1110 
3 

c ITIL Liiiiiie 


— 


46 The figure below shows two diffraction patterns. The top one 
was made with yellow light, and the bottom one with red. Could 
the slits used to make the two patterns have been the same? 
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AT The figure shows a diffraction pattern made by a double slit, 
along with an image of a meter stick to show the scale. The slits 
were 146 cm away from the screen on which the diffraction pattern 
was projected. The spacing of the slits was 0.050 mm. What was 
the wavelength of the light? L 


48 Why would blue or violet light be the best for microscopy? 


~~ 


49 The figure below shows two diffraction patterns, both made 
with the same wavelength of red light. (a) What type of slits made 
the patterns? Is it a single slit, double slits, or something else? 
Explain. (b) Compare the dimensions of the slits used to make the 
top and bottom pattern. Give a numerical ratio, and state which 
way the ratio is, i.e., which slit pattern was the larger one. Explain. 


~ 


50 When white light passes through a diffraction grating, what 
is the smallest value of m for which the visible spectrum of order m 
overlaps the next one, of order m+ 1? (The visible spectrum runs 
from about 400 nm to about 700 nm.) Fe 


51 For star images such as the ones in figure x, estimate the 
angular width of the diffraction spot due to diffraction at the mouth 
of the telescope. Assume a telescope with a diameter of 10 meters 
(the largest currently in existence), and light with a wavelength in 
the middle of the visible range. Compare with the actual angular 


756 Chapter 12 Optics 


Problem 51. This image of 
the Pleiades star cluster shows 
haloes around the stars due to the 
wave nature of light. 


size of a star of diameter 10? m seen from a distance of 10!” m. 
What does this tell you? E 


52 The figure below shows three diffraction patterns. All were 
made under identical conditions, except that a different set of double 
slits was used for each one. The slits used to make the top pattern 
had a center-to-center separation d = 0.50 mm, and each slit was 
w = 0.04 mm wide. (a) Determine d and w for the slits used to 
make the pattern in the middle. (b) Do the same for the slits used 
to make the bottom pattern. 


~— 


53 The beam of a laser passes through a diffraction grating, 
fans out, and illuminates a wall that is perpendicular to the original 
beam, lying at a distance of 2.0 m from the grating. The beam 
is produced by a helium-neon laser, and has a wavelength of 694.3 
nm. The grating has 2000 lines per centimeter. (a) What is the 
distance on the wall between the central maximum and the maxima 
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immediately to its right and left? (b) How much does your answer 
change when you use the small-angle approximations 0 ~ sin@ = 
tan 0? Vv & 


54 Ultrasound, i.e., sound waves with frequencies too high to be 
audible, can be used for imaging fetuses in the womb or for break- 
ing up kidney stones so that they can be eliminated by the body. 
Consider the latter application. Lenses can be built to focus sound 
waves, but because the wavelength of the sound is not all that small 
compared to the diameter of the lens, the sound will not be concen- 
trated exactly at the geometrical focal point. Instead, a diffraction 
pattern will be created with an intense central spot surrounded by 
fainter rings. About 85% of the power is concentrated within the 
central spot. The angle of the first minimum (surrounding the cen- 
tral spot) is given by sin? = \/b, where 6 is the diameter of the lens. 
This is similar to the corresponding equation for a single slit, but 
with a factor of 1.22 in front which arises from the circular shape of 
the aperture. Let the distance from the lens to the patient’s kidney 
stone be L = 20 cm. You will want f > 20 kHz, so that the sound 
is inaudible. Find values of 6 and f that would result in a usable 
design, where the central spot is small enough to lie within a kidney 
stone 1 cm in diameter. x 


55 Under what circumstances could one get a mathematically 
undefined result by solving the double-slit diffraction equation for 0? 
Give a physical interpretation of what would actually be observed. 


~ 


56 When ultrasound is used for medical imaging, the frequency 
may be as high as 5-20 MHz. Another medical application of ul- 
trasound is for therapeutic heating of tissues inside the body; here, 
the frequency is typically 1-3 MHz. What fundamental physical rea- 
sons could you suggest for the use of higher frequencies for imaging? 


~ 


57 Suppose we have a polygonal room whose walls are mirrors, 
and there a pointlike light source in the room. In most such exam- 
ples, every point in the room ends up being illuminated by the light 
source after some finite number of reflections. A difficult mathemat- 
ical question, first posed in the middle of the last century, is whether 
it is ever possible to have an example in which the whole room is 
not illuminated. (Rays are assumed to be absorbed if they strike 
exactly at a vertex of the polygon, or if they pass exactly through 
the plane of a mirror.) 


The problem was finally solved in 1995 by G.W. Tokarsky, who 
found an example of a room that was not illuminable from a cer- 
tain point. Figure 57 shows a slightly simpler example found two 
years later by D. Castro. If a light source is placed at either of the 
locations shown with dots, the other dot remains unilluminated, al- 
though every other point is lit up. It is not straightforward to prove 
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rigorously that Castro’s solution has this property. However, the 
plausibility of the solution can be demonstrated as follows. 


Suppose the light source is placed at the right-hand dot. Locate 
all the images formed by single reflections. Note that they form a 
regular pattern. Convince yourself that none of these images illumi- 
nates the left-hand dot. Because of the regular pattern, it becomes 
plausible that even if we form images of images, images of images 
of images, etc., none of them will ever illuminate the other dot. 


There are various other versions of the problem, some of which 
remain unsolved. The book by Klee and Wagon gives a good in- 
troduction to the topic, although it predates Tokarsky and Castro’s 
work. 


References: 


G.W. Tokarsky, “Polygonal Rooms Not Illuminable from Every Point. 


Amer. Math. Monthly 102, 867-879, 1995. 

D. Castro, “Corrections.” Quantum 7, 42, Jan. 1997. 

V. Klee and 8. Wagon, Old and New Unsolved Problems in Plane 
Geometry and Number Theory. Mathematical Association of Amer- 
ica, 1991. = 


58 A mechanical linkage is a device that changes one type of 
motion into another. The most familiar example occurs in a gasoline 
car’s engine, where a connecting rod changes the linear motion of the 
piston into circular motion of the crankshaft. The top panel of the 
figure shows a mechanical linkage invented by Peaucellier in 1864, 
and independently by Lipkin around the same time. It consists of 
six rods joined by hinges, the four short ones forming a rhombus. 
Point O is fixed in space, but the apparatus is free to rotate about 
O. Motion at P is transformed into a different motion at P’ (or vice 
versa). 


Geometrically, the linkage is a mechanical implementation of 
the ancient problem of inversion in a circle. Considering the case in 
which the rhombus is folded flat, let the k be the distance from O 
to the point where P and P’ coincide. Form the circle of radius k 
with its center at O. As P and P’ move in and out, points on the 
inside of the circle are always mapped to points on its outside, such 
that rr’ = k?. That is, the linkage is a type of analog computer 
that exactly solves the problem of finding the inverse of a number 
r. Inversion in a circle has many remarkable geometrical properties, 
discussed in H.S.M. Coxeter, Introduction to Geometry, Wiley, 1961. 
If a pen is inserted through a hole at P, and P’ is traced over a 
geometrical figure, the Peaucellier linkage can be used to draw a 
kind of image of the figure. 


A related problem is the construction of pictures, like the one in 
the bottom panel of the figure, called anamorphs. The drawing of 
the column on the paper is highly distorted, but when the reflecting 


? 


o] 
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cylinder is placed in the correct spot on top of the page, an undis- 
torted image is produced inside the cylinder. (Wide-format movie 
technologies such as Cinemascope are based on similar principles.) 


Show that the Peaucellier linkage does not convert correctly be- 
tween an image and its anamorph, and design a modified version of 
the linkage that does. Some knowledge of analytic geometry will be 
helpful. > 


Key to symbols: 
easy © typical challenging Mdifficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 
Exercise 12A: Exploring Images With a Curved Mirror 
Equipment: 

concave mirrors with deep curvature 

concave mirrors with gentle curvature 

convex mirrors 


1. Obtain a curved mirror from your instructor. If it is silvered on both sides, make sure you’re 
working with the concave side, which bends light rays inward. Look at your own face in the 
mirror. Now change the distance between your face and the mirror, and see what happens. 
Explore the full range of possible distances between your face and the mirror. 


In these observations you’ve been changing two variables at once: the distance between the 
object (your face) and the mirror, and the distance from the mirror to your eye. In general, 
scientific experiments become easier to interpret if we practice isolation of variables, i.e., only 
change one variable while keeping all the others constant. In parts 2 and 3 you'll form an image 
of an object that’s not your face, so that you can have independent control of the object distance 
and the point of view. 


2. With the mirror held far away from you, observe the image of something behind you, over 
your shoulder. Now bring your eye closer and closer to the mirror. Can you see the image with 
your eye very close to the mirror? See if you can explain your observation by drawing a ray 
diagram. 


> turn page 
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3. Now imagine the following new situation, but don’t actually do it yet. Suppose you lay the 
mirror face-up on a piece of tissue paper, put your finger a few cm above the mirror, and look 
at the image of your finger. As in part 2, you can bring your eye closer and closer to the mirror. 


Will you be able to see the image with your eye very close to the mirror? Draw a ray diagram 
to help you predict what you will observe. 


Prediction:_______--- 


Now test your prediction. If your prediction was incorrect, see if you can figure out what went 
wrong, or ask your instructor for help. 


4. For parts 4 and 5, it’s more convenient to use concave mirrors that are more gently curved; 
obtain one from your instructor. Lay the mirror on the tissue paper, and use it to create an 
image of the overhead lights on a piece of paper above it and a little off to the side. What do 
you have to do in order to make the image clear? Can you explain this observation using a ray 
diagram? 


> turn page 
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5. Now imagine the following experiment, but don’t do it yet. What will happen to the image 
on the paper if you cover half of the mirror with your hand? 


Prediction:____------__ 
Test your prediction. If your prediction was incorrect, can you explain what happened? 


6. Now imagine forming an image with a convex mirror (one that bulges outward), and that 
therefore bends light rays away from the central axis (i.e., is diverging). Draw a typical ray 


diagram. 


Is the image real, or virtual? Will there be more than one type of image? 
Prediction:______-------__ 


Test your prediction. 
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Exercise 12B: Object and Image Distances 
Equipment: 

optical benches 

converging mirrors 

illuminated objects 


1. Set up the optical bench with the mirror at zero on the centimeter scale. Set up the 
illuminated object on the bench as well. 


2. Each group will locate the image for their own value of the object distance, by finding where 
a piece of paper has to be placed in order to see the image on it. (The instructor will do one 
point as well.) Note that you will have to tilt the mirror a little so that the paper on which you 
project the image doesn’t block the light from the illuminated object. 


Is the image real or virtual? How do you know? Is it inverted, or uninverted? 


Draw a ray diagram. 


3. Measure the image distance and write your result in the table on the board. Do the same for 
the magnification. 


4. What do you notice about the trend of the data on the board? Draw a second ray diagram 
with a different object distance, and show why this makes sense. Some tips for doing this 
correctly: (1) For simplicity, use the point on the object that is on the mirror’s axis. (2) You 
need to trace two rays to locate the image. To save work, don’t just do two rays at random 
angles. You can either use the on-axis ray as one ray, or do two rays that come off at the same 
angle, one above and one below the axis. (3) Where each ray hits the mirror, draw the normal 
line, and make sure the ray is at equal angles on both sides of the normal. 


5. We will find the mirror’s focal length from the instructor’s data-point. Then, using this focal 
length, calculate a theoretical prediction of the image distance, and write it on the board next 
to the experimentally determined image distance. 
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Exercise 12C: How strong are your glasses? 
This exercise was created by Dan Maclsaac. 
Equipment: 
eyeglasses 
outbending lenses for students who don’t wear glasses, or who use inbending glasses 
rulers and metersticks 
scratch paper 
marking pens 


Most people who wear glasses have glasses whose lenses are outbending, which allows them to 
focus on objects far away. Such a lens cannot form a real image, so its focal length cannot be 
measured as easily as that of an inbending lens. In this exercise you will determine the focal 
length of your own glasses by taking them off, holding them at a distance from your face, and 
looking through them at a set of parallel lines on a piece of paper. The lines will be reduced 
(the lens’s magnification is less than one), and by adjusting the distance between the lens and 
the paper, you can make the magnification equal 1/2 exactly, so that two spaces between lines 
as seen through the lens fit into one space as seen simultaneously to the side of the lens. This 
object distance can be used in order to find the focal length of the lens. 


1. Use a marker to draw three evenly spaced parallel lines on the paper. (A spacing of a few 
cm works well.) 


2. Does this technique really measure magnification or does it measure angular magnification? 
What can you do in your experiment in order to make these two quantities nearly the same, so 
the math is simpler? 


3. Before taking any numerical data, use algebra to find the focal length of the lens in terms of 
dy, the object distance that results in a magnification of 1/2. 


4. Measure the object distance that results in a magnification of 1/2, and determine the focal 
length of your lens. 
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Exercise 12D: Double-Source Interference 


1. Two sources separated by a distance d = 1 cm make circular ripples with a wavelength of 
A =2cm. Ona piece of paper, make a life-size drawing of the two sources in the default setup, 
and locate the following points: 


A. The point that is 10 wavelengths from source #1 and 10 wavelengths from source #2. 
B. The point that is 10.5 wavelengths from #1 and 10.5 from #2. 

C. The point that is 11 wavelengths from #1 and 11 from #2. 

D. The point that is 10 wavelengths from #1 and 10.5 from #2. 

E. The point that is 11 wavelengths from #1 and 11.5 from #2. 

F. The point that is 10 wavelengths from #1 and 11 from #2. 

G. The point that is 11 wavelengths from #1 and 12 from #2. 


You can do this either using a compass or by putting the next page under your paper and 
tracing. It is not necessary to trace all the arcs completely, and doing so is unnecessarily time- 
consuming; you can fairly easily estimate where these points would lie, and just trace arcs long 
enough to find the relevant intersections. 


What do these points correspond to in the real wave pattern? 


2. Make a fresh copy of your drawing, showing only point F and the two sources, which form a 
long, skinny triangle. Now suppose you were to change the setup by doubling d, while leaving 
the same. It’s easiest to understand what’s happening on the drawing if you move both sources 
outward, keeping the center fixed. Based on your drawing, what will happen to the position of 
point F when you double d? How has the angle of point F changed? 


3. In part 2, you saw the effect of doubling d while leaving A the same. Now what do you think 
would happen to your angles if, starting from the standard setup, you doubled 4 while leaving 
d the same?____ 


4. Suppose A was a millionth of a centimeter, while d was still as in the standard setup. What 
would happen to the angles? What does this tell you about observing diffraction of light? 
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10 cm 
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Exercise 12E: Single-slit diffraction 
Equipment: 

rulers 

computer with web browser 


The following page is a diagram of a single slit and a screen onto which its diffraction pattern 
is projected. The class will make a numerical prediction of the intensity of the pattern at the 
different points on the screen. Each group will be responsible for calculating the intensity at 
one of the points. (Either 11 groups or six will work nicely — in the latter case, only points a, 
c, e, g, i, and k are used.) The idea is to break up the wavefront in the mouth of the slit into 
nine parts, each of which is assumed to radiate semicircular ripples as in Huygens’ principle. 
The wavelength of the wave is 1 cm, and we assume for simplicity that each set of ripples has 
an amplitude of 1 unit when it reaches the screen. 


1. For simplicity, let’s imagine that we were only to use two sets of ripples rather than nine. 
You could measure the distance from each of the two points inside the slit to your point on 
the screen. Suppose the distances were both 25.0 cm. What would be the amplitude of the 
superimposed waves at this point on the screen? 


Suppose one distance was 24.0 cm and the other was 25.0 cm. What would happen? 
What if one was 24.0 cm and the other was 26.0 cm? 
What if one was 24.5 cm and the other was 25.0 cm? 


In general, what combinations of distances will lead to completely destructive and completely 
constructive interference? 


Can you estimate the answer in the case where the distances are 24.7 and 25.0 cm? 


2. Although it is possible to calculate mathematically the amplitude of the sine wave that results 
from superimposing two sine waves with an arbitrary phase difference between them, the algebra 
is rather laborious, and it become even more tedious when we have more than two waves to super- 
impose. Instead, one can simply use a computer spreadsheet or some other computer program to 
add up the sine waves numerically at a series of points covering one complete cycle. This is what 
we will actually do. You just need to enter the relevant data into the computer, then examine the 
results and pick off the amplitude from the resulting list of numbers. You can run the software 
through a web interface at http://lightandmatter.com/cgi-bin/diffraction1.cgi. 


3. Measure all nine distances to your group’s point on the screen, and write them on the board 
- that way everyone can see everyone else’s data, and the class can try to make sense of why the 
results came out the way they did. Determine the amplitude of the combined wave, and write 
it on the board as well. 


The class will discuss why the results came out the way they did. 
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Exercise 12F: Diffraction of Light 
Equipment: 
slit patterns, lasers, straight-filament bulbs 


station 1 
You have a mask with a bunch of different double slits cut out of it. The values of w and d are 
as follows: 


pattern A w=0.04mm d=.250 mm 
pattern B w=0.04mm d=.500 mm 
pattern C w=0.08 mm d=.250 mm 
pattern D w=0.08 mm d=.500 mm 


Predict how the patterns will look different, and test your prediction. The easiest way to get 
the laser to point at different sets of slits is to stick folded up pieces of paper in one side or the 
other of the holders. 


station 2 
This is just like station 1, but with single slits: 


pattern A w=0.02 mm 
pattern B w=0.04 mm 
pattern C w=0.08 mm 
pattern D w=0.16 mm 


Predict what will happen, and test your predictions. If you have time, check the actual numerical 
ratios of the w values against the ratios of the sizes of the diffraction patterns 


station 3 
This is like station 1, but the only difference among the sets of slits is how many slits there are: 


pattern A double slit 
pattern B 8 slits 
pattern C 4 slits 
pattern D 5 slits 


station 4 

Hold the diffraction grating up to your eye, and look through it at the straight-filament light 
bulb. If you orient the grating correctly, you should be able to see the m = 1 and m = —1 
diffraction patterns off the left and right. If you have it oriented the wrong way, they’ll be above 
and below the bulb instead, which is inconvenient because the bulb’s filament is vertical. Where 
is the m = 0 fringe? Can you see m = 2, etc.? 


Station 5 has the same equipment as station 4. If you’re assigned to station 5 first, you should 
actually do activity 4 first, because it’s easier. 


station 5 

Use the transformer to increase and decrease the voltage across the bulb. This allows you to 
control the filament’s temperature. Sketch graphs of intensity as a function of wavelength for 
various temperatures. The inability of the wave model of light to explain the mathematical 
shapes of these curves was historically one of the reasons for creating a new model, in which 
light is both a particle and a wave. 
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Chapter 13 
Quantum Physics 


13.1 Rules of Randomness 


Given for one instant an intelligence which could comprehend all 
the forces by which nature is animated and the respective positions 
of the things which compose it...nothing would be uncertain, and 
the future as the past would be laid out before its eyes. 


Pierre Simon de Laplace, 1776 


The energy produced by the atom is a very poor kind of thing. 
Anyone who expects a source of power from the transformation of 
these atoms is talking moonshine. 


Ernest Rutherford, 1983 


The Quantum Mechanics is very imposing. But an inner voice tells 
me that it is still not the final truth. The theory yields much, but 
it hardly brings us nearer to the secret of the Old One. In any case, 
I am convinced that He does not play dice. 


Albert Einstein 


However radical Newton’s clockwork universe seemed to his con- 
temporaries, by the early twentieth century it had become a sort of 
smugly accepted dogma. Luckily for us, this deterministic picture of 
the universe breaks down at the atomic level. The clearest demon- 
stration that the laws of physics contain elements of randomness 
is in the behavior of radioactive atoms. Pick two identical atoms 
of a radioactive isotope, say the naturally occurring uranium 238, 
and watch them carefully. They will decay at different times, even 
though there was no difference in their initial behavior. 


We would be in big trouble if these atoms’ behavior was as pre- 
dictable as expected in the Newtonian world-view, because radioac- 
tivity is an important source of heat for our planet. In reality, each 
atom chooses a random moment at which to release its energy, re- 
sulting in a nice steady heating effect. The earth would be a much 
colder planet if only sunlight heated it and not radioactivity. Prob- 
ably there would be no volcanoes, and the oceans would never have 
been liquid. The deep-sea geothermal vents in which life first evolved 
would never have existed. But there would be an even worse conse- 
quence if radioactivity was deterministic: after a few billion years of 
peace, all the uranium 238 atoms in our planet would presumably 
pick the same moment to decay. The huge amount of stored nuclear 


a/In 1980, the continental 
U.S. got its first taste of active 
volcanism in recent memory with 
the eruption of Mount St. Helens. 
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energy, instead of being spread out over eons, would all be released 
at one instant, blowing our whole planet to Kingdom Come.! 


The new version of physics, incorporating certain kinds of ran- 
domness, is called quantum physics (for reasons that will become 
clear later). It represented such a dramatic break with the pre- 
vious, deterministic tradition that everything that came before is 
considered “classical,” even the theory of relativity. This chapter is 
a basic introduction to quantum physics. 


Discussion Question 


A | said “Pick two identical atoms of a radioactive isotope.” Are two 
atoms really identical? If their electrons are orbiting the nucleus, can we 
distinguish each atom by the particular arrangement of its electrons at 
some instant in time? 


13.1.1 Randomness isn’t random. 


Einstein’s distaste for randomness, and his association of deter- 
minism with divinity, goes back to the Enlightenment conception of 
the universe as a gigantic piece of clockwork that only had to be set 
in motion initially by the Builder. Many of the founders of quan- 
tum mechanics were interested in possible links between physics and 
Eastern and Western religious and philosophical thought, but every 
educated person has a different concept of religion and philosophy. 
Bertrand Russell remarked, “Sir Arthur Eddington deduces religion 
from the fact that atoms do not obey the laws of mathematics. Sir 
James Jeans deduces it from the fact that they do.” 


Russell’s witticism, which implies incorrectly that mathemat- 
ics cannot describe randomness, remind us how important it is not 
to oversimplify this question of randomness. You should not sim- 
ply surmise, “Well, it’s all random, anything can happen.” For 
one thing, certain things simply cannot happen, either in classical 
physics or quantum physics. The conservation laws of mass, energy, 
momentum, and angular momentum are still valid, so for instance 
processes that create energy out of nothing are not just unlikely 
according to quantum physics, they are impossible. 


A useful analogy can be made with the role of randomness in 
evolution. Darwin was not the first biologist to suggest that species 
changed over long periods of time. His two new fundamental ideas 
were that (1) the changes arose through random genetic variation, 
and (2) changes that enhanced the organism’s ability to survive and 
reproduce would be preserved, while maladaptive changes would be 


'This is under the assumption that all the uranium atoms were created at the 
same time. In reality, we have only a general idea of the processes that might 
have created the heavy elements in the nebula from which our solar system 
condensed. Some portion of them may have come from nuclear reactions in 
supernova explosions in that particular nebula, but some may have come from 
previous supernova explosions throughout our galaxy, or from exotic events like 
collisions of white dwarf stars. 
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eliminated by natural selection. Doubters of evolution often consider 
only the first point, about the randomness of natural variation, but 
not the second point, about the systematic action of natural selec- 
tion. They make statements such as, “the development of a complex 
organism like Homo sapiens via random chance would be like a whirl- 
wind blowing through a junkyard and spontaneously assembling a 
jumbo jet out of the scrap metal.” The flaw in this type of reason- 
ing is that it ignores the deterministic constraints on the results of 
random processes. For an atom to violate conservation of energy is 
no more likely than the conquest of the world by chimpanzees next 
year. 


Discussion Question 


A Economists often behave like wannabe physicists, probably because 
it seems prestigious to make numerical calculations instead of talking 
about human relationships and organizations like other social scientists. 
Their striving to make economics work like Newtonian physics extends 
to a parallel use of mechanical metaphors, as in the concept of a mar- 
ket’s supply and demand acting like a self-adjusting machine, and the 
idealization of people as economic automatons who consistently strive to 
maximize their own wealth. What evidence is there for randomness rather 
than mechanical determinism in economics? 


13.1.2 Calculating randomness 


You should also realize that even if something is random, we 
can still understand it, and we can still calculate probabilities nu- 
merically. In other words, physicists are good bookmakers. A good 
bookmaker can calculate the odds that a horse will win a race much 
more accurately that an inexperienced one, but nevertheless cannot 
predict what will happen in any particular race. 


Statistical independence 


As an illustration of a general technique for calculating odds, 
suppose you are playing a 25-cent slot machine. Each of the three 
wheels has one chance in ten of coming up with a cherry. If all 
three wheels come up cherries, you win $100. Even though the 
results of any particular trial are random, you can make certain 
quantitative predictions. First, you can calculate that your odds 
of winning on any given trial are 1/10 x 1/10 x 1/10 = 1/1000 = 
0.001. Here, I am representing the probabilities as numbers from 
0 to 1, which is clearer than statements like “The odds are 999 to 
1,” and makes the calculations easier. A probability of 0 represents 
something impossible, and a probability of 1 represents something 
that will definitely happen. 


Also, you can say that any given trial is equally likely to result in 
a win, and it doesn’t matter whether you have won or lost in prior 
games. Mathematically, we say that each trial is statistically inde- 
pendent, or that separate games are uncorrelated. Most gamblers 
are mistakenly convinced that, to the contrary, games of chance are 
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correlated. If they have been playing a slot machine all day, they 
are convinced that it is “getting ready to pay,” and they do not 
want anyone else playing the machine and “using up” the jackpot 
that they “have coming.” In other words, they are claiming that 
a series of trials at the slot machine is negatively correlated, that 
losing now makes you more likely to win later. Craps players claim 
that you should go to a table where the person rolling the dice is 
“hot,” because she is likely to keep on rolling good numbers. Craps 
players, then, believe that rolls of the dice are positively correlated, 
that winning now makes you more likely to win later. 


My method of calculating the probability of winning on the slot 
machine was an example of the following important rule for calcu- 
lations based on independent probabilities: 


The Law of Independent Probabilities 

If the probability of one event happening is P4, and the prob- 
ability of a second statistically independent event happening 
is Pg, then the probability that they will both occur is the 
product of the probabilities, P4Pg. If there are more than 
two events involved, you simply keep on multiplying. 


Note that this only applies to independent probabilities. For 
instance, if you have a nickel and a dime in your pocket, and you 
randomly pull one out, there is a probability of 0.5 that it will be 
the nickel. If you then replace the coin and again pull one out 
randomly, there is again a probability of 0.5 of coming up with the 
nickel, because the probabilities are independent. Thus, there is a 
probability of 0.25 that you will get the nickel both times. 


Suppose instead that you do not replace the first coin before 
pulling out the second one. Then you are bound to pull out the 
other coin the second time, and there is no way you could pull the 
nickel out twice. In this situation, the two trials are not indepen- 
dent, because the result of the first trial has an effect on the second 
trial. The law of independent probabilities does not apply, and the 
probability of getting the nickel twice is zero, not 0.25. 


Experiments have shown that in the case of radioactive decay, 
the probability that any nucleus will decay during a given time in- 
terval is unaffected by what is happening to the other nuclei, and 
is also unrelated to how long it has gone without decaying. The 
first observation makes sense, because nuclei are isolated from each 
other at the centers of their respective atoms, and therefore have no 
physical way of influencing each other. The second fact is also rea- 
sonable, since all atoms are identical. Suppose we wanted to believe 
that certain atoms were “extra tough,” as demonstrated by their 
history of going an unusually long time without decaying. Those 
atoms would have to be different in some physical way, but nobody 
has ever succeeded in detecting differences among atoms. There is 
no way for an atom to be changed by the experiences it has in its 
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lifetime. 


Addition of indexprobabilities!addition ofprobabilities 


The law of independent probabilities tells us to use multiplica- 
tion to calculate the probability that both A and B will happen, 
assuming the probabilities are independent. What about the prob- 
ability of an “or” rather than an “and”? If two events A and B 
are mutually exclusive, then the probability of one or the other oc- 
curring is the sum P4 + Pg. For instance, a bowler might have a 
30% chance of getting a strike (knocking down all ten pins) and a 
20% chance of knocking down nine of them. The bowler’s chance of 
knocking down either nine pins or ten pins is therefore 50%. 


It does not make sense to add probabilities of things that are 
not mutually exclusive, i.e., that could both happen. Say I have a 
90% chance of eating lunch on any given day, and a 90% chance of 
eating dinner. The probability that I will eat either lunch or dinner 
is not 180%. 


Normalization 


If I spin a globe and randomly pick a point on it, I have about a 
70% chance of picking a point that’s in an ocean and a 30% chance 
of picking a point on land. The probability of picking either wa- 
ter or land is 70% + 30% = 100%. Water and land are mutually 
exclusive, and there are no other possibilities, so the probabilities 
had to add up to 100%. It works the same if there are more than 
two possibilities — if you can classify all possible outcomes into a 
list of mutually exclusive results, then all the probabilities have to 
add up to 1, or 100%. This property of probabilities is known as 
normalization. 


Averages 


Another way of dealing with randomness is to take averages. 
The casino knows that in the long run, the number of times you win 
will approximately equal the number of times you play multiplied 
by the probability of winning. In the slot-machine game described 
on page 773, where the probability of winning is 0.001, if you spend 
a week playing, and pay $2500 to play 10,000 times, you are likely 
to win about 10 times (10,000 x 0.001 = 10), and collect $1000. On 
the average, the casino will make a profit of $1500 from you. This 
is an example of the following rule. 


Rule for Calculating Averages 

If you conduct N identical, statistically independent trials, 
and the probability of success in each trial is P, then on the 
average, the total number of successful trials will be NP. If N 
is large enough, the relative error in this estimate will become 
small. 


The statement that the rule for calculating averages gets more 


b/Normalization: the proba- 
bility of picking land plus the 
probability of picking water adds 
up to 1. 
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dice 


random? 


and more accurate for larger and larger N(known popularly as the 
“law of averages”) often provides a correspondence principle that 
connects classical and quantum physics. For instance, the amount 
of power produced by a nuclear power plant is not random at any 
detectable level, because the number of atoms in the reactor is so 
large. In general, random behavior at the atomic level tends to 
average out when we consider large numbers of atoms, which is why 
physics seemed deterministic before physicists learned techniques for 
studying atoms individually. 


We can achieve great precision with averages in quantum physics 
because we can use identical atoms to reproduce exactly the same 
situation many times. If we were betting on horses or dice, we would 
be much more limited in our precision. After a thousand races, the 
horse would be ready to retire. After a million rolls, the dice would 
be worn out. 


self-check A 

Which of the following things must be independent, which could be in- 
dependent, and which definitely are not independent? (1) the probabil- 
ity of successfully making two free-throws in a row in basketball; (2) the 
probability that it will rain in London tomorrow and the probability that it 
will rain on the same day in a certain city in a distant galaxy; (3) your 
probability of dying today and of dying tomorrow. > Answer, p. 873 


Discussion Questions 


A Newtonian physics is an essentially perfect approximation for de- 
scribing the motion of a pair of dice. If Newtonian physics is deterministic, 
why do we consider the result of rolling dice to be random? 


B Why isn’t it valid to define randomness by saying that randomness 
is when all the outcomes are equally likely? 


C The sequence of digits 121212121212121212 seems clearly nonran- 
dom, and 41592653589793 seems random. The latter sequence, how- 
ever, is the decimal form of pi, starting with the third digit. There is a story 
about the Indian mathematician Ramanujan, a self-taught prodigy, that a 
friend came to visit him in a cab, and remarked that the number of the 
cab, 1729, seemed relatively uninteresting. Ramanujan replied that on 
the contrary, it was very interesting because it was the smallest number 
that could be represented in two different ways as the sum of two cubes. 
The Argentine author Jorge Luis Borges wrote a short story called “The 
Library of Babel,” in which he imagined a library containing every book 
that could possibly be written using the letters of the alphabet. It would in- 
clude a book containing only the repeated letter “a;” all the ancient Greek 
tragedies known today, all the lost Greek tragedies, and millions of Greek 
tragedies that were never actually written; your own life story, and various 
incorrect versions of your own life story; and countless anthologies con- 
taining a short story called “The Library of Babel.” Of course, if you picked 
a book from the shelves of the library, it would almost certainly look like a 
nonsensical sequence of letters and punctuation, but it’s always possible 
that the seemingly meaningless book would be a science-fiction screen- 
play written in the language of a Neanderthal tribe, or the lyrics to a set 
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of incomparably beautiful love songs written in a language that never ex- 
isted. In view of these examples, what does it really mean to say that 
something is random? 


13.1.3 Probability distributions 


So far we’ve discussed random processes having only two possible 
outcomes: yes or no, win or lose, on or off. More generally, a random 
process could have a result that is a number. Some processes yield 
integers, as when you roll a die and get a result from one to six, but 
some are not restricted to whole numbers, for example the number 
of seconds that a uranium-238 atom will exist before undergoing 
radioactive decay. 


Consider a throw of a die. If the die is “honest,” then we expect 
all six values to be equally likely. Since all six probabilities must 
add up to 1, then probability of any particular value coming up 
must be 1/6. We can summarize this in a graph, d. Areas under 
the curve can be interpreted as total probabilities. For instance, 
the area under the curve from 1 to 3 is 1/6 + 1/6 + 1/6 = 1/2, so 
the probability of getting a result from 1 to 3 is 1/2. The function 
shown on the graph is called the probability distribution. 


Figure e shows the probabilities of various results obtained by 
rolling two dice and adding them together, as in the game of craps. 
The probabilities are not all the same. There is a small probability 
of getting a two, for example, because there is only one way to do 
it, by rolling a one and then another one. The probability of rolling 
a seven is high because there are six different ways to do it: 1+6, 
2+5, etc. 


If the number of possible outcomes is large but finite, for example 
the number of hairs on a dog, the graph would start to look like a 
smooth curve rather than a ziggurat. 


What about probability distributions for random numbers that 
are not integers? We can no longer make a graph with probability 
on the y axis, because the probability of getting a given exact num- 
ber is typically zero. For instance, there is zero probability that a 
radioactive atom will last for exactly 3 seconds, since there is are 
infinitely many possible results that are close to 3 but not exactly 
three: 2.999999999999999996876876587658465436, for example. It 
doesn’t usually make sense, therefore, to talk about the probability 
of a single numerical result, but it does make sense to talk about 
the probability of a certain range of results. For instance, the prob- 
ability that an atom will last more than 3 and less than 4 seconds is 
a perfectly reasonable thing to discuss. We can still summarize the 
probability information on a graph, and we can still interpret areas 
under the curve as probabilities. 


But the y axis can no longer be a unitless probability scale. In 
radioactive decay, for example, we want the x axis to have units of 
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time, and we want areas under the curve to be unitless probabilities. 
The area of a single square on the graph paper is then 


(unitless area of a square) 
= (width of square with time units) 


x (height of square) 


If the units are to cancel out, then the height of the square must 
evidently be a quantity with units of inverse time. In other words, 
the y axis of the graph is to be interpreted as probability per unit 
time, not probability. 


Figure f shows another example, a probability distribution for 
people’s height. This kind of bell-shaped curve is quite common. 


self-check B 


Compare the number of people with heights in the range of 130-135 cm 
to the number in the range 135-140. > Answer, p. 873 


Looking for tall basketball players example 1 
> A certain country with a large population wants to find very tall 
people to be on its Olympic basketball team and strike a blow 
against western imperialism. Out of a pool of 10° people who are 
the right age and gender, how many are they likely to find who are 
over 225 cm (7 feet 4 inches) in height? Figure g gives a close-up 
of the “tail” of the distribution shown previously in figure f. 


> The shaded area under the curve represents the probability that 
a given person is tall enough. Each rectangle represents a prob- 
ability of 0.2 x 10-7 cm~! x 1 cm = 2x 10-®. There are about 35 
rectangles covered by the shaded area, so the probability of hav- 
ing a height greater than 225 cm is 7 x 10~” , or just under one in 
a million. Using the rule for calculating averages, the average, or 
expected number of people this tall is (10°) x (7 x 107”) = 70. 


Average and width of a probability distribution 


If the next Martian you meet asks you, “How tall is an adult hu- 
man?,” you will probably reply with a statement about the average 
human height, such as “Oh, about 5 feet 6 inches.” If you wanted 
to explain a little more, you could say, “But that’s only an average. 
Most people are somewhere between 5 feet and 6 feet tall.” Without 
bothering to draw the relevant bell curve for your new extraterres- 
trial acquaintance, you’ve summarized the relevant information by 
giving an average and a typical range of variation. 


The average of a probability distribution can be defined geomet- 
rically as the horizontal position at which it could be balanced if 
it was constructed out of cardboard. A convenient numerical mea- 
sure of the amount of variation about the average, or amount of 
uncertainty, is the full width at half maximum, or FWHM, shown 
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in figure i. 


A great deal more could be said about this topic, and indeed 
an introductory statistics course could spend months on ways of 
defining the center and width of a distribution. Rather than force- 
feeding you on mathematical detail or techniques for calculating 
these things, it is perhaps more relevant to point out simply that 
there are various ways of defining them, and to inoculate you against 
the misuse of certain definitions. 


The average is not the only possible way to say what is a typical 
value for a quantity that can vary randomly; another possible defi- 
nition is the median, defined as the value that is exceeded with 50% 
probability. When discussing incomes of people living in a certain 
town, the average could be very misleading, since it can be affected 
massively if a single resident of the town is Bill Gates. Nor is the 
FWHM the only possible way of stating the amount of random vari- 
ation; another possible way of measuring it is the standard deviation 
(defined as the square root of the average squared deviation from 
the average value). 


13.1.4 Exponential decay and half-life 
Half-life 


Most people know that radioactivity “lasts a certain amount of 
time,” but that simple statement leaves out a lot. As an example, 
consider the following medical procedure used to diagnose thyroid 
function. A very small quantity of the isotope !°'I, produced in a 
nuclear reactor, is fed to or injected into the patient. The body’s 
biochemical systems treat this artificial, radioactive isotope exactly 
the same as !?"I, which is the only naturally occurring type. (Nu- 
tritionally, iodine is a necessary trace element. Iodine taken into 
the body is partly excreted, but the rest becomes concentrated in 
the thyroid gland. Iodized salt has had iodine added to it to pre- 
vent the nutritional deficiency known as goiters, in which the iodine- 
starved thyroid becomes swollen.) As the !°'I undergoes beta decay, 
it emits electrons, neutrinos, and gamma rays. The gamma rays can 
be measured by a detector passed over the patient’s body. As the 
radioactive iodine becomes concentrated in the thyroid, the amount 
of gamma radiation coming from the thyroid becomes greater, and 
that emitted by the rest of the body is reduced. The rate at which 
the iodine concentrates in the thyroid tells the doctor about the 
health of the thyroid. 


If you ever undergo this procedure, someone will presumably 
explain a little about radioactivity to you, to allay your fears that 
you will turn into the Incredible Hulk, or that your next child will 
have an unusual number of limbs. Since iodine stays in your thyroid 
for a long time once it gets there, one thing you’ll want to know is 
whether your thyroid is going to become radioactive forever. They 
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maximum 
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mum (FWHM) of a_ probability 
distribution. 
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may just tell you that the radioactivity “only lasts a certain amount 
of time,” but we can now carry out a quantitative derivation of how 
the radioactivity really will die out. 


Let Psurv(t) be the probability that an iodine atom will survive 
without decaying for a period of at least t. It has been experimen- 
tally measured that half all '°'I atoms decay in 8 hours, so we have 


Paury(8 hr) = 0.5 


Now using the law of independent probabilities, the probability 
of surviving for 16 hours equals the probability of surviving for the 
first 8 hours multiplied by the probability of surviving for the second 
8 hours, 


Pyury(16 hr) = 0.50 x 0.50 
= 0.25 


Similarly we have 


Poury(24 hr) = 0.50 x 0.5 x 0.5 
= 0.125 


Generalizing from this pattern, the probability of surviving for any 
time t that is a multiple of 8 hours is 


Pyury(t) = 0.5¢/8 br 


We now know how to find the probability of survival at intervals 
of 8 hours, but what about the points in time in between? What 
would be the probability of surviving for 4 hours? Well, using the 
law of independent probabilities again, we have 


Paga(S hr) = Peal hr) x Pee (A hr) ; 
which can be rearranged to give 


Prury (4 hr) = Peel hr) 


= V0.5 
= 0.707 


This is exactly what we would have found simply by plugging in 
Path), = 0.5¢/8 br and ignoring the restriction to multiples of 8 
hours. Since 8 hours is the amount of time required for half of the 
atoms to decay, it is known as the half-life, written t,/2. The general 
rule is as follows: 


Exponential Decay Equation 


Prury(t) = 0.5°/4/2 
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Using the rule for calculating averages, we can also find the num- 
ber of atoms, N(t), remaining in a sample at time tf: 


N(t) = N(0) x 0.5%/412 


Both of these equations have graphs that look like dying-out expo- 
nentials, as in the example below. 


Radioactive contamination at Chernobyl! example 2 
> One of the most dangerous radioactive isotopes released by the 
Chernobyl disaster in 1986 was °°Sr, whose half-life is 28 years. 
(a) How long will it be before the contamination is reduced to one 
tenth of its original level? (b) If a total of 102” atoms was released, 
about how long would it be before not a single atom was left? 


> (a) We want to know the amount of time that a 9°Sr nucleus 
has a probability of 0.1 of surviving. Starting with the exponential 
decay formula, 


Psurv = 0.51/42 
we want to solve for t. Taking natural logarithms of both sides, 
t 
InP = —In0.5 ; 
ty/2 


re) 
ty /2 
Ind.5 


Plugging in P = 0.1 and t; 2 = 28 years, we get ft = 93 years. 


t= InP 


(b) This is just like the first part, but P = 10-2” . The result is 
about 2500 years. 


'4C Dating example 3 
Almost all the carbon on Earth is '*C, but not quite. The isotope 
14, with a half-life of 5600 years, is produced by cosmic rays in 
the atmosphere. It decays naturally, but is replenished at such a 
rate that the fraction of '4C in the atmosphere remains constant, 
at 1.3 x 10-12 . Living plants and animals take in both '?C and 
14 from the atmosphere and incorporate both into their bodies. 
Once the living organism dies, it no longer takes in C atoms from 
the atmosphere, and the proportion of '4C gradually falls off as it 
undergoes radioactive decay. This effect can be used to find the 
age of dead organisms, or human artifacts made from plants or 
animals. Figure j on page 782 shows the exponential decay curve 
of '*C in various objects. Similar methods, using longer-lived iso- 
topes, provided the first firm proof that the earth was billions of 
years old, not a few thousand as some had claimed on religious 
grounds. 
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j/ Calibration of the '*C dating method using tree rings and _ arti- 
facts whose ages were known from other methods. Redrawn from Emilio 
Segre, Nuclei and Particles, 1965. 


Rate of decay 


If you want to find how many radioactive decays occur within a 
time interval lasting from time ¢ to time t + At, the most straight- 
forward approach is to calculate it like this: 


(number of decays between ¢ and t + At) 

= N(t) — N(t+ At) 

= N(0) [ Pitot ) >. Paget + At)| 
= N(0) [0 0.5¢/t1/2 — 9. stttA0/t2] 
(0) 


= N(0 E oar) pati] 0.5/42 


A problem arises when At is small compared to t;/2. For instance, 
suppose you have a hunk of 102? atoms of 7°°U, with a half-life of 
700 million years, which is 2.2 x 10!° s. You want to know how many 
decays will occur in At = 1 s. Since we’re specifying the current 
number of atoms, t = 0. As you plug in to the formula above on 
your calculator, the quantity 0.54¢/41/2 comes out on your calculator 
to equal one, so the final result is zero. That’s incorrect, though. 
In reality, 0.54!/41/2 should equal 0.999999999999999968, but your 
calculator only gives eight digits of precision, so it rounded it off to 
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one. In other words, the probability that a ??°U atom will survive 
for 1 s is very close to one, but not equal to one. The number of 
decays in one second is therefore 3.2 x 10°, not zero. 


Well, my calculator only does eight digits of precision, just like 
yours, so how did I know the right answer? The way to do it is to 
use the following approximation: 


a’x1+blina, ifb<1 


(The symbol < means “is much less than.”) Using it, we can find 
the following approximation: 


(number of decays between t and t + At) 
= N(0) [1 — 0.54/12] 0,5*/42 


~ N(0) 1 = (1 ice! in0.5) 0.5¢/ 41/2 
ty /2 


At 
re (In 2).N(0)0.5%/41/2 — 
t1/2 
This also gives us a way to calculate the rate of decay, i.e., the 
number of decays per unit time. Dividing by At on both sides, we 
have 


(In 2) N(0) 


05/42 | ifAt<tyy 
1/2 


(decays per unit time) ~ 


The hot potato example 4 
> A nuclear physicist with a demented sense of humor tosses 
you a cigar box, yelling “hot potato.” The label on the box says 
“contains 107° atoms of '’F, half-life of 66 s, produced today in 
our reactor at 1 p.m.” It takes you two seconds to read the label, 
after which you toss it behind some lead bricks and run away. The 
time is 1:40 p.m. Will you die? 


> The time elapsed since the radioactive fluorine was produced 
in the reactor was 40 minutes, or 2400 s. The number of elapsed 
half-lives is therefore t/t;;2 = 36. The initial number of atoms 
was N(0) = 102° . The number of decays per second is now 
about 10’ s~', so it produced about 2 x 10” high-energy electrons 
while you held it in your hands. Although twenty million electrons 
sounds like a lot, it is not really enough to be dangerous. 


By the way, none of the equations we’ve derived so far was the 
actual probability distribution for the time at which a particular 
radioactive atom will decay. That probability distribution would be 
found by substituting N(0) = 1 into the equation for the rate of 
decay. 


If the sheer number of equations is starting to seem formidable, 
let’s pause and think for a second. The simple equation for Pyy,;y is 
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something you can derive easily from the law of independent prob- 
abilities any time you need it. From that, you can quickly find the 
exact equation for the rate of decay. The derivation of the approx- 
imate equations for At < t is a little hairier, but note that except 
for the factors of In2, everything in these equations can be found 
simply from considerations of logic and units. For instance, a longer 
half-life will obviously lead to a slower rate of decays, so it makes 
sense that we divide by it. As for the lIn2 factors, they are exactly 
the kind of thing that one looks up in a book when one needs to 
know them. 


Discussion Questions 


A In the medical procedure involving '$'1, why is it the gamma rays 


that are detected, not the electrons or neutrinos that are also emitted? 


B For 1 s, Fred holds in his hands 1 kg of radioactive stuff with a 
half-life of 1000 years. Ginger holds 1 kg of a different substance, with a 
half-life of 1 min, for the same amount of time. Did they place themselves 
in equal danger, or not? 


Cc How would you interpret it if you calculated N(t), and found it was 
less than one? 


D___ Does the half-life depend on how much of the substance you have? 
Does the expected time until the sample decays completely depend on 
how much of the substance you have? 


13.1.5 Applications of calculus 


The area under the probability distribution is of course an in- 
tegral. If we call the random number x and the probability distri- 
bution D(x), then the probability that «x lies in a certain range is 
given by 


b 
(probability of a< a <b) = i, Dede 


What about averages? If x had a finite number of equally probable 
values, we would simply add them up and divide by how many we 
had. If they weren’t equally likely, we’d make the weighted average 
r1P, + 2%2P2+... But we need to generalize this to a variable x that 
can take on any of a continuum of values. The continuous version 
of a sum is an integral, so the average is 


(average value of x) = [eD(oez ‘ 


where the integral is over all possible values of x. 


Probability distribution for radioactive decay example 5 
Here is a rigorous justification for the statement in subsection 
13.1.4 that the probability distribution for radioactive decay is found 
by substituting N(0) = 1 into the equation for the rate of decay. We 
know that the probability distribution must be of the form 


D(t) = k0.54/42 
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where k is a constant that we need to determine. The atom is 
guaranteed to decay eventually, so normalization gives us 


(probability of 0 < tf < co) =1 


= i D(t)dt 


The integral is most easily evaluated by converting the function 
into an exponential with e as the base 


D(t) = k exp [in (0.5/2) 


= k exp [a In 05] 
he 


= k exp (-7=1) 
he 


which gives an integral of the familiar form [ e%dx = (1/c)e™. 
We thus have 


Average lifetime example 6 
You might think that the half-life would also be the average life- 
time of an atom, since half the atoms’ lives are shorter and half 
longer. But the half whose lives are longer include some that sur- 
vive for many half-lives, and these rare long-lived atoms skew the 
average. We can calculate the average lifetime as follows: 


(average lifetime) = i t D(t) dt 
0 


Using the convenient base-e form again, we have 


(average lifetime) = al texp (-728) dt 
t2 Jo te 


This integral is of a form that can either be attacked with in- 
tegration by parts or by looking it up in a table. The result is 
J xe%dx = 0° — se, and the first term can be ignored for our 
purposes because it equals zero at both limits of integration. We 
end up with 


t 2 
(average lifetime) = we (74) 


tye \In2 
3 We 
~ In2 
=1.443 tj. , 


which is, as expected, longer than one half-life. 
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k/In recent decades, a huge hole in the ozone layer has spread 
out from Antarctica. Left: November 1978. Right: November 1992 


13.2 Light As a Particle 


The only thing that interferes with my learning is my education. 
Albert Einstein 


Radioactivity is random, but do the laws of physics exhibit ran- 
domness in other contexts besides radioactivity? Yes. Radioactive 
decay was just a good playpen to get us started with concepts of 
randomness, because all atoms of a given isotope are identical. By 
stocking the playpen with an unlimited supply of identical atom- 
toys, nature helped us to realize that their future behavior could be 
different regardless of their original identicality. We are now ready 
to leave the playpen, and see how randomness fits into the structure 
of physics at the most fundamental level. 


The laws of physics describe light and matter, and the quantum 
revolution rewrote both descriptions. Radioactivity was a good ex- 
ample of matter’s behaving in a way that was inconsistent with 
classical physics, but if we want to get under the hood and under- 
stand how nonclassical things happen, it will be easier to focus on 
light rather than matter. A radioactive atom such as uranium-235 
is after all an extremely complex system, consisting of 92 protons, 
143 neutrons, and 92 electrons. Light, however, can be a simple sine 
wave. 


However successful the classical wave theory of light had been 
— allowing the creation of radio and radar, for example — it still 
failed to describe many important phenomena. An example that 
is currently of great interest is the way the ozone layer protects us 
from the dangerous short-wavelength ultraviolet part of the sun’s 
spectrum. In the classical description, light is a wave. When a wave 
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passes into and back out of a medium, its frequency is unchanged, 
and although its wavelength is altered while it is in the medium, 
it returns to its original value when the wave reemerges. Luckily 
for us, this is not at all what ultraviolet light does when it passes 
through the ozone layer, or the layer would offer no protection at 
all! 


Evidence for light as a particle 


For a long time, physicists tried to explain away the problems 
with the classical theory of light as arising from an imperfect under- 
standing of atoms and the interaction of light with individual atoms 
and molecules. The ozone paradox, for example, could have been 
attributed to the incorrect assumption that one could think of the 
ozone layer as a smooth, continuous substance, when in reality it 
was made of individual ozone molecules. It wasn’t until 1905 that 
Albert Einstein threw down the gauntlet, proposing that the prob- 
lem had nothing to do with the details of light’s interaction with 
atoms and everything to do with the fundamental nature of light 
itself. 


a/ Digital camera images of dimmer and dimmer sources of light. 
The dots are records of individual photons. 


In those days the data were sketchy, the ideas vague, and the 
experiments difficult to interpret; it took a genius like Einstein to cut 
through the thicket of confusion and find a simple solution. Today, 
however, we can get right to the heart of the matter with a piece of 
ordinary consumer electronics, the digital camera. Instead of film, a 
digital camera has a computer chip with its surface divided up into a 
grid of light-sensitive squares, called “pixels.” Compared to a grain 
of the silver compound used to make regular photographic film, a 
digital camera pixel is activated by an amount of light energy orders 
of magnitude smaller. We can learn something new about light by 
using a digital camera to detect smaller and smaller amounts of 
light, as shown in figure a. Figure a/1 is fake, but a/2 and a/3 are 
real digital-camera images made by Prof. Lyman Page of Princeton 
University as a classroom demonstration. Figure a/1 is what we 
would see if we used the digital camera to take a picture of a fairly 


Section 13.2 Light As a Particle 


787 


b/ A wave is partially absorbed. 
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dim source of light. In figures a/2 and a/3, the intensity of the light 
was drastically reduced by inserting semitransparent absorbers like 
the tinted plastic used in sunglasses. Going from a/1 to a/2 to a/3, 
more and more light energy is being thrown away by the absorbers. 


The results are drastically different from what we would expect 
based on the wave theory of light. If light was a wave and nothing 
but a wave, b, then the absorbers would simply cut down the wave’s 
amplitude across the whole wavefront. The digital camera’s entire 
chip would be illuminated uniformly, and weakening the wave with 
an absorber would just mean that every pixel would take a long time 
to soak up enough energy to register a signal. 


But figures a/2 and a/3 show that some pixels take strong hits 
while others pick up no energy at all. Instead of the wave picture, 
the image that is naturally evoked by the data is something more 
like a hail of bullets from a machine gun, c. Each “bullet” of light 
apparently carries only a tiny amount of energy, which is why de- 
tecting them individually requires a sensitive digital camera rather 
than an eye or a piece of film. 


Although Einstein was interpreting different observations, this 
is the conclusion he reached in his 1905 paper: that the pure wave 
theory of light is an oversimplification, and that the energy of a beam 
of light comes in finite chunks rather than being spread smoothly 
throughout a region of space. 


We now think of these chunks as particles of light, and call them 
“photons,” although Einstein avoided the word “particle,” and the 
word “photon” was invented later. Regardless of words, the trou- 
ble was that waves and particles seemed like inconsistent categories. 
The reaction to Einstein’s paper could be kindly described as vig- 
orously skeptical. Even twenty years later, Einstein wrote, “There 
are therefore now two theories of light, both indispensable, and — 
as one must admit today despite twenty years of tremendous effort 
on the part of theoretical physicists — without any logical connec- 
tion.” In the remainder of this chapter we will learn how the seeming 
paradox was eventually resolved. 


Discussion Questions 


A Suppose someone rebuts the digital camera data in figure a, claim- 
ing that the random pattern of dots occurs not because of anything fun- 
damental about the nature of light but simply because the camera’s pixels 
are not all exactly the same — some are just more sensitive than others. 
How could we test this interpretation? 


B Discuss how the correspondence principle applies to the observa- 
tions and concepts discussed in this section. 


Chapter 13 Quantum Physics 


13.2.2 How much light is one photon? 
The photoelectric effect 


We have seen evidence that light energy comes in little chunks, 
so the next question to be asked is naturally how much energy is 
in one chunk. The most straightforward experimental avenue for 
addressing this question is a phenomenon known as the photoelec- 
tric effect. The photoelectric effect occurs when a photon strikes 
the surface of a solid object and knocks out an electron. It occurs 
continually all around you. It is happening right now at the surface 
of your skin and on the paper or computer screen from which you 
are reading these words. It does not ordinarily lead to any observ- 
able electrical effect, however, because on the average free electrons 
are wandering back in just as frequently as they are being ejected. 
(If an object did somehow lose a significant number of electrons, 
its growing net positive charge would begin attracting the electrons 
back more and more strongly.) 


Figure e shows a practical method for detecting the photoelec- 
tric effect. Two very clean parallel metal plates (the electrodes of a 
capacitor) are sealed inside a vacuum tube, and only one plate is ex- 
posed to light. Because there is a good vacuum between the plates, 
any ejected electron that happens to be headed in the right direc- 
tion will almost certainly reach the other capacitor plate without 
colliding with any air molecules. 


The illuminated (bottom) plate is left with a net positive charge, 
and the unilluminated (top) plate acquires a negative charge from 
the electrons deposited on it. There is thus an electric field between 
the plates, and it is because of this field that the electrons’ paths are 
curved, as shown in the diagram. However, since vacuum is a good 
insulator, any electrons that reach the top plate are prevented from 
responding to the electrical attraction by jumping back across the 
gap. Instead they are forced to make their way around the circuit, 
passing through an ammeter. The ammeter allows a measurement 
of the strength of the photoelectric effect. 


An unexpected dependence on frequency 


The photoelectric effect was discovered serendipitously by Hein- 
rich Hertz in 1887, as he was experimenting with radio waves. He 
was not particularly interested in the phenomenon, but he did notice 
that the effect was produced strongly by ultraviolet light and more 
weakly by lower frequencies. Light whose frequency was lower than 
a certain critical value did not eject any electrons at all. (In fact 
this was all prior to Thomson’s discovery of the electron, so Hertz 
would not have described the effect in terms of electrons — we are 
discussing everything with the benefit of hindsight.) This depen- 
dence on frequency didn’t make any sense in terms of the classical 
wave theory of light. A light wave consists of electric and magnetic 


e/Apparatus for observing 
the photoelectric effect. A beam 
of light strikes a capacitor plate 
inside a vacuum tube, and elec- 
trons are ejected (black arrows). 
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f/ The hamster in her hamster 
ball is like an electron emerging 
from the metal (tiled kitchen floor) 
into the surrounding vacuum 
(wood floor). The wood floor is 
higher than the tiled floor, so as 
she rolls up the step, the hamster 
will lose a certain amount of 
kinetic energy, analogous to Es. 
If her kinetic energy is too small, 
she won’t even make it up the 
step. 


fields. The stronger the fields, i.e., the greater the wave’s ampli- 
tude, the greater the forces that would be exerted on electrons that 
found themselves bathed in the light. It should have been amplitude 
(brightness) that was relevant, not frequency. The dependence on 
frequency not only proves that the wave model of light needs mod- 
ifying, but with the proper interpretation it allows us to determine 
how much energy is in one photon, and it also leads to a connec- 
tion between the wave and particle models that we need in order to 
reconcile them. 


To make any progress, we need to consider the physical process 
by which a photon would eject an electron from the metal electrode. 
A metal contains electrons that are free to move around. Ordinarily, 
in the interior of the metal, such an electron feels attractive forces 
from atoms in every direction around it. The forces cancel out. But 
if the electron happens to find itself at the surface of the metal, 
the attraction from the interior side is not balanced out by any 
attraction from outside. In popping out through the surface the 
electron therefore loses some amount of energy E,, which depends 
on the type of metal used. 


Suppose a photon strikes an electron, annihilating itself and giv- 
ing up all its energy to the electron. (We now know that this is what 
always happens in the photoelectric effect, although it had not yet 
been established in 1905 whether or not the photon was completely 
annihilated.) The electron will (1) lose kinetic energy through colli- 
sions with other electrons as it plows through the metal on its way 
to the surface; (2) lose an amount of kinetic energy equal to FE, as 
it emerges through the surface; and (3) lose more energy on its way 
across the gap between the plates, due to the electric field between 
the plates. Even if the electron happens to be right at the surface of 
the metal when it absorbs the photon, and even if the electric field 
between the plates has not yet built up very much, FE, is the bare 
minimum amount of energy that it must receive from the photon 
if it is to contribute to a measurable current. The reason for using 
very clean electrodes is to minimize EF, and make it have a definite 
value characteristic of the metal surface, not a mixture of values 
due to the various types of dirt and crud that are present in tiny 
amounts on all surfaces in everyday life. 


We can now interpret the frequency dependence of the photo- 
electric effect in a simple way: apparently the amount of energy 
possessed by a photon is related to its frequency. A low-frequency 
red or infrared photon has an energy less than E;, so a beam of 
them will not produce any current. A high-frequency blue or violet 
photon, on the other hand, packs enough of a punch to allow an 
electron to make it to the other plate. At frequencies higher than 
the minimum, the photoelectric current continues to increase with 
the frequency of the light because of effects (1) and (3). 
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Numerical relationship between energy and frequency 


Prompted by Einstein’s photon paper, Robert Millikan (whom 
we first encountered in chapter 8) figured out how to use the pho- 
toelectric effect to probe precisely the link between frequency and 
photon energy. Rather than going into the historical details of Mil- 
likan’s actual experiments (a lengthy experimental program that 
occupied a large part of his professional career) we will describe a 
simple version, shown in figure g, that is used sometimes in college 
laboratory courses.? The idea is simply to illuminate one plate of 
the vacuum tube with light of a single wavelength and monitor the 
voltage difference between the two plates as they charge up. Since 
the resistance of a voltmeter is very high (much higher than the 
resistance of an ammeter), we can assume to a good approximation 
that electrons reaching the top plate are stuck there permanently, 
so the voltage will keep on increasing for as long as electrons are 
making it across the vacuum tube. 


At a moment when the voltage difference has a reached a value 
AV, the minimum energy required by an electron to make it out of 
the bottom plate and across the gap to the other plate is FE, + eAV. 
As AV increases, we eventually reach a point at which EF, + eAV 
equals the energy of one photon. No more electrons can cross the 
gap, and the reading on the voltmeter stops rising. The quantity 
F,+eAV now tells us the energy of one photon. If we determine this 
energy for a variety of wavelengths, h, we find the following simple 
relationship between the energy of a photon and the frequency of 


g/A_ different way of study- 
ing the photoelectric effect. 


the light: 
E=hf ; 
Pos 
where h is a constant with the value 6.63 x 10~°4 J-s. Note how Ps 


the equation brings the wave and particle models of light under the E,+eAV Pou 
same roof: the left side is the energy of one particle of light, while 
the right side is the frequency of the same light, interpreted as a 
wave. The constant h is known as Planck’s constant, for historical f 
reasons explained in the footnote beginning on the preceding page. 


h/The quantity E; + eAV in- 
self-check C dicates the energy of one photon. 
It is found to be proportional to 
the frequency of the light. 


?What I’m presenting in this chapter is a simplified explanation of how the 
photon could have been discovered. The actual history is more complex. Max 
Planck (1858-1947) began the photon saga with a theoretical investigation of 
the spectrum of light emitted by a hot, glowing object. He introduced quanti- 
zation of the energy of light waves, in multiples of hf, purely as a mathematical 
trick that happened to produce the right results. Planck did not believe that 
his procedure could have any physical significance. In his 1905 paper Einstein 
took Planck’s quantization as a description of reality, and applied it to various 
theoretical and experimental puzzles, including the photoelectric effect. Mil- 
likan then subjected Einstein’s ideas to a series of rigorous experimental tests. 
Although his results matched Einstein’s predictions perfectly, Millikan was skep- 
tical about photons, and his papers conspicuously omit any reference to them. 
Only in his autobiography did Millikan rewrite history and claim that he had 
given experimental proof for photons. 
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How would you extract h from the graph in figure h? What if you didn’t 
even know E, in advance, and could only graph eAV versus f? > 
Answer, p. 873 


Since the energy of a photon is hf, a beam of light can only have 
energies of hf, 2hf, 3hf, etc. Its energy is quantized — there is no 
such thing as a fraction of a photon. Quantum physics gets its name 
from the fact that it quantizes quantities like energy, momentum, 
and angular momentum that had previously been thought to be 
smooth, continuous and infinitely divisible. 


Number of photons emitted by a lightbulb per second example 7 
> Roughly how many photons are emitted by a 100-W lightbulb in 
1 second? 


> People tend to remember wavelengths rather than frequencies 
for visible light. The bulb emits photons with a range of frequen- 
cies and wavelengths, but let’s take 600 nm as a typical wave- 
length for purposes of estimation. The energy of a single photon 
is 


Ephoton = hf 
=hce/A 
A power of 100 W means 100 joules per second, so the number 
of photons is 


(100 J)/Ephoton = (100 J)/(he/A) 
~ 3x 107° 


Momentum of a photon example 8 
> According to the theory of relativity, the momentum of a beam 
of light is given by p = E/c. Apply this to find the momentum 
of a single photon in terms of its frequency, and in terms of its 
wavelength. 


> Combining the equations p = E/c and E = hf, we find 


peEle 


The second form turns out to be simpler. 
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Discussion Questions 


A The photoelectric effect only ever ejects a very tiny percentage of 
the electrons available near the surface of an object. How well does this 
agree with the wave model of light, and how well with the particle model? 
Consider the two different distance scales involved: the wavelength of the 
light, and the size of an atom, which is on the order of 10-'° or 10-9 m. 


B What is the significance of the fact that Planck’s constant is numeri- 
cally very small? How would our everyday experience of light be different 
if it was not so small? 


C —_ How would the experiments described above be affected if a single 
electron was likely to get hit by more than one photon? 


D Draw some representative trajectories of electrons for AV = 0, AV 
less than the maximum value, and AV greater than the maximum value. 


E Explain based on the photon theory of light why ultraviolet light would 
be more likely than visible or infrared light to cause cancer by damaging 
DNA molecules. How does this relate to discussion question C? 


F Does E = hf imply that a photon changes its energy when it passes 
from one transparent material into another substance with a different in- 
dex of refraction? 


13.2.3 Wave-particle duality 


How can light be both a particle and a wave? We are now 
ready to resolve this seeming contradiction. Often in science when 
something seems paradoxical, it’s because we (1) don’t define our 
terms carefully, or (2) don’t test our ideas against any specific real- 
world situation. Let’s define particles and waves as follows: 


e Waves exhibit superposition, and specifically interference phe- 
nomena. 


e Particles can only exist in whole numbers, not fractions. 


As a real-world check on our philosophizing, there is one partic- 
ular experiment that works perfectly. We set up a double-slit inter- 
ference experiment that we know will produce a diffraction pattern 
if light is an honest-to-goodness wave, but we detect the light with 
a detector that is capable of sensing individual photons, e.g., a dig- 
ital camera. To make it possible to pick out individual dots due to 
individual photons, we must use filters to cut down the intensity of 
the light to a very low level, just as in the photos by Prof. Page on 
page 787. The whole thing is sealed inside a light-tight box. The 
results are shown in figure i. (In fact, the similar figures in on page 
787 are simply cutouts from these figures.) 


Neither the pure wave theory nor the pure particle theory can 
explain the results. If light was only a particle and not a wave, there 
would be no interference effect. The result of the experiment would 
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i/Wave_ interference patterns 
photographed by Prof. Lyman 
Page with a digital camera. Laser 
light with a single well-defined 
wavelength passed through a 
series of absorbers to cut down 
its intensity, then through a set of 
slits to produce interference, and 
finally into a digital camera chip. 
(A triple slit was actually used, 
but for conceptual simplicity we 
discuss the results in the main 
text as if it was a double slit.) In 
panel 2 the intensity has been 
reduced relative to 1, and even 
more so for panel 3. 


be like firing a hail of bullets through a double slit, j. Only two 
spots directly behind the slits would be hit. 


If, on the other hand, light was only a wave and not a particle, 
we would get the same kind of diffraction pattern that would happen 
with a water wave, k. There would be no discrete dots in the photo, 
only a diffraction pattern that shaded smoothly between light and 
dark. 


Applying the definitions to this experiment, light must be both 
a particle and a wave. It is a wave because it exhibits interference 
effects. At the same time, the fact that the photographs contain 
discrete dots is a direct demonstration that light refuses to be split 
into units of less than a single photon. There can only be whole 
numbers of photons: four photons in figure i/3, for example. 


A wrong interpretation: photons interfering with each other 


One possible interpretation of wave-particle duality that occurred 
to physicists early in the game was that perhaps the interference ef- 
fects came from photons interacting with each other. By analogy, a 
water wave consists of moving water molecules, and interference of 
water waves results ultimately from all the mutual pushes and pulls 
of the molecules. This interpretation was conclusively disproved by 
G.I. Taylor, a student at Cambridge. The demonstration by Prof. 
Page that we’ve just been discussing is essentially a modernized 
version of Taylor’s work. Taylor reasoned that if interference effects 
came from photons interacting with each other, a bare minimum of 
two photons would have to be present at the same time to produce 
interference. By making the light source extremely dim, we can be 
virtually certain that there are never two photons in the box at the 
same time. In figure i/3, however, the intensity of the light has 
been cut down so much by the absorbers that if it was in the open, 
the average separation between photons would be on the order of 
a kilometer! At any given moment, the number of photons in the 
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box is most likely to be zero. It is virtually certain that there were 
never two photons in the box at once. 


The concept of a photon’s path is undefined. 


If a single photon can demonstrate double-slit interference, then 
which slit did it pass through? The unavoidable answer must be that 
it passes through both! This might not seem so strange if we think 
of the photon as a wave, but it is highly counterintuitive if we try 
to visualize it as a particle. The moral is that we should not think 
in terms of the path of a photon. Like the fully human and fully 
divine Jesus of Christian theology, a photon is supposed to be 100% 
wave and 100% particle. If a photon had a well defined path, then it 
would not demonstrate wave superposition and interference effects, 
contradicting its wave nature. (In subsection 13.3.4 we will discuss 
the Heisenberg uncertainty principle, which gives a numerical way 
of approaching this issue.) 


Another wrong interpretation: the pilot wave hypothesis 


A second possible explanation of wave-particle duality was taken 
seriously in the early history of quantum mechanics. What if the 
photon particle is like a surfer riding on top of its accompanying 
wave? As the wave travels along, the particle is pushed, or “piloted” 
by it. Imagining the particle and the wave as two separate entities 
allows us to avoid the seemingly paradoxical idea that a photon is 
both at once. The wave happily does its wave tricks, like super- 
position and interference, and the particle acts like a respectable 
particle, resolutely refusing to be in two different places at once. If 
the wave, for instance, undergoes destructive interference, becoming 
nearly zero in a particular region of space, then the particle simply 
is not guided into that region. 


The problem with the pilot wave interpretation is that the only 
way it can be experimentally tested or verified is if someone manages 
to detach the particle from the wave, and show that there really are 
two entities involved, not just one. Part of the scientific method is 
that hypotheses are supposed to be experimentally testable. Since 
nobody has ever managed to separate the wavelike part of a photon 
from the particle part, the interpretation is not useful or meaningful 
in a scientific sense. 


The probability interpretation 


The correct interpretation of wave-particle duality is suggested 
by the random nature of the experiment we’ve been discussing: even 
though every photon wave/particle is prepared and released in the 
same way, the location at which it is eventually detected by the 
digital camera is different every time. The idea of the probability 
interpretation of wave-particle duality is that the location of the 
photon-particle is random, but the probability that it is in a certain 
location is higher where the photon-wave’s amplitude is greater. 


j/ Bullets pass through a double 
slit. 


k/ A water wave passes through 
a double slit. 


1/A_ single photon can go 
through both slits. 
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m/ Example 9. 


More specifically, the probability distribution of the particle must 
be proportional to the square of the wave’s amplitude, 


(probability distribution) o (amplitude)? 


This follows from the correspondence principle and from the fact 
that a wave’s energy density is proportional to the square of its am- 
plitude. If we run the double-slit experiment for a long enough time, 
the pattern of dots fills in and becomes very smooth as would have 
been expected in classical physics. To preserve the correspondence 
between classical and quantum physics, the amount of energy de- 
posited in a given region of the picture over the long run must be 
proportional to the square of the wave’s amplitude. The amount of 
energy deposited in a certain area depends on the number of pho- 
tons picked up, which is proportional to the probability of finding 
any given photon there. 


‘A microwave oven example 9 
> The figure shows two-dimensional (top) and one-dimensional 
(bottom) representations of the standing wave inside a microwave 
oven. Gray represents zero field, and white and black signify the 
strongest fields, with white being a field that is in the opposite di- 
rection compared to black. Compare the probabilities of detecting 
a microwave photon at points A, B, and C. 


> A and C are both extremes of the wave, so the probabilities of 
detecting a photon at A and C are equal. It doesn’t matter that we 
have represented C as negative and A as positive, because it is 
the square of the amplitude that is relevant. The amplitude at B is 
about 1/2 as much as the others, so the probability of detecting a 
photon there is about 1/4 as much. 


The probability interpretation was disturbing to physicists who 
had spent their previous careers working in the deterministic world 
of classical physics, and ironically the most strenuous objections 
against it were raised by Einstein, who had invented the photon 
concept in the first place. The probability interpretation has never- 
theless passed every experimental test, and is now as well established 
as any part of physics. 


An aspect of the probability interpretation that has made many 
people uneasy is that the process of detecting and recording the 
photon’s position seems to have a magical ability to get rid of the 
wavelike side of the photon’s personality and force it to decide for 
once and for all where it really wants to be. But detection or mea- 
surement is after all only a physical process like any other, governed 
by the same laws of physics. We will postpone a detailed discussion 
of this issue until the following chapter, since a measuring device 
like a digital camera is made of matter, but we have so far only 
discussed how quantum mechanics relates to light. 


‘What is the proportionality constant? example 10 
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> What is the proportionality constant that would make an actual 
equation out of (probability distribution) o (amplitude)*? 
> The probability that the photon is in a certain small region of 


volume v should equal the fraction of the wave’s energy that is 
within that volume: 


energy in volume v 
7 energy of photon 

energy in volume v 
> hf 


We assume v is small enough so that the electric and magnetic 
fields are nearly constant throughout it. We then have 


(stelEl? + ot IBI?) v 
hi 


We can simplify this formidable looking expression by recogniz- 
ing that in an electromagnetic wave, |E| and |B| are related by 
|E| = c|B|. With some algebra, it turns out that the electric and 
magnetic fields each contribute half the total energy, so we can 
simplify this to 


p — 2 (anklEl*) v 
hf 
SEP 
4nkhf 


As advertised, the probability is proportional to the square of 
the wave’s amplitude. 


Discussion Questions 


A Referring back to the example of the carrot in the microwave oven, 
show that it would be nonsensical to have probability be proportional to 
the field itself, rather than the square of the field. 


B Einstein did not try to reconcile the wave and particle theories of 
light, and did not say much about their apparent inconsistency. Einstein 
basically visualized a beam of light as a stream of bullets coming from 
a machine gun. In the photoelectric effect, a photon “bullet” would only 
hit one atom, just as a real bullet would only hit one person. Suppose 
someone reading his 1905 paper wanted to interpret it by saying that 
Einstein’s so-called particles of light are simply short wave-trains that only 
occupy a small region of space. Comparing the wavelength of visible light 
(a few hundred nm) to the size of an atom (on the order of 0.1 nm), explain 
why this poses a difficulty for reconciling the particle and wave theories. 


C Can awhite photon exist? 


D In double-slit diffraction of photons, would you get the same pattern 
of dots on the digital camera image if you covered one slit? Why should it 
matter whether you give the photon two choices or only one? 
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7 


volume 
under'surface 


n/ Probability is the volume 


under 
D(x, y). 
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a surface defined by 


13.2.4 Photons in three dimensions 


Up until now I’ve been sneaky and avoided a full discussion of 
the three-dimensional aspects of the probability interpretation. The 
example of the carrot in the microwave oven, for example, reduced 
to a one-dimensional situation because we were considering three 
points along the same line and because we were only comparing 
ratios of probabilities. The purpose of bringing it up now is to head 
off any feeling that you’ve been cheated conceptually rather than to 
prepare you for mathematical problem solving in three dimensions, 
which would not be appropriate for the level of this course. 


A typical example of a probability distribution in section 13.1 
was the distribution of heights of human beings. The thing that 
varied randomly, height, h, had units of meters, and the probabil- 
ity distribution was a graph of a function D(h). The units of the 
probability distribution had to be m~! (inverse meters) so that ar- 
eas under the curve, interpreted as probabilities, would be unitless: 
(area) = (height)(width) = m7! - m. 


Now suppose we have a two-dimensional problem, e.g., the prob- 
ability distribution for the place on the surface of a digital camera 
chip where a photon will be detected. The point where it is detected 
would be described with two variables, x and y, each having units 
of meters. The probability distribution will be a function of both 
variables, D(x,y). A probability is now visualized as the volume 
under the surface described by the function D(x,y), as shown in 
figure n. The units of D must be m~? so that probabilities will be 
unitless: (probability) = (depth) (length)(width) = m~?-m-m. In 
terms of calculus, we have P = { D dx dy. 


Generalizing finally to three dimensions, we find by analogy that 
the probability distribution will be a function of all three coordi- 
nates, D(x,y,z), and will have units of m7. It is unfortunately 
impossible to visualize the graph unless you are a mutant with a 
natural feel for life in four dimensions. If the probability distri- 
bution is nearly constant within a certain volume of space v, the 
probability that the photon is in that volume is simply vD. If not, 
then we can use an integral, P = { D dx dy dz. 
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13.3 Matter As a Wave 


[In] a few minutes I shall be all melted... I have been wicked in my 
day, but I never thought a little girl like you would ever be able to 
melt me and end my wicked deeds. Look out — here I go! 


The Wicked Witch of the West 


As the Wicked Witch learned the hard way, losing molecular 
cohesion can be unpleasant. That’s why we should be very grate- 
ful that the concepts of quantum physics apply to matter as well 
as light. If matter obeyed the laws of classical physics, molecules 
wouldn’t exist. 


Consider, for example, the simplest atom, hydrogen. Why does 
one hydrogen atom form a chemical bond with another hydrogen 
atom? Roughly speaking, we’d expect a neighboring pair of hy- 
drogen atoms, A and B, to exert no force on each other at all, 
attractive or repulsive: there are two repulsive interactions (proton 
A with proton B and electron A with electron B) and two attractive 
interactions (proton A with electron B and electron A with proton 
B). Thinking a little more precisely, we should even expect that once 
the two atoms got close enough, the interaction would be repulsive. 
For instance, if you squeezed them so close together that the two 
protons were almost on top of each other, there would be a tremen- 
dously strong repulsion between them due to the 1/r? nature of the 
electrical force. The repulsion between the electrons would not be 
as strong, because each electron ranges over a large area, and is not 
likely to be found right on top of the other electron. Thus hydrogen 
molecules should not exist according to classical physics. 


Quantum physics to the rescue! As we’ll see shortly, the whole 
problem is solved by applying the same quantum concepts to elec- 


Section 13.3. Matter As a Wave 


799 


800 


trons that we have already used for photons. 


13.3.1 Electrons as waves 


We started our journey into quantum physics by studying the 
random behavior of matter in radioactive decay, and then asked how 
randomness could be linked to the basic laws of nature governing 
light. The probability interpretation of wave-particle duality was 
strange and hard to accept, but it provided such a link. It is now 
natural to ask whether the same explanation could be applied to 
matter. If the fundamental building block of light, the photon, is 
a particle as well as a wave, is it possible that the basic units of 
matter, such as electrons, are waves as well as particles? 


A young French aristocrat studying physics, Louis de Broglie 
(pronounced “broylee”), made exactly this suggestion in his 1923 
Ph.D. thesis. His idea had seemed so farfetched that there was 
serious doubt about whether to grant him the degree. Einstein was 
asked for his opinion, and with his strong support, de Broglie got 
his degree. 


Only two years later, American physicists C.J. Davisson and L. 
Germer confirmed de Broglie’s idea by accident. They had been 
studying the scattering of electrons from the surface of a sample 
of nickel, made of many small crystals. (One can often see such a 
crystalline pattern on a brass doorknob that has been polished by 
repeated handling.) An accidental explosion occurred, and when 
they put their apparatus back together they observed something 
entirely different: the scattered electrons were now creating an in- 
terference pattern! This dramatic proof of the wave nature of matter 
came about because the nickel sample had been melted by the ex- 
plosion and then resolidified as a single crystal. The nickel atoms, 
now nicely arranged in the regular rows and columns of a crystalline 
lattice, were acting as the lines of a diffraction grating. The new 
crystal was analogous to the type of ordinary diffraction grating in 
which the lines are etched on the surface of a mirror (a reflection 
grating) rather than the kind in which the light passes through the 
transparent gaps between the lines (a transmission grating). 


Although we will concentrate on the wave-particle duality of elec- 
trons because it is important in chemistry and the physics of atoms, 
all the other “particles” of matter you’ve learned about show wave 
properties as well. Figure a, for instance, shows a wave interference 
pattern of neutrons. 


It might seem as though all our work was already done for us, 
and there would be nothing new to understand about electrons: 
they have the same kind of funny wave-particle duality as photons. 
That’s almost true, but not quite. There are some important ways 
in which electrons differ significantly from photons: 
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a/A double-slit interference pattern made with neutrons. (A. Zeilinger, 
R. Gahler, C.G. Shull, W. Treimer, and W. Mampe, Reviews of Modern 
Physics, Vol. 60, 1988.) 


1. Electrons have mass, and photons don’t. 


2. Photons always move at the speed of light, but electrons can 
move at any speed less than c. 


3. Photons don’t have electric charge, but electrons do, so electric 
forces can act on them. The most important example is the 
atom, in which the electrons are held by the electric force of 
the nucleus. 


4. Electrons cannot be absorbed or emitted as photons are. De- 
stroying an electron or creating one out of nothing would vio- 
late conservation of charge. 


(In section 13.4 we will learn of one more fundamental way in which 
electrons differ from photons, for a total of five.) 


Because electrons are different from photons, it is not immedi- 
ately obvious which of the photon equations from chapter 11 can be 
applied to electrons as well. A particle property, the energy of one 
photon, is related to its wave properties via EF = hf or, equivalently, 
E = hc/X. The momentum of a photon was given by p = hf/c or 
p =h/A. Ultimately it was a matter of experiment to determine 
which of these equations, if any, would work for electrons, but we 
can make a quick and dirty guess simply by noting that some of 
the equations involve c, the speed of light, and some do not. Since 
c is irrelevant in the case of an electron, we might guess that the 
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equations of general validity are those that do not have c in them: 


E=hf 
p=h/r 


This is essentially the reasoning that de Broglie went through, 
and experiments have confirmed these two equations for all the fun- 
damental building blocks of light and matter, not just for photons 
and electrons. 


The second equation, which I soft-pedaled in the previous chap- 
ter, takes on a greater important for electrons. This is first of all 
because the momentum of matter is more likely to be significant 
than the momentum of light under ordinary conditions, and also 
because force is the transfer of momentum, and electrons are af- 
fected by electrical forces. 


The wavelength of an elephant example 11 
> What is the wavelength of a trotting elephant? 


> One may doubt whether the equation should be applied to an 
elephant, which is not just a single particle but a rather large col- 
lection of them. Throwing caution to the wind, however, we esti- 
mate the elephant’s mass at 10° kg and its trotting speed at 10 
m/s. Its wavelength is therefore roughly 


em 
p 
3 
~ mv 
_ 6.63 x 10-34 Js 
~ (108 kg)(10 m/s) 
wipe? (kg-m?/s?)-s 


kg-m/s 
S107 mM 


The wavelength found in this example is so fantastically small 
that we can be sure we will never observe any measurable wave 
phenomena with elephants or any other human-scale objects. The 
result is numerically small because Planck’s constant is so small, 
and as in some examples encountered previously, this smallness is 
in accord with the correspondence principle. 


Although a smaller mass in the equation 4 = h/mv does re- 
sult in a longer wavelength, the wavelength is still quite short even 
for individual electrons under typical conditions, as shown in the 
following example. 


The typical wavelength of an electron example 12 
> Electrons in circuits and in atoms are typically moving through 


Chapter 13 Quantum Physics 


voltage differences on the order of 1 V, so that a typical energy is 
(e)(1 V), which is on the order of 10—19 J. What is the wavelength 
of an electron with this amount of kinetic energy? 


> This energy is nonrelativistic, since it is much less than mc?. 
Momentum and energy are therefore related by the nonrelativistic 
equation K = p*/2m. Solving for p and substituting in to the 
equation for the wavelength, we find 


=1.6x10-9m 


This is on the same order of magnitude as the size of an atom, 
which is no accident: as we will discuss in the next chapter in 
more detail, an electron in an atom can be interpreted as a stand- 
ing wave. The smallness of the wavelength of a typical electron 
also helps to explain why the wave nature of electrons wasn’t dis- 
covered until a hundred years after the wave nature of light. To 
scale the usual wave-optics devices such as diffraction gratings 
down to the size needed to work with electrons at ordinary ener- 
gies, we need to make them so small that their parts are compa- 
rable in size to individual atoms. This is essentially what Davisson 
and Germer did with their nickel crystal. 


self-check D 


These remarks about the inconvenient smallness of electron wavelengths 


apply only under the assumption that the electrons have typical ener- 
gies. What kind of energy would an electron have to have in order to 
have a longer wavelength that might be more convenient to work with? 
> Answer, p. 873 


What kind of wave is it? 


If a sound wave is a vibration of matter, and a photon is a 
vibration of electric and magnetic fields, what kind of a wave is 
an electron made of? The disconcerting answer is that there is 
no experimental “observable,” i.e., directly measurable quantity, to 
correspond to the electron wave itself. In other words, there are 
devices like microphones that detect the oscillations of air pressure 
in a sound wave, and devices such as radio receivers that measure 
the oscillation of the electric and magnetic fields in a light wave, 
but nobody has ever found any way to measure the electron wave 
directly. 


We can of course detect the energy (or momentum) possessed by 
an electron just as we could detect the energy of a photon using a 
digital camera. (In fact I’d imagine that an unmodified digital cam- 
era chip placed in a vacuum chamber would detect electrons just as 
handily as photons.) But this only allows us to determine where the 


b/These two electron waves 
are not distinguishable by any 


measuring device. 
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wave carries high probability and where it carries low probability. 
Probability is proportional to the square of the wave’s amplitude, 
but measuring its square is not the same as measuring the wave 
itself. In particular, we get the same result by squaring either a 
positive number or its negative, so there is no way to determine the 
positive or negative sign of an electron wave. 


Most physicists tend toward the school of philosophy known as 
operationalism, which says that a concept is only meaningful if we 
can define some set of operations for observing, measuring, or test- 
ing it. According to a strict operationalist, then, the electron wave 
itself is a meaningless concept. Nevertheless, it turns out to be one 
of those concepts like love or humor that is impossible to measure 
and yet very useful to have around. We therefore give it a symbol, 
W (the capital Greek letter psi), and a special name, the electron 
wavefunction (because it is a function of the coordinates x, y, and 
z that specify where you are in space). It would be impossible, for 
example, to calculate the shape of the electron wave in a hydro- 
gen atom without having some symbol for the wave. But when the 
calculation produces a result that can be compared directly to ex- 
periment, the final algebraic result will turn out to involve only W?, 
which is what is observable, not W itself. 


Since W, unlike EF and B, is not directly measurable, we are free 
to make the probability equations have a simple form: instead of 
having the probability density equal to some funny constant multi- 
plied by ?, we simply define V so that the constant of proportion- 
ality is one: 


(probability distribution) = V? 


Since the probability distribution has units of m~°, the units of U 
must be m7?/2. 


Discussion Question 


A _ Frequency is oscillations per second, whereas wavelength is meters 
per oscillation. How could the equations E = hf and p = h/A be made 
to look more alike by using quantities that were more closely analogous? 
(This more symmetric treatment makes it easier to incorporate relativity 
into quantum mechanics, since relativity says that space and time are not 
entirely separate.) 


13.3.2 Dispersive waves 


A colleague of mine who teaches chemistry loves to tell the story 
about an exceptionally bright student who, when told of the equa- 
tion p = h/A, protested, “But when I derived it, it had a factor of 
2!” The issue that’s involved is a real one, albeit one that could be 
glossed over (and is, in most textbooks) without raising any alarms 
in the mind of the average student. The present optional section 
addresses this point; it is intended for the student who wishes to 
delve a little deeper. 
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Here’s how the now-legendary student was presumably reason- 
ing. We start with the equation v = fA, which is valid for any sine 
wave, whether it’s quantum or classical. Let’s assume we already 
know E = hf, and are trying to derive the relationship between 
wavelength and momentum: 


A= 


The reasoning seems valid, but the result does contradict the 
accepted one, which is after all solidly based on experiment. 


The mistaken assumption is that we can figure everything out in 
terms of pure sine waves. Mathematically, the only wave that has 
a perfectly well defined wavelength and frequency is a sine wave, 
and not just any sine wave but an infinitely long sine wave, c. The 
unphysical thing about such a wave is that it has no leading or 
trailing edge, so it can never be said to enter or leave any particular 
region of space. Our derivation made use of the velocity, v, and if 
velocity is to be a meaningful concept, it must tell us how quickly 
stuff (mass, energy, momentum, ...) is transported from one region 
of space to another. Since an infinitely long sine wave doesn’t remove 
any stuff from one region and take it to another, the “velocity of its 
stuff” is not a well defined concept. 


Of course the individual wave peaks do travel through space, and 
one might think that it would make sense to associate their speed 
with the “speed of stuff,” but as we will see, the two velocities are 
in general unequal when a wave’s velocity depends on wavelength. 
Such a wave is called a dispersive wave, because a wave pulse consist- 
ing of a superposition of waves of different wavelengths will separate 
(disperse) into its separate wavelengths as the waves move through 
space at different speeds. Nearly all the waves we have encountered 
have been nondispersive. For instance, sound waves and light waves 
(in a vacuum) have speeds independent of wavelength. A water wave 
is one good example of a dispersive wave. Long-wavelength water 
waves travel faster, so a ship at sea that encounters a storm typi- 
cally sees the long-wavelength parts of the wave first. When dealing 
with dispersive waves, we need symbols and words to distinguish 
the two speeds. The speed at which wave peaks move is called the 
phase velocity, vp, and the speed at which “stuff” moves is called 
the group velocity, vg. 


c/ Part of an infinite sine wave. 


d/A_ finite-length sine wave. 


e/A beat pattern created by 
superimposing two sine waves 


with slightly different 
lengths. 


wave- 
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An infinite sine wave can only tell us about the phase velocity, 
not the group velocity, which is really what we would be talking 
about when we refer to the speed of an electron. If an infinite 
sine wave is the simplest possible wave, what’s the next best thing? 
We might think the runner up in simplicity would be a wave train 
consisting of a chopped-off segment of a sine wave, d. However, this 
kind of wave has kinks in it at the end. A simple wave should be 
one that we can build by superposing a small number of infinite 
sine waves, but a kink can never be produced by superposing any 
number of infinitely long sine waves. 


Actually the simplest wave that transports stuff from place to 
place is the pattern shown in figure e. Called a beat pattern, it is 
formed by superposing two sine waves whose wavelengths are similar 
but not quite the same. If you have ever heard the pulsating howling 
sound of musicians in the process of tuning their instruments to each 
other, you have heard a beat pattern. The beat pattern gets stronger 
and weaker as the two sine waves go in and out of phase with each 
other. The beat pattern has more “stuff” (energy, for example) 
in the areas where constructive interference occurs, and less in the 
regions of cancellation. As the whole pattern moves through space, 
stuff is transported from some regions and into other ones. 


If the frequency of the two sine waves differs by 10%, for in- 
stance, then ten periods will be occur between times when they are 
in phase. Another way of saying it is that the sinusoidal “envelope” 
(the dashed lines in figure e) has a frequency equal to the difference 
in frequency between the two waves. For instance, if the waves had 
frequencies of 100 Hz and 110 Hz, the frequency of the envelope 
would be 10 Hz. 


To apply similar reasoning to the wavelength, we must define a 
quantity z = 1/A that relates to wavelength in the same way that 
frequency relates to period. In terms of this new variable, the z of 
the envelope equals the difference between the z's of the two sine 
waves. 


The group velocity is the speed at which the envelope moves 
through space. Let Af and Az be the differences between the 
frequencies and 2’s of the two sine waves, which means that they 
equal the frequency and z of the envelope. The group velocity is 
Ug = fenveloperenvelope = Af /Az. If Af and Az are sufficiently 


small, we can approximate this expression as a derivative, 


_ af 

USS fae 
This expression is usually taken as the definition of the group veloc- 
ity for wave patterns that consist of a superposition of sine waves 
having a narrow range of frequencies and wavelengths. In quan- 
tum mechanics, with f = E/h and z = p/h, we have vg = dE/dp. 
In the case of a nonrelativistic electron the relationship between 
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energy and momentum is E = p?/2m, so the group velocity is 
dE/dp = p/m = v, exactly what it should be. It is only the phase 
velocity that differs by a factor of two from what we would have 
expected, but the phase velocity is not the physically important 
thing. 


13.3.3 Bound states 


Electrons are at their most interesting when they’re in atoms, 
that is, when they are bound within a small region of space. We 
can understand a great deal about atoms and molecules based on 
simple arguments about such bound states, without going into any 
of the realistic details of atom. The simplest model of a bound state 
is known as the particle in a box: like a ball on a pool table, the 
electron feels zero force while in the interior, but when it reaches 
an edge it encounters a wall that pushes back inward on it with 
a large force. In particle language, we would describe the electron 
as bouncing off of the wall, but this incorrectly assumes that the 
electron has a certain path through space. It is more correct to 
describe the electron as a wave that undergoes 100% reflection at 
the boundaries of the box. 


Like a generation of physics students before me, I rolled my 
eyes when initially introduced to the unrealistic idea of putting a 
particle in a box. It seemed completely impractical, an artificial 
textbook invention. Today, however, it has become routine to study 
electrons in rectangular boxes in actual laboratory experiments. The 
“box” is actually just an empty cavity within a solid piece of silicon, 
amounting in volume to a few hundred atoms. The methods for 
creating these electron-in-a-box setups (known as “quantum dots”) 
were a by-product of the development of technologies for fabricating 
computer chips. 


For simplicity let’s imagine a one-dimensional electron in a box, 
i.e., we assume that the electron is only free to move along a line. 
The resulting standing wave patterns, of which the first three are 
shown in the figure, are just like some of the patterns we encoun- 
tered with sound waves in musical instruments. The wave patterns 
must be zero at the ends of the box, because we are assuming the 
walls are impenetrable, and there should therefore be zero proba- 
bility of finding the electron outside the box. Each wave pattern is 
labeled according to n, the number of peaks and valleys it has. In 
quantum physics, these wave patterns are referred to as “states” of 
the particle-in-the-box system. 


The following seemingly innocuous observations about the par- 
ticle in the box lead us directly to the solutions to some of the most 
vexing failures of classical physics: 


The particle’s energy is quantized (can only have certain values). 
Each wavelength corresponds to a certain momentum, and a given 


n=3 


f/ Three possible — standing- 
wave patterns for a particle in a 
box. 
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g/The spectrum of the 
from the star Sirius. 


light 


momentum implies a definite kinetic energy, E = p?/2m. (This is 
the second type of energy quantization we have encountered. The 
type we studied previously had to do with restricting the number 
of particles to a whole number, while assuming some specific wave- 
length and energy for each particle. This type of quantization refers 
to the energies that a single particle can have. Both photons and 
matter particles demonstrate both types of quantization under the 
appropriate circumstances. ) 


The particle has a minimum kinetic energy. Long wavelengths cor- 
respond to low momenta and low energies. There can be no state 
with an energy lower than that of the n = 1 state, called the ground 
state. 


The smaller the space in which the particle is confined, the higher 
its kinetic energy must be. Again, this is because long wavelengths 
give lower energies. 


Spectra of thin gases example 13 
A fact that was inexplicable by classical physics was that thin 
gases absorb and emit light only at certain wavelengths. This 
was observed both in earthbound laboratories and in the spectra 
of stars. The figure on the left shows the example of the spec- 
trum of the star Sirius, in which there are “gap teeth” at certain 
wavelengths. Taking this spectrum as an example, we can give a 
straightforward explanation using quantum physics. 


Energy is released in the dense interior of the star, but the outer 
layers of the star are thin, so the atoms are far apart and electrons 
are confined within individual atoms. Although their standing- 
wave patterns are not as simple as those of the particle in the 
box, their energies are quantized. 


When a photon is on its way out through the outer layers, it can be 
absorbed by an electron in an atom, but only if the amount of en- 
ergy it carries happens to be the right amount to kick the electron 
from one of the allowed energy levels to one of the higher lev- 
els. The photon energies that are missing from the spectrum are 
the ones that equal the difference in energy between two elec- 
tron energy levels. (The most prominent of the absorption lines in 
Sirius’s spectrum are absorption lines of the hydrogen atom.) 


The stability of atoms example 14 

In many Star Trek episodes the Enterprise, in orbit around a planet, 
suddenly lost engine power and began spiraling down toward the 

planet’s surface. This was utter nonsense, of course, due to con- 

servation of energy: the ship had no way of getting rid of energy, 

so it did not need the engines to replenish it. 


Consider, however, the electron in an atom as it orbits the nu- 
cleus. The electron does have a way to release energy: it has an 
acceleration due to its continuously changing direction of motion, 
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and according to classical physics, any accelerating charged par- 
ticle emits electromagnetic waves. According to classical physics, 
atoms should collapse! 


The solution lies in the observation that a bound state has a min- 
imum energy. An electron in one of the higher-energy atomic 
states can and does emit photons and hop down step by step in 
energy. But once it is in the ground state, it cannot emit a photon 
because there is no lower-energy state for it to go to. 


Chemical bonds example 15 
| began this section with a classical argument that chemical 
bonds, as in an Hz molecule, should not exist. Quantum physics 
explains why this type of bonding does in fact occur. When the 
atoms are next to each other, the electrons are shared between 
them. The “box” is about twice as wide, and a larger box allows 
a smaller energy. Energy is required in order to separate the 
atoms. (A qualitatively different type of bonding is discussed on 
page 838. Example 20 on page 833 revisits the Hs bond in more 
detail.) 


Discussion Questions 


A_ Neutrons attract each other via the strong nuclear force, so according 
to classical physics it should be possible to form nuclei out of clusters of 
two or more neutrons, with no protons at all. Experimental searches, 
however, have failed to turn up evidence of a stable two-neutron system 
(dineutron) or larger stable clusters. These systems are apparently not 
just unstable in the sense of being able to beta decay but unstable in 
the sense that they don’t hold together at all. Explain based on quantum 
physics why a dineutron might spontaneously fly apart. 


B__ The following table shows the energy gap between the ground state 
and the first excited state for four nuclei, in units of picojoules. (The nuclei 
were chosen to be ones that have similar structures, e.g., they are all 
spherical in shape.) 


nucleus energy gap (picojoules) 


4He 3.234 
16O 0.968 
Ca 0.536 
208 Ph 0.418 


Explain the trend in the data. 


3.4 The uncertainty principle and measurement 
Eliminating randomness through measurement? 


A common reaction to quantum physics, among both early- 
twentieth-century physicists and modern students, is that we should 
be able to get rid of randomness through accurate measurement. If 
I say, for example, that it is meaningless to discuss the path of a 
photon or an electron, one might suggest that we simply measure 
the particle’s position and velocity many times in a row. This series 


h/Two hydrogen atoms bond 
to form an Hse molecule. In the 
molecule, the two electrons’ wave 
patterns overlap , and are about 
twice as wide. 
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of snapshots would amount to a description of its path. 


A practical objection to this plan is that the process of measure- 
ment will have an effect on the thing we are trying to measure. This 
may not be of much concern, for example, when a traffic cop mea- 
sure’s your car’s motion with a radar gun, because the energy and 
momentum of the radar pulses are insufficient to change the car’s 
motion significantly. But on the subatomic scale it is a very real 
problem. Making a videotape through a microscope of an electron 
orbiting a nucleus is not just difficult, it is theoretically impossi- 
ble. The video camera makes pictures of things using light that has 
bounced off them and come into the camera. If even a single photon 
of visible light was to bounce off of the electron we were trying to 
study, the electron’s recoil would be enough to change its behavior 
significantly. 


The heisenberg uncertainty principle 


This insight, that measurement changes the thing being mea- 
sured, is the kind of idea that clove-cigarette-smoking intellectuals 
outside of the physical sciences like to claim they knew all along. If 
only, they say, the physicists had made more of a habit of reading 
literary journals, they could have saved a lot of work. The anthro- 
pologist Margaret Mead has recently been accused of inadvertently 
encouraging her teenaged Samoan informants to exaggerate the free- 
dom of youthful sexual experimentation in their society. If this is 
considered a damning critique of her work, it is because she could 
have done better: other anthropologists claim to have been able to 
eliminate the observer-as-participant problem and collect untainted 
data. 


The German physicist Werner Heisenberg, however, showed that 
in quantum physics, any measuring technique runs into a brick wall 
when we try to improve its accuracy beyond a certain point. Heisen- 
berg showed that the limitation is a question of what there is to be 
known, even in principle, about the system itself, not of the ability 
or inability of a specific measuring device to ferret out information 
that is knowable but not previously hidden. 


Suppose, for example, that we have constructed an electron in a 
box (quantum dot) setup in our laboratory, and we are able to adjust 
the length L of the box as desired. All the standing wave patterns 
pretty much fill the box, so our knowledge of the electron’s position 
is of limited accuracy. If we write Ax for the range of uncertainty 
in our knowledge of its position, then Az is roughly the same as the 
length of the box: 


Are L 


If we wish to know its position more accurately, we can certainly 
squeeze it into a smaller space by reducing L, but this has an unin- 
tended side-effect. A standing wave is really a superposition of two 
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traveling waves going in opposite directions. The equation p = h/X 
really only gives the magnitude of the momentum vector, not its 
direction, so we should really interpret the wave as a 50/50 mixture 
of a right-going wave with momentum p = h/X and a left-going one 
with momentum p = —h/X. The uncertainty in our knowledge of 
the electron’s momentum is Ap = 2h/A, covering the range between 
these two values. Even if we make sure the electron is in the ground 
state, whose wavelength \ = 2L is the longest possible, we have an 
uncertainty in momentum of Ap = h/L. In general, we find 


ApZh/L  , 


with equality for the ground state and inequality for the higher- 
energy states. Thus if we reduce L to improve our knowledge of the 
electron’s position, we do so at the cost of knowing less about its 
momentum. This trade-off is neatly summarized by multiplying the 
two equations to give 


ApAz 2h 


Although we have derived this in the special case of a particle in a 
box, it is an example of a principle of more general validity: 


The Heisenberg uncertainty principle 

It is not possible, even in principle, to know the momentum and the 
position of a particle simultaneously and with perfect accuracy. The 
uncertainties in these two quantities are always such that ApAx = 


h. 


(This approximation can be made into a strict inequality, ApAx > 
h/4m, but only with more careful definitions, which we will not 
bother with.) 


Note that although I encouraged you to think of this deriva- 
tion in terms of a specific real-world system, the quantum dot, no 
reference was ever made to any specific laboratory equipment or pro- 
cedures. The argument is simply that we cannot know the particle’s 
position very accurately unless it has a very well defined position, 
it cannot have a very well defined position unless its wave-pattern 
covers only a very small amount of space, and its wave-pattern can- 
not be thus compressed without giving it a short wavelength and 
a correspondingly uncertain momentum. The uncertainty princi- 
ple is therefore a restriction on how much there is to know about 
a particle, not just on what we can know about it with a certain 
technique. 


An estimate for electrons in atoms example 16 
> A typical energy for an electron in an atom is on the order of 
(1 volt)-e, which corresponds to a speed of about 1% of the speed 
of light. If a typical atom has a size on the order of 0.1 nm, how 
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close are the electrons to the limit imposed by the uncertainty 
principle? 


> If we assume the electron moves in all directions with equal 
probability, the uncertainty in its momentum is roughly twice its 
typical momentum. This only an order-of-magnitude estimate, so 
we take Ap to be the same as a typical momentum: 


ApAX = Ptypical AX 
= (Melectron)(0.01c)(0.1 x 1072 m) 
e310 ks 


This is on the same order of magnitude as Planck’s constant, so 
evidently the electron is “right up against the wall.” (The fact that 
it is somewhat less than his of no concern since this was only an 
estimate, and we have not stated the uncertainty principle in its 
most exact form.) 


self-check E 


If we were to apply the uncertainty principle to human-scale objects, 
what would be the significance of the small numerical value of Planck’s 
constant? > Answer, p. 874 


Historical Note 

The true nature of Heisenberg’s role in the Nazi atomic bomb effort is 
a fascinating question, and dramatic enough to have inspired a well- 
received 1998 theatrical play, “Copenhagen.” The real story, however, 
may never be completely unraveled. Heisenberg was the scientific leader 
of the German bomb program up until its cancellation in 1942, when the 
German military decided that it was too ambitious a project to undertake 
in wartime, and too unlikely to produce results. 


Some historians believe that Heisenberg intentionally delayed and 
obstructed the project because he secretly did not want the Nazis to 
get the bomb. Heisenberg’s apologists point out that he never joined 
the Nazi party, and was not anti-Semitic. He actively resisted the gov- 
ernment’s Deutsche-Physik policy of eliminating supposed Jewish in- 
fluences from physics, and as a result was denounced by the S.S. as 
a traitor, escaping punishment only because Himmler personally de- 
clared him innocent. One strong piece of evidence for this view is a 
secret message carried to the U.S. in 1941, by one of the last Jews to 
escape from Berlin, and eventually delivered to the chairman of the Ura- 
nium Committee, which was then studying the feasibility of a bomb. The 
message states “...that a large number of German physicists are work- 
ing intensively on the problem of the uranium bomb under the direction 
of Heisenberg, [and] that Heisenberg himself tries to delay the work as 
much as possible, fearing the catastrophic results of success. But he 
cannot help fulfilling the orders given to him, and if the problem can be 
solved, it will be solved probably in the near future. So he gave the 
advice to us to hurry up if U.S.A. will not come too late.” The message 
supports the view that Heisenberg intentionally misled his government 
about the bomb’s technical feasibility; Nazi Minister of Armaments Al- 
bert Speer wrote that he was convinced to drop the project after a 1942 
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meeting with Heisenber because “the physicists themselves didn’t want 
to put too much into it.” Heisenberg also may have warned Danish physi- 
cist Niels Bohr personally in September 1941 about the existence of the 
Nazi bomb effort. 


On the other side of the debate, critics of Heisenberg say that he 
clearly wanted Germany to win the war, that he visited German-occupied 
territories in a semi-official role, and that he may simply have not been 
very good at his job directing the bomb project. On a visit to the oc- 
cupied Netherlands in 1943, he told a colleague, “Democracy cannot 
develop sufficient energy to rule Europe. There are, therefore, only two 
alternatives: Germany and Russia. And then a Europe under German 
leadership would be the lesser evil.” Some historians® argue that the 
real point of Heisenberg’s meeting with Bohr was to try to convince the 
U.S. not to try to build a bomb, so that Germany, possessing a nuclear 
monopoly, would defeat the Soviets — this was after the June 1941 en- 
try of the U.S.S.R. into the war, but before the December 1941 Pearl 
Harbor attack brought the U.S. in. Bohr apparently considered Heisen- 
berg’s account of the meeting, published after the war was over, to be 
inaccurate. The secret 1941 message also has a curious moral pas- 
sivity to it, as if Heisenberg was saying “I hope you stop me before | do 
something bad,” but we should also consider the great risk Heisenberg 
would have been running if he actually originated the message. 


Measurement and Schrédinger’s cat 


In the previous chapter I briefly mentioned an issue concerning 
measurement that we are now ready to address carefully. If you 
hang around a laboratory where quantum-physics experiments are 
being done and secretly record the physicists’ conversations, you'll 
hear them say many things that assume the probability interpreta- 
tion of quantum mechanics. Usually they will speak as though the 
randomness of quantum mechanics enters the picture when some- 
thing is measured. In the digital camera experiments of the previous 
chapter, for example, they would casually describe the detection of 
a photon at one of the pixels as if the moment of detection was 
when the photon was forced to “make up its mind.” Although this 
mental cartoon usually works fairly well as a description of things 
they experience in the lab, it cannot ultimately be correct, because 
it attributes a special role to measurement, which is really just a 
physical process like all other physical processes. 


If we are to find an interpretation that avoids giving any spe- 
cial role to measurement processes, then we must think of the entire 
laboratory, including the measuring devices and the physicists them- 
selves, as one big quantum-mechanical system made out of protons, 


3 Historical Perspective on Copenhagen, David C. Cassidy, Physics Today, 
July 2000, p. 28, http://www.aip.org/pt/vol-53/iss-7/p28.html 

*Bohr drafted several replies, but never published them for fear of 
hurting Heisenberg and his family. Bohr’s papers were to be sealed for 
50 years after his death, but his family released them early, in Febru- 
ary 2002. The texts, including English translations, are available at 
http: //www.nbi.dk/NBA/papers/docs/cover.html. 
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neutrons, electrons, and photons. In other words, we should take 
quantum physics seriously as a description not just of microscopic 
objects like atoms but of human-scale (“macroscopic”) things like 
the apparatus, the furniture, and the people. 


The most celebrated example is called the Schrédinger’s cat ex- 
periment. Luckily for the cat, there probably was no actual ex- 
periment — it was simply a “thought experiment” that the physi- 
cist the German theorist Schrodinger discussed with his colleagues. 
Schrodinger wrote: 


One can even construct quite burlesque cases. A cat is 
shut up in a steel container, together with the following 
diabolical apparatus (which one must keep out of the 
direct clutches of the cat): In a Geiger tube [radiation 
detector] there is a tiny mass of radioactive substance, so 
little that in the course of an hour perhaps one atom of 
it disintegrates, but also with equal probability not even 
one; if it does happen, the counter [detector] responds 
and ... activates a hammer that shatters a little flask 
of prussic acid [filling the chamber with poison gas]. If 
one has left this entire system to itself for an hour, then 
one will say to himself that the cat is still living, if in 
that time no atom has disintegrated. The first atomic 
disintegration would have poisoned it. 


Now comes the strange part. Quantum mechanics describes the 
particles the cat is made of as having wave properties, including the 
property of superposition. Schrédinger describes the wavefunction 
of the box’s contents at the end of the hour: 


The wavefunction of the entire system would express this sit- 
uation by having the living and the dead cat mixed ... in equal 
parts [50/50 proportions]. The uncertainty originally restricted to 
the atomic domain has been transformed into a macroscopic uncer- 
tainty... 


At first Schr6édinger’s description seems like nonsense. When you 
opened the box, would you see two ghostlike cats, as in a doubly 
exposed photograph, one dead and one alive? Obviously not. You 
would have a single, fully material cat, which would either be dead 
or very, very upset. But Schrédinger has an equally strange and 
logical answer for that objection. In the same way that the quantum 
randomness of the radioactive atom spread to the cat and made its 
wavefunction a random mixture of life and death, the randomness 
spreads wider once you open the box, and your own wavefunction 
becomes a mixture of a person who has just killed a cat and a person 
who hasn’t. 
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Discussion Questions 


A Compare Ap and Ax for the two lowest energy levels of the one- 
dimensional particle in a box, and discuss how this relates to the uncer- 
tainty principle. 

B = Onagraph of Ap versus Ax, sketch the regions that are allowed and 
forbidden by the Heisenberg uncertainty principle. Interpret the graph: 
Where does an atom lie on it? An elephant? Can either p or x be mea- 
sured with perfect accuracy if we don’t care about the other? 


13.3.5 Electrons in electric fields 


So far the only electron wave patterns we’ve considered have 
been simple sine waves, but whenever an electron finds itself in an 
electric field, it must have a more complicated wave pattern. Let’s 
consider the example of an electron being accelerated by the elec- 
tron gun at the back of a TV tube. Newton’s laws are not useful, 
because they implicitly assume that the path taken by the particle is 
a meaningful concept. Conservation of energy is still valid in quan- 
tum physics, however. In terms of energy, the electron is moving 
from a region of low voltage into a region of higher voltage. Since 
its charge is negative, it loses electrical energy by moving to a higher 
voltage, so its kinetic energy increases. As its electrical energy goes 
down, its kinetic energy goes up by an equal amount, keeping the 
total energy constant. Increasing kinetic energy implies a growing 
momentum, and therefore a shortening wavelength, i 


The wavefunction as a whole does not have a single well-defined 
wavelength, but the wave changes so gradually that if you only look 
at a small part of it you can still pick out a wavelength and relate 
it to the momentum and energy. (The picture actually exagger- 
ates by many orders of magnitude the rate at which the wavelength 
changes. ) 


But what if the electric field was stronger? The electric field in 
a TV is only ~ 10° N/C, but the electric field within an atom is 
more like 10! N/C. In figure k, the wavelength changes so rapidly 
that there is nothing that looks like a sine wave at all. We could 
get a rough idea of the wavelength in a given region by measuring 
the distance between two peaks, but that would only be a rough 
approximation. Suppose we want to know the wavelength at point 
P. The trick is to construct a sine wave, like the one shown with the 
dashed line, which matches the curvature of the actual wavefunction 
as closely as possible near P. The sine wave that matches as well as 
possible is called the “osculating” curve, from a Latin word meaning 
“to kiss.” The wavelength of the osculating curve is the wavelength 
that will relate correctly to conservation of energy. 


direction of motion 
(speeding up) 
— 


high PE low PE 
low KE high KE 
low momentum high momentum 
long wavelength short wavelength 
weak curvature strong curvature 


i/An_ electron in a_ gentle 
electric field gradually shortens 
its wavelength as it gains energy. 


wavefunction’s tails 


j/ The 
go where classical physics says 
they shouldn't. 
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k/A typical wavefunction of an electron in an atom (heavy curve) 
and the osculating sine wave (dashed curve) that matches its curvature 
at point P. 


Tunneling 


We implicitly assumed that the particle-in-a-box wavefunction 
would cut off abruptly at the sides of the box, j/1, but that would be 
unphysical. A kink has infinite curvature, and curvature is related 
to energy, so it can’t be infinite. A physically realistic wavefunction 
must always “tail off” gradually, j/2. In classical physics, a particle 
can never enter a region in which its interaction energy U would be 
greater than the amount of energy it has available. But in quantum 
physics the wavefunction will always have a tail that reaches into 
the classically forbidden region. If it was not for this effect, called 
tunneling, the fusion reactions that power the sun would not occur 
due to the high electrical energy nuclei need in order to get close 
together! Tunneling is discussed in more detail in the following 
subsection. 


13.3.6 The Schrodinger equation 


In subsection 13.3.5 we were able to apply conservation of energy 
to an electron’s wavefunction, but only by using the clumsy graph- 
ical technique of osculating sine waves as a measure of the wave’s 
curvature. You have learned a more convenient measure of curvature 
in calculus: the second derivative. To relate the two approaches, we 
take the second derivative of a sine wave: 


2 
se sin(272/r) = : (3 COS ar) 


dx? dz \ X r 
_ 27 is 27x 
aa a ee ae 


Taking the second derivative gives us back the same function, 
but with a minus sign and a constant out in front that is related 
to the wavelength. We can thus relate the second derivative to the 
osculating wavelength: 
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This could be solved for A in terms of WV, but it will turn out 
below to be more convenient to leave it in this form. 


Applying this to conservation of energy, we have 


Note that both equation [1] and equation [2] have \? in the denom- 
inator. We can simplify our algebra by multiplying both sides of 
equation [2] by W to make it look more like equation [1]: 


(h/d)* 
2 


E-W= V+U-V 


m 
ei aed a oa 
= (5) (=) Tage 


A WWRY Ge a aie 


2m \2r) da? 


Further simplification is achieved by using the symbol h (h with a 
slash through it, read “h-bar”) as an abbreviation for h/27. We then 
have the important result known as the Schr6dinger equation: 


(Actually this is a simplified version of the Schrédinger equation, 
applying only to standing waves in one dimension.) Physically it 
is a statement of conservation of energy. The total energy FE must 
be constant, so the equation tells us that a change in interaction 
energy U must be accompanied by a change in the curvature of 
the wavefunction. This change in curvature relates to a change 
in wavelength, which corresponds to a change in momentum and 
kinetic energy. 


self-check F 

Considering the assumptions that were made in deriving the Schrdédinger 
equation, would it be correct to apply it to a photon? To an electron mov- 
ing at relativistic soeeds? > Answer, p. 
874 


Usually we know right off the bat how U depends on z, so the 
basic mathematical problem of quantum physics is to find a function 
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| / Tunneling through a barrier. 


W(x) that satisfies the Schrédinger equation for a given interaction- 
energy function U(x). An equation, such as the Schrédinger equa- 
tion, that specifies a relationship between a function and its deriva- 
tives is known as a differential equation. 


The detailed study of the solution of the Schrédinger equation 
is beyond the scope of this book, but we can gain some important 
insights by considering the easiest version of the Schrodinger equa- 
tion, in which the interaction energy U is constant. We can then 
rearrange the Schrddinger equation as follows: 


uv %mM(U- E), 
da? te ; 


which boils down to 


ew 


wo 7 


where, according to our assumptions, a is independent of x. We need 
to find a function whose second derivative is the same as the original 
function except for a multiplicative constant. The only functions 
with this property are sine waves and exponentials: 


2 


Pe) [qsin(rx +s) ] = —qr?sin(rz + s) 
x 
d2 
aaa [ge"*t*] = greets 
x 
The sine wave gives negative values of a, a = —r?, and the 


exponential gives positive ones, a = r?. The former applies to the 
classically allowed region with U < E. 


This leads us to a quantitative calculation of the tunneling ef- 
fect discussed briefly in the preceding subsection. The wavefunction 
evidently tails off exponentially in the classically forbidden region. 
Suppose, as shown in figure 1, a wave-particle traveling to the right 
encounters a barrier that it is classically forbidden to enter. Al- 
though the form of the Schrédinger equation we’re using technically 
does not apply to traveling waves (because it makes no reference 
to time), it turns out that we can still use it to make a reasonable 
calculation of the probability that the particle will make it through 
the barrier. If we let the barrier’s width be w, then the ratio of the 
wavefunction on the left side of the barrier to the wavefunction on 
the right is 

ger _ ore 
ge" (etw)+s 


Probabilities are proportional to the squares of wavefunctions, so 
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the probability of making it through the barrier is 


self-check G 


If we were to apply this equation to find the probability that a person can 


walk through a wall, what would the small value of Planck’s constant 
imply? > Answer, p. 874 
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and total interaction energies for 
an alpha particle escaping from a 
nucleus. 


Tunneling in aloha decay example 17 
Naively, we would expect alpha decay to be a very fast process. 
The typical speeds of neutrons and protons inside a nucleus are 
extremely high (see problem 20). If we imagine an alpha particle 
coalescing out of neutrons and protons inside the nucleus, then 
at the typical speeds we're talking about, it takes a ridiculously 
small amount of time for them to reach the surface and try to 
escape. Clattering back and forth inside the nucleus, we could 
imagine them making a vast number of these “escape attempts” 
every second. 


Consider figure m, however, which shows the interaction energy 
for an alpha particle escaping from a nucleus. The electrical en- 
ergy is kq;q2/r when the alpha is outside the nucleus, while its 
variation inside the nucleus is proportional to r, as a consequence 
of the shell theorem. The nuclear energy is constant when the al- 
pha is inside the nucleus, because the forces from all the neigh- 
boring neutrons and protons cancel out; it rises sharply near the 
surface, and flattens out to zero over a distance of ~ 1 fm, which 
is the maximum distance scale at which the strong force can op- 
erate. There is a classically forbidden region immediately outside 
the nucleus, so the alpha particle can only escape by quantum 
mechanical tunneling. (It’s true, but somewhat counterintuitive, 
that a repulsive electrical force can make it more difficult for the 
alpha to get out.) 


In reality, alpha-decay half-lives are often extremely long — some- 
times billions of years — because the tunneling probability is so 
small. Although the shape of the barrier is not a rectangle, the 
equation for the tunneling probability on page 819 can still be 
used as a rough guide to our thinking. Essentially the tunneling 
probability is so small because U — E is fairly big, typically about 
30 MeV at the peak of the barrier. 


Three dimensions 


For simplicity, we’ve been considering the Schrédinger equation 
in one dimension, so that W is only a function of x, and has units of 
m~!/2 rather than m~?/2. Since the Schrédinger equation is a state- 
ment of conservation of energy, and energy is a scalar, the general- 
ization to three dimensions isn’t particularly complicated. The total 
energy term E-W and the interaction energy term U-W involve noth- 
ing but scalars, and don’t need to be changed at all. In the kinetic 
energy term, however, we’re essentially basing our computation of 
the kinetic energy on the squared magnitude of the momentum, p?, 
and in three dimensions this would clearly have to be generalized to 
pet Dp, +p2. The obvious way to achieve this is to replace the second 
derivative d?U/da? with the sum 0?U/dx? + 0?U/dy? + 6?U/d2?. 
Here the partial derivative symbol 0, introduced on page 214, indi- 
cates that when differentiating with respect to a particular variable, 
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the other variables are to be considered as constants. This operation 
on the function V is notated V?W, and the derivative-like operator 
V2 = 07/0x? + 0?/dy? + 6?/A2z is called the Laplacian. It occurs 
elswehere in physics. For example, in classical electrostatics, the 
voltage in a region of vacuum must be a solution of the equation 
V?V =0. Like the second derivative, the Laplacian is essentially a 
measure of curvature. 


Examples of the Laplacian in two dimensions example 18 
> Compute the Laplacians of the following functions in two dimen- 


sions, and interpret them: A = x°+y?, B= —x?—y?, C= x*-y?. 


> The first derivative of function A with respect to x is 0A/Ox = 
2x. Since y is treated as a constant in the computation of the 
partial derivative 0/0x, the second term goes away. The second 
derivative of A with respect to x is 0°A/Ox? = 2. Similarly we have 
0°A/dy* = 2,s0 V?A=4. 


All derivative operators, including V?, have the linear property 
that multiplying the input function by a constant just multiplies the 
output function by the same constant. Since B = —A, and we 
have V2B = —4. 


For function C, the x term contributes a second derivative of 2, 
but the y term contributes —2, so V2C = 0. 


The interpretation of the positive sign in V7A = 4 is that A’s graph 
is shaped like a trophy cup, and the cup is concave up. The neg- 
ative sign in the result for V2B is because B is concave down. 
Function C is shaped like a saddle. Since its curvature along 
one axis is concave up, but the curvature along the other is down 
and equal in magnitude, the function is considered to have zero 
concavity over all. 


A classically allowed region with constant U example 19 
In a classically allowed region with constant U, we expect the 
solutions to the Schrédinger equation to be sine waves. A sine 
wave in three dimensions has the form 


W = sin (kyx + kyy + kz) 


When we compute 0?'/0x?, double differentiation of sin gives 
—sin, and the chain rule brings out a factor of k2. Applying all 
three second derivative operators, we get 


Vew = (-K igs K2) sin (kx + kyy + kzz) 
= (Ke + ke + ke) ¥ 
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The Schrddinger equation gives 


he 
E-Ws-—_V*v4U-y 
2m 
2 
=f (KR + K+ KE) We UY 
h2 
E-U=5— (ke +ky +k?) 


which can be satisfied since we’re in a classically allowed region 
with E — U > 0, and the right-hand side is manifestly positive. 


Use of complex numbers 


In a classically forbidden region, a particle’s total energy, U + 
K, is less than its U, so its K must be negative. If we want to 
keep believing in the equation K = p?/2m, then apparently the 
momentum of the particle is the square root of a negative number. 
This is a symptom of the fact that the Schrédinger equation fails 
to describe all of nature unless the wavefunction and various other 
quantities are allowed to be complex numbers. In particular it is not 
possible to describe traveling waves correctly without using complex 
wavefunctions. 


This may seem like nonsense, since real numbers are the only 
ones that are, well, real! Quantum mechanics can always be re- 
lated to the real world, however, because its structure is such that 
the results of measurements always come out to be real numbers. 
For example, we may describe an electron as having non-real mo- 
mentum in classically forbidden regions, but its average momentum 
will always come out to be real (the imaginary parts average out to 
zero), and it can never transfer a non-real quantity of momentum 
to another particle. 


A complete investigation of these issues is beyond the scope of 
this book, and this is why we have normally limited ourselves to 
standing waves, which can be described with real-valued wavefunc- 
tions. 
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13.4 The Atom 


You can learn a lot by taking a car engine apart, but you will have 
learned a lot more if you can put it all back together again and make 
it run. Half the job of reductionism is to break nature down into 
its smallest parts and understand the rules those parts obey. The 
second half is to show how those parts go together, and that is our 
goal in this chapter. We have seen how certain features of all atoms 
can be explained on a generic basis in terms of the properties of 
bound states, but this kind of argument clearly cannot tell us any 
details of the behavior of an atom or explain why one atom acts 
differently from another. 


The biggest embarrassment for reductionists is that the job of 
putting things back together job is usually much harder than the 
taking them apart. Seventy years after the fundamentals of atomic 
physics were solved, it is only beginning to be possible to calcu- 
late accurately the properties of atoms that have many electrons. 
Systems consisting of many atoms are even harder. Supercomputer 
manufacturers point to the folding of large protein molecules as a 
process whose calculation is just barely feasible with their fastest 
machines. The goal of this chapter is to give a gentle and visually 
oriented guide to some of the simpler results about atoms. 


13.4.1 Classifying states 


We'll focus our attention first on the simplest atom, hydrogen, 
with one proton and one electron. We know in advance a little of 
what we should expect for the structure of this atom. Since the 
electron is bound to the proton by electrical forces, it should dis- 
play a set of discrete energy states, each corresponding to a certain 
standing wave pattern. We need to understand what states there 
are and what their properties are. 
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a/ Eight wavelengths fit around 
this circle (¢ = 8). 


What properties should we use to classify the states? The most 
sensible approach is to used conserved quantities. Energy is one 
conserved quantity, and we already know to expect each state to 
have a specific energy. It turns out, however, that energy alone is 
not sufficient. Different standing wave patterns of the atom can 
have the same energy. 


Momentum is also a conserved quantity, but it is not particularly 
appropriate for classifying the states of the electron in a hydrogen 
atom. The reason is that the force between the electron and the pro- 
ton results in the continual exchange of momentum between them. 
(Why wasn’t this a problem for energy as well? Kinetic energy and 
momentum are related by K = p?/2m, so the much more massive 
proton never has very much kinetic energy. We are making an ap- 
proximation by assuming all the kinetic energy is in the electron, 
but it is quite a good approximation.) 


Angular momentum does help with classification. There is no 
transfer of angular momentum between the proton and the electron, 
since the force between them is a center-to-center force, producing 
no torque. 


Like energy, angular momentum is quantized in quantum physics. 
As an example, consider a quantum wave-particle confined to a cir- 
cle, like a wave in a circular moat surrounding a castle. A sine 
wave in such a “quantum moat” cannot have any old wavelength, 
because an integer number of wavelengths must fit around the cir- 
cumference, C’, of the moat. The larger this integer is, the shorter 
the wavelength, and a shorter wavelength relates to greater momen- 
tum and angular momentum. Since this integer is related to angular 
momentum, we use the symbol for it: 


A= GH 
The angular momentum is 
L=rp 


Here, r = C/2z7, and p= h/A = he/C, so 


_c he 
ow COC 
h 
ont 


In the example of the quantum moat, angular momentum is quan- 
tized in units of h/27. This makes h/27 a pretty important number, 
so we define the abbreviation h = h/27. This symbol is read “h- 
bar.” 


In fact, this is a completely general fact in quantum physics, not 
just a fact about the quantum moat: 
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Quantization of angular momentum 
The angular momentum of a particle due to its motion through 
space is quantized in units of h. 


self-check H 


What is the angular momentum of the wavefunction shown at the be- 
ginning of the section? > Answer, p. 
874 


13.4.2 Angular momentum in three dimensions 


Up until now we’ve only worked with angular momentum in the 
context of rotation in a plane, for which we could simply use pos- 
itive and negative signs to indicate clockwise and counterclockwise 
directions of rotation. A hydrogen atom, however, is unavoidably 
three-dimensional. Before we worry about three-dimensional angu- 
lar momentum in quantum mechanics, we need to understand how it 
works in classical mechanics; the following discussion is a summary 
of the most important facts from optional section 4.3. 


Three-dimensional angular momentum in classical physics 


If we are to completely specify the angular momentum of a clas- 
sical object like a top, b, in three dimensions, it’s not enough to 
say whether the rotation is clockwise or counterclockwise. We must 
also give the orientation of the plane of rotation or, equivalently, the 
direction of the top’s axis. The convention is to specify the direction 
of the axis. There are two possible directions along the axis, and 
as a matter of convention we use the direction such that if we sight 
along it, the rotation appears clockwise. 


Angular momentum can, in fact, be defined as a vector pointing 
along this direction. This might seem like a strange definition, since 
nothing actually moves in that direction, but it wouldn’t make sense 
to define the angular momentum vector as being in the direction of 
motion, because every part of the top has a different direction of 
motion. Ultimately it’s not just a matter of picking a definition 
that is convenient and unambiguous: the definition we’re using is 
the only one that makes the total angular momentum of a system a 
conserved quantity if we let “total” mean the vector sum. 


As with rotation in one dimension, we cannot define what we 
mean by angular momentum in a particular situation unless we pick 
a point as an axis. This is really a different use of the word “axis” 
than the one in the previous paragraphs. Here we simply mean a 
point from which we measure the distance r. In the hydrogen atom, 
the nearly immobile proton provides a natural choice of axis. 


a) 


b/The angular momentum 


vector of a spinning top. 
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c/ Reconciling the uncertainty 
principle with the definition of 
angular momentum. 


Three-dimensional angular momentum in quantum physics 


Once we start to think more carefully about the role of angular 
momentum in quantum physics, it may seem that there is a basic 
problem: the angular momentum of the electron in a hydrogen atom 
depends on both its distance from the proton and its momentum, 
so in order to know its angular momentum precisely it would seem 
we would need to know both its position and its momentum simul- 
taneously with good accuracy. This, however, might seem to be 
forbidden by the Heisenberg uncertainty principle. 


Actually the uncertainty principle does place limits on what can 
be known about a particle’s angular momentum vector, but it does 
not prevent us from knowing its magnitude as an exact integer mul- 
tiple of h. The reason is that in three dimensions, there are really 
three separate uncertainty principles: 


Ap AG ZR 
ApyAy Zz h 
Ap, Az 2h 


Now consider a particle, c/1, that is moving along the x axis at 
position x and with momentum p,. We may not be able to know 
both x and p, with unlimited accurately, but we can still know the 
particle’s angular momentum about the origin exactly: it is zero, 
because the particle is moving directly away from the origin. 


Suppose, on the other hand, a particle finds itself, c/2, at a 
position x along the x axis, and it is moving parallel to the y axis 
with momentum p,. It has angular momentum xp, about the z 
axis, and again we can know its angular momentum with unlimited 
accuracy, because the uncertainty principle on relates x to py and y 
to py. It does not relate x to py. 


As shown by these examples, the uncertainty principle does 
not restrict the accuracy of our knowledge of angular momenta as 
severely as might be imagined. However, it does prevent us from 
knowing all three components of an angular momentum vector si- 
multaneously. The most general statement about this is the follow- 
ing theorem, which we present without proof: 


The angular momentum vector in quantum physics 

The most the can be known about an angular momentum vector 
is its magnitude and one of its three vector components. Both are 
quantized in units of h. 


13.4.3 The hydrogen atom 


Deriving the wavefunctions of the states of the hydrogen atom 
from first principles would be mathematically too complex for this 
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d/A cross-section of a hydrogen wavefunction. 


book, but it’s not hard to understand the logic behind such a wave- 
function in visual terms. Consider the wavefunction from the begin- 
ning of the section, which is reproduced in figure d. Although the 
graph looks three-dimensional, it is really only a representation of 
the part of the wavefunction lying within a two-dimensional plane. 
The third (up-down) dimension of the plot represents the value of 
the wavefunction at a given point, not the third dimension of space. 
The plane chosen for the graph is the one perpendicular to the an- 
gular momentum vector. 


Each ring of peaks and valleys has eight wavelengths going around 
in a circle, so this state has L = 8h, i.e., we label it 2 = 8. The wave- 
length is shorter near the center, and this makes sense because when 
the electron is close to the nucleus it has a lower electrical energy, 
a higher kinetic energy, and a higher momentum. 


Between each ring of peaks in this wavefunction is a nodal cir- 
cle, i.e., a circle on which the wavefunction is zero. The full three- 
dimensional wavefunction has nodal spheres: a series of nested spher- 
ical surfaces on which it is zero. The number of radii at which nodes 
occur, including r = oo, is called n, and n turns out to be closely 
related to energy. The ground state has n = 1 (a single node only 
at r = oo), and higher-energy states have higher n values. There 
is a simple equation relating n to energy, which we will discuss in 
subsection 13.4.4. 


The numbers n and £, which identify the state, are called its 
quantum numbers. A state of a given n and ¢ can be oriented 
in a variety of directions in space. We might try to indicate the 
orientation using the three quantum numbers ¢; = Ly/h, ly = Ly/h, 
and ¢, = L,/h. But we have already seen that it is impossible to 
know all three of these simultaneously. To give the most complete 


unbound states 


n=3 
n=2 
> 
a 
_ 
o 
c 
o 
n=1 


e/The energy of a state in 
the hydrogen atom depends only 


on its n quantum number. 
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possible description of a state, we choose an arbitrary axis, say the 
z axis, and label the state according to n, ¢, and @,.° 


Angular momentum requires motion, and motion implies kinetic 
energy. Thus it is not possible to have a given amount of angular 
momentum without having a certain amount of kinetic energy as 
well. Since energy relates to the n quantum number, this means 
that for a given n value there will be a maximum possible . It turns 
out that this maximum value of equals n — 1. 


In general, we can list the possible combinations of quantum 
numbers as follows: 


n can equal 1, 2, 3,... 
é can range from 0 to n — 1, in steps of 1 
é, can range from —f to @, in steps of 1 


Applying these rules, we have the following list of states: 


n=1, @€=0, ¢=0 one state 
n=2, €=0, ¢,=0 one state 
n=2, €=1, ¢,=-1,0, or 1 | three states 


self-check | 
Continue the list for n = 3. > Answer, p. 874 


Figure f on page 829 shows the lowest-energy states of the hy- 
drogen atom. The left-hand column of graphs displays the wave- 
functions in the x — y plane, and the right-hand column shows the 
probability distribution in a three-dimensional representation. 


Discussion Questions 


A The quantum number n is defined as the number of radii at which 
the wavefunction is zero, including r = oo. Relate this to the features of 
the figures on the facing page. 


B Based on the definition of n, why can’t there be any such thing as 
an n= 0 state? 


C Relate the features of the wavefunction plots in figure f to the corre- 
sponding features of the probability distribution pictures. 


D How can you tell from the wavefunction plots in figure f which ones 
have which angular momenta? 


E Criticize the following incorrect statement: “The @ = 8 wavefunction 
in figure d has a shorter wavelength in the center because in the center 
the electron is in a higher energy level.” 


F Discuss the implications of the fact that the probability cloud in of the 
n= 2, £=1 state is split into two parts. 


°See page 836 for a note about the two different systems of notations that 
are used for quantum numbers. 
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f/The three states of the hydrogen atom having the lowest ener- 
gies. 


13.4.4 Energies of states in hydrogen 
History 


The experimental technique for measuring the energy levels of 
an atom accurately is spectroscopy: the study of the spectrum of 
light emitted (or absorbed) by the atom. Only photons with cer- 
tain energies can be emitted or absorbed by a hydrogen atom, for 
example, since the amount of energy gained or lost by the atom 
must equal the difference in energy between the atom’s initial and 
final states. Spectroscopy had actually become a highly developed 
art several decades before Einstein even proposed the photon, and 
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the Swiss spectroscopist Johann Balmer determined in 1885 that 
there was a simple equation that gave all the wavelengths emitted 
by hydrogen. In modern terms, we think of the photon wavelengths 
merely as indirect evidence about the underlying energy levels of 
the atom, and we rework Balmer’s result into an equation for these 
atomic energy levels: 


22x 10718 J 


En = 5) ’ 


n 
This energy includes both the kinetic energy of the electron and 
the electrical energy. The zero-level of the electrical energy scale 
is chosen to be the energy of an electron and a proton that are 
infinitely far apart. With this choice, negative energies correspond 
to bound states and positive energies to unbound ones. 


Where does the mysterious numerical factor of 2.2x 107-!8 J come 
from? In 1913 the Danish theorist Niels Bohr realized that it was 
exactly numerically equal to a certain combination of fundamental 
physical constants: 


mk2e* 1 


2 2 , 


En = - 


where m is the mass of the electron, and k is the Coulomb force 
constant for electric forces. 


Bohr was able to cook up a derivation of this equation based on 
the incomplete version of quantum physics that had been developed 
by that time, but his derivation is today mainly of historical interest. 
It assumes that the electron follows a circular path, whereas the 
whole concept of a path for a particle is considered meaningless in 
our more complete modern version of quantum physics. Although 
Bohr was able to produce the right equation for the energy levels, 
his model also gave various wrong results, such as predicting that 
the atom would be flat, and that the ground state would have ¢ = 1 
rather than the correct @ = 0. 


Approximate treatment 


Rather than leaping straight into a full mathematical treatment, 
we’ll start by looking for some physical insight, which will lead to 
an approximate argument that correctly reproduces the form of the 
Bohr equation. 


A typical standing-wave pattern for the electron consists of a 
central oscillating area surrounded by a region in which the wave- 
function tails off. As discussed in subsection 13.3.6, the oscillating 
type of pattern is typically encountered in the classically allowed 
region, while the tailing off occurs in the classically forbidden re- 
gion where the electron has insufficient kinetic energy to penetrate 
according to classical physics. We use the symbol r for the radius 
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of the spherical boundary between the classically allowed and clas- 
sically forbidden regions. Classically, r would be the distance from 
the proton at which the electron would have to stop, turn around, 
and head back in. 


If r had the same value for every standing-wave pattern, then 
we'd essentially be solving the particle-in-a-box problem in three 
dimensions, with the box being a spherical cavity. Consider the 
energy levels of the particle in a box compared to those of the hy- 
drogen atom, g. They’re qualitatively different. The energy levels 
of the particle in a box get farther and farther apart as we go higher 
in energy, and this feature doesn’t even depend on the details of 
whether the box is two-dimensional or three-dimensional, or its ex- 
act shape. The reason for the spreading is that the box is taken to 
be completely impenetrable, so its size, r, is fixed. A wave pattern 
with n humps has a wavelength proportional to r/n, and therefore a 
momentum proportional to n, and an energy proportional to n?. In 
the hydrogen atom, however, the force keeping the electron bound 
isn’t an infinite force encountered when it bounces off of a wall, it’s 
the attractive electrical force from the nucleus. If we put more en- 
ergy into the electron, it’s like throwing a ball upward with a higher 
energy — it will get farther out before coming back down. This 
means that in the hydrogen atom, we expect r to increase as we go 
to states of higher energy. This tends to keep the wavelengths of 
the high energy states from getting too short, reducing their kinetic 
energy. The closer and closer crowding of the energy levels in hydro- 
gen also makes sense because we know that there is a certain energy 
that would be enough to make the electron escape completely, and 
therefore the sequence of bound states cannot extend above that 
energy. 


When the electron is at the maximum classically allowed distance 
r from the proton, it has zero kinetic energy. Thus when the electron 
is at distance r, its energy is purely electrical: 


ke? 


(1) B=-— 


Now comes the approximation. In reality, the electron’s wavelength 
cannot be constant in the classically allowed region, but we pretend 
that it is. Since n is the number of nodes in the wavefunction, we 
can interpret it approximately as the number of wavelengths that 
fit across the diameter 2r. We are not even attempting a derivation 
that would produce all the correct numerical factors like 2 and 7 
and so on, so we simply make the approximation 


r 
[2] An 

n 
Finally we assume that the typical kinetic energy of the electron is 
on the same order of magnitude as the absolute value of its total 


particle in a box hydrogen atom 


g/The energy levels of a 
particle in a box, contrasted with 


those of the hydrogen atom. 
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energy. (This is true to within a factor of two for a typical classical 
system like a planet in a circular orbit around the sun.) We then 
have 


[3] absolute value of total energy 
_ ke? 
of 
~K 


= p?/2m 
= (h/d)?/2m 


h?n? /2mr? 


We now solve the equation ke?/r ~ h?n?/2mr? for r and throw 
away numerical factors we can’t hope to have gotten right, yielding 


h2 n2 


mke? 


[4] 

Plugging n = 1 into this equation gives r = 2 nm, which is indeed 
on the right order of magnitude. Finally we combine equations [4] 
and [1] to find 


which is correct except for the numerical factors we never aimed to 


find. 


Exact treatment of the ground state 


The general proof of the Bohr equation for all values of n is 
beyond the mathematical scope of this book, but it’s fairly straight- 
forward to verify it for a particular n, especially given a lucky guess 
as to what functional form to try for the wavefunction. The form 
that works for the ground state is 


U =he"/2 


where r = \/x? + y? +22 is the electron’s distance from the pro- 
ton, and h provides for normalization. In the following, the result 
Or/Ox = x/r comes in handy. Computing the partial derivatives 
that occur in the Laplacian, we obtain for the x term 


av _ dwar 
Ox Or Ox 
ey 
ar 
Orw 1 ef01 x\2 
Ox? ae” a (=5) e+ (2) . 
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sO 
a+ ul 
V-v = (-2+5) U 


The Schrédinger equation gives 


h2 
E.V=-—V?U4U- 
2m 
2 2 
_fh 2 1 v ke" ig 
2m \ar a? r 


If we require this equation to hold for all r, then we must have 
equality for both the terms of the form (constant) x V and for those 
of the form (constant/r) x UV. That means 


h2 
E=-— 
2ma2 
and 
_ h ke? 
mar r 


and 


mk?e4 
Qh? , 


where the result for the energy agrees with the Bohr equation for 
n= 1. The calculation of the normalization constant h is relegated 
to homework problem 36. 


self-check J 

We've verified that the function Y = he-'/2 is a solution to the Schrédinger 
equation, and yet it has a kink in it. What’s going on here? Didn’t | argue 
before that kinks are unphysical? > Answer, p. 874 


Wave phases in the hydrogen molecule example 20 
In example 15 on page 809, | argued that the existence of the 
Hs molecule could essentially be explained by a particle-in-a-box 
argument: the molecule is a bigger box than an individual atom, 
so each electron’s wavelength can be longer, its kinetic energy 
lower. Now that we’re in possession of a mathematical expression 
for the wavefunction of the hydrogen atom in its ground state, we 
can make this argument a little more rigorous and detailed. Sup- 
pose that two hydrogen atoms are in a relatively cool sample of 
monoatomic hydrogen gas. Because the gas is cool, we can as- 
sume that the atoms are in their ground states. Now suppose that 


Section 13.4 The Atom 


833 


the two atoms approach one another. Making use again of the as- 
sumption that the gas is cool, it is reasonable to imagine that the 
atoms approach one another slowly. Now the atoms come a lit- 
tle closer, but still far enough apart that the region between them 
is classically forbidden. Each electron can tunnel through this 
classically forbidden region, but the tunneling probability is small. 
Each one is now found with, say, 99% probability in its original 
home, but with 1% probability in the other nucleus. Each elec- 
tron is now in a state consisting of a superposition of the ground 
state of its own atom with the ground state of the other atom. 
There are two peaks in the superposed wavefunction, but one is 
a much bigger peak than the other. 


An interesting question now arises. What are the relative phases 
of the two electrons? As discussed on page 803, the absolute 
phase of an electron’s wavefunction is not really a meaningful 
concept. Suppose atom A contains electron Alice, and B elec- 
tron Bob. Just before the collision, Alice may have wondered, “Is 
my phase positive right now, or is it negative? But of course | 
shouldn’t ask myself such silly questions,” she adds sheepishly. 


h/ Example 20. 


Limokec! 
image 
rot fourci 


Limkec 
image 
rot Fourci 


But relative phases are well defined. As the two atoms draw 
closer and closer together, the tunneling probability rises, and 
eventually gets so high that each electron is spending essentially 
50% of its time in each atom. It’s now reasonable to imagine that 
either one of two possibilities could obtain. Alice’s wavefunction 
could either look like h/1, with the two peaks in phase with one 
another, or it could look like h/2, with opposite phases. Because 
relative phases of wavefunctions are well defined, states 1 and 2 
are physically distinguishable. In particular, the kinetic energy of 
state 2 is much higher; roughly speaking, it is like the two-hump 
wave pattern of the particle in a box, as opposed to 1, which looks 
roughly like the one-hump pattern with a much longer wavelength. 
Not only that, but an electron in state 1 has a large probability of 
being found in the central region, where it has a large negative 
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electrical energy due to its interaction with both protons. State 2, 
on the other hand, has a low probability of existing in that region. 
Thus state 1 represents the true ground-state wavefunction of the 
H2 molecule, and putting both Alice and Bob in that state results 
in a lower energy than their total energy when separated, so the 
molecule is bound, and will not fly apart spontaneously. 


State h/3, on the other hand, is not physically distinguishable from 
h/2, nor is h/4 from h/1. Alice may say to Bob, “Isn’t it wonderful 
that we’re in state 1 or 4? | love being stable like this.” But she 
knows it’s not meaningful to ask herself at a given moment which 
state she’s in, 1 or 4. 


Discussion Questions 


A __ States of hydrogen with n greater than about 10 are never observed 
in the sun. Why might this be? 


B Sketch graphs of r and E versus n for the hydrogen, and compare 
with analogous graphs for the one-dimensional particle in a box. 


13.4.5 Electron spin 


It’s disconcerting to the novice ping-pong player to encounter 
for the first time a more skilled player who can put spin on the ball. 
Even though you can’t see that the ball is spinning, you can tell 
something is going on by the way it interacts with other objects in 
its environment. In the same way, we can tell from the way electrons 
interact with other things that they have an intrinsic spin of their 
own. Experiments show that even when an electron is not moving 
through space, it still has angular momentum amounting to h/2. 


This may seem paradoxical because the quantum moat, for in- 
stance, gave only angular momenta that were integer multiples of 
h, not half-units, and I claimed that angular momentum was al- 
ways quantized in units of f, not just in the case of the quantum 
moat. That whole discussion, however, assumed that the angular 
momentum would come from the motion of a particle through space. 
The fA/2 angular momentum of the electron is simply a property of 
the particle, like its charge or its mass. It has nothing to do with 
whether the electron is moving or not, and it does not come from any 
internal motion within the electron. Nobody has ever succeeded in 
finding any internal structure inside the electron, and even if there 
was internal structure, it would be mathematically impossible for it 
to result in a half-unit of angular momentum. 


We simply have to accept this i/2 angular momentum, called 
the “spin” of the electron — Mother Nature rubs our noses in it as 
an observed fact. | Protons and neutrons have the same /i/2 spin, 
while photons have an intrinsic spin of h. In general, half-integer 
spins are typical of material particles. Integral values are found for 
the particles that carry forces: photons, which embody the electric 
and magnetic fields of force, as well as the more exotic messengers 


Se spin, s 


orbital angular 
momentum, 


i/The top has angular mo- 
mentum both because of the 
motion of its center of mass 
through space and due to its 
internal rotation. Electron spin is 
roughly analogous to the intrinsic 
spin of the top. 


Section 13.4 The Atom 835 


of the nuclear and gravitational forces. 


As was the case with ordinary angular momentum, we can de- 
scribe spin angular momentum in terms of its magnitude, and its 
component along a given axis. We write s and s, for these quantities, 
expressed in units of h, so an electron has s = 1/2 and s, = +1/2 
or —1/2. 


Taking electron spin into account, we need a total of four quan- 
tum numbers to label a state of an electron in the hydrogen atom: 
n, €, €,, and s,. (We omit s because it always has the same value.) 
The symbols and include only the angular momentum the electron 
has because it is moving through space, not its spin angular mo- 
mentum. The availability of two possible spin states of the electron 
leads to a doubling of the numbers of states: 


f=). £=0,> = 0, 8, = +1/2 or —1/2 | two states 
na2, £0, t= 0, s, = +1/2 or —1/2 | two states 
n=2, €=1, ¢,=-1,0,orl, s,=+1/2 or —1/2 | six states 


A note about notation 


There are unfortunately two inconsistent systems of notation 
for the quantum numbers we’ve been discussing. The notation I’ve 
been using is the one that is used in nuclear physics, but there is a 
different one that is used in atomic physics. 


nuclear physics | atomic physics 
same 
£ same 
Le no notation 
fy no notation 
ty m 
gly? no notation (sometimes o) 
Se no notation 
Sy no notation 
Sz 8 


The nuclear physics notation is more logical (not giving special sta- 
tus to the z axis) and more memorable (¢, rather than the obscure 
m), which is why I use it consistently in this book, even though 
nearly all the applications we’ll consider are atomic ones. 


We are further encumbered with the following historically de- 
rived letter labels, which deserve to be eliminated in favor of the 
simpler numerical ones: 


£=0 Leh" L=2: £3 
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The spdf labels are used in both nuclear® and atomic physics, while 
the KLMNOPQ letters are used only to refer to states of electrons. 


And finally, there is a piece of notation that is good and useful, 
but which I simply haven’t mentioned yet. The vector j = €+s 
stands for the total angular momentum of a particle in units of h, 
including both orbital and spin parts. This quantum number turns 
out to be very useful in nuclear physics, because nuclear forces tend 
to exchange orbital and spin angular momentum, so a given energy 
level often contains a mixture of @ and s values, while remaining 
fairly pure in terms of 7. 


13.4.6 Atoms with more than one electron 


What about other atoms besides hydrogen? It would seem that 
things would get much more complex with the addition of a second 
electron. A hydrogen atom only has one particle that moves around 
much, since the nucleus is so heavy and nearly immobile. Helium, 
with two, would be a mess. Instead of a wavefunction whose square 
tells us the probability of finding a single electron at any given lo- 
cation in space, a helium atom would need to have a wavefunction 
whose square would tell us the probability of finding two electrons 
at any given combination of points. Ouch! In addition, we would 
have the extra complication of the electrical interaction between the 
two electrons, rather than being able to imagine everything in terms 
of an electron moving in a static field of force created by the nucleus 
alone. 


Despite all this, it turns out that we can get a surprisingly good 
description of many-electron atoms simply by assuming the elec- 
trons can occupy the same standing-wave patterns that exist in a 
hydrogen atom. The ground state of helium, for example, would 
have both electrons in states that are very similar to the n = 1 
states of hydrogen. The second-lowest-energy state of helium would 
have one electron in an n = 1 state, and the other in an n = 2 states. 
The relatively complex spectra of elements heavier than hydrogen 
can be understood as arising from the great number of possible com- 
binations of states for the electrons. 


A surprising thing happens, however, with lithium, the three- 
electron atom. We would expect the ground state of this atom to 
be one in which all three electrons settle down into n = 1 states. 
What really happens is that two electrons go into n = 1 states, but 
the third stays up in an n = 2 state. This is a consequence of a new 
principle of physics: 


The Pauli Exclusion Principle 
Only one electron can ever occupy a given state. 


° After f, the series continues in alphabetical order. In nuclei that are spinning 
rapidly enough that they are almost breaking apart, individual protons and 
neutrons can be stirred up to @ values as high as 7, which is j. 
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Na 


j/The beginning of the peri- 
odic table. 


k/Hydrogen is highly — reac- 
tive. 


There are two n = 1 states, one with s, = +1/2 and one with 
s, = —1/2, but there is no third n = 1 state for lithium’s third 
electron to occupy, so it is forced to go into an n = 2 state. 


It can be proved mathematically that the Pauli exclusion prin- 
ciple applies to any type of particle that has half-integer spin. Thus 
two neutrons can never occupy the same state, and likewise for two 
protons. Photons, however, are immune to the exclusion principle 
because their spin is an integer. 


Deriving the periodic table 


We can now account for the structure of the periodic table, which 
seemed so mysterious even to its inventor Mendeleev. The first row 
consists of atoms with electrons only in the n = 1 states: 


H_ 1 electron in an n = 1 state 
He 2 electrons in the two n = 1 states 


The next row is built by filling the n = 2 energy levels: 


Li 2 electrons in n = 1 states, 1 electron in an n = 2 state 
Be 2 electrons in n = 1 states, 2 electrons in n = 2 states 


2 electrons in n = 1 states, 6 electrons in n = 2 states 
2 electrons in n = 1 states, 7 electrons in n = 2 states 
Ne 2 electrons in n = 1 states, 8 electrons in n = 2 states 


moO: 


In the third row we start in on the n = 3 levels: 


Na _ 2 electrons in n = 1 states, 8 electrons in n = 2 states, 1 
electron in an n = 3 state 


We can now see a logical link between the filling of the energy 
levels and the structure of the periodic table. Column 0, for exam- 
ple, consists of atoms with the right number of electrons to fill all 
the available states up to a certain value of n. Column I contains 
atoms like lithium that have just one electron more than that. 


This shows that the columns relate to the filling of energy levels, 
but why does that have anything to do with chemistry? Why, for 
example, are the elements in columns I and VII dangerously reac- 
tive? Consider, for example, the element sodium (Na), which is so 
reactive that it may burst into flames when exposed to air. The 
electron in the n = 3 state has an unusually high energy. If we let 
a sodium atom come in contact with an oxygen atom, energy can 
be released by transferring the n = 3 electron from the sodium to 
one of the vacant lower-energy n = 2 states in the oxygen. This 
energy is transformed into heat. Any atom in column I is highly 
reactive for the same reason: it can release energy by giving away 
the electron that has an unusually high energy. 


Column VII is spectacularly reactive for the opposite reason: 
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these atoms have a single vacancy in a low-energy state, so energy 
is released when these atoms steal an electron from another atom. 


It might seem as though these arguments would only explain 
reactions of atoms that are in different rows of the periodic table, 
because only in these reactions can a transferred electron move from 
a higher-n state to a lower-n state. This is incorrect. An n = 2 elec- 
tron in fluorine (F), for example, would have a different energy than 
an n = 2 electron in lithium (Li), due to the different number of 
protons and electrons with which it is interacting. Roughly speak- 
ing, the n = 2 electron in fluorine is more tightly bound (lower in 
energy) because of the larger number of protons attracting it. The 
effect of the increased number of attracting protons is only partly 
counteracted by the increase in the number of repelling electrons, 
because the forces exerted on an electron by the other electrons are 
in many different directions and cancel out partially. 
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Problems 
The symbols v, “, etc. are explained on page 848. 


1 If a radioactive substance has a half-life of one year, does this 
mean that it will be completely decayed after two years? Explain. 


~ 


2 What is the probability of rolling a pair of dice and getting 
“snake eyes,” i.e., both dice come up with ones? & 


3 Use a calculator to check the approximation that a? ~ 1+ 
blna, if b < 1, using some arbitrary numbers. See how good the 
approximation is for values of b that are not quite as small compared 
to one. B 


4 Make up an example of a numerical problem involving a rate 
of decay where At < t/2, but the exact expression for the rate 
of decay on page 782 can still be evaluated on a calculator with- 
out getting something that rounds off to zero. Check that you get 
approximately the same result using both methods from subsection 
13.1.4 to calculate the number of decays between ¢ and t+ At. Keep 
plenty of significant figures in your results, in order to show the dif- 
ference between them. gz 


5 Refer to the probability distribution for people’s heights in 
figure f on page 778. 

(a) Show that the graph is properly normalized. 

(b) Estimate the fraction of the population having heights between 
140 and 150 cm. Vv Oo 


6 (a) A nuclear physicist is studying a nuclear reaction caused in 
an accelerator experiment, with a beam of ions from the accelerator 
striking a thin metal foil and causing nuclear reactions when a nu- 
cleus from one of the beam ions happens to hit one of the nuclei in 
the target. After the experiment has been running for a few hours, 
a few billion radioactive atoms have been produced, embedded in 
the target. She does not know what nuclei are being produced, but 
she suspects they are an isotope of some heavy element such as Pb, 
Bi, Fr or U. Following one such experiment, she takes the target foil 
out of the accelerator, sticks it in front of a detector, measures the 
activity every 5 min, and makes a graph (figure). The isotopes she 
thinks may have been produced are: 


isotope _ half-life (minutes) 


2Ph —s-. 36.1 
24PH 26.8 
cal 19.7 
223 Fy 21.8 
aon, 23.5 


Which one is it? 
(b) Having decided that the original experimental conditions pro- 
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duced one specific isotope, she now tries using beams of ions travel- 
ing at several different speeds, which may cause different reactions. 
The following table gives the activity of the target 10, 20 and 30 min- 
utes after the end of the experiment, for three different ion speeds. 


activity (millions of decays/s) after... 
10min 20min 30 min 

first ion speed 1.933 0.832 0.382 

second ion speed 1.200 0.545 0.248 

third ion speed 6.544 1.296 0.248 


Since such a large number of decays is being counted, assume that 
the data are only inaccurate due to rounding off when writing down 
the table. Which are consistent with the production of a single 
isotope, and which imply that more than one isotope was being 
created? es 


7 Devise a method for testing experimentally the hypothesis 
that a gambler’s chance of winning at craps is independent of her 
previous record of wins and losses. a 


8 A blindfolded person fires a gun at a circular target of radius 
b, and is allowed to continue firing until a shot actually hits it. Any 
part of the target is equally likely to get hit. We measure the ran- 
dom distance r from the center of the circle to where the bullet went 
in. 

(a) Show that the probability distribution of r must be of the form 
D(r) = kr, where k is some constant. (Of course we have D(r) = 0 


for r > b.) 
(b) Determine k by requiring D to be properly normalized. v 
(c) Find the average value of r. Vv 


(d) Interpreting your result from part c, how does it compare with 
b/2? Does this make sense? Explain. = 


9 We are given some atoms of a certain radioactive isotope, with 
half-life t,/2. We pick one atom at random, and observe it for one 
half-life, starting at time zero. If it decays during that one-half- 
life period, we record the time t at which the decay occurred. If it 
doesn’t, we reset our clock to zero and keep trying until we get an 
atom that cooperates. The final result is a time 0 < t < ty/2, witha 
distribution that looks like the usual exponential decay curve, but 
with its tail chopped off. 

(a) Find the distribution D(t), with the proper normalization. V 


(b) Find the average value of t. Vv 


(c) Interpreting your result from part b, how does it compare with 
ty 2/2? Does this make sense? Explain. a 


Problems 
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10 The speed, v, of an atom in an ideal gas has a probability 
distribution of the form D(v) = bue~°”, where 0 < v < 00, c relates 
to the temperature, and } is determined by normalization. 

(a) Sketch the distribution. 

(b) Find 6 in terms of c. Vv 


(c) Find the average speed in terms of c, eliminating b. (Don’t 
try to do the indefinite integral, because it can’t be done in closed 
form. The relevant definite integral can be found in tables or done 
with computer software.) vo 


11 All helium on earth is from the decay of naturally occurring 
heavy radioactive elements such as uranium. Each alpha particle 
that is emitted ends up claiming two electrons, which makes it a 
helium atom. If the original 2°°U atom is in solid rock (as opposed 
to the earth’s molten regions), the He atoms are unable to diffuse 
out of the rock. This problem involves dating a rock using the 
known decay properties of uranium 238. Suppose a geologist finds 
a sample of hardened lava, melts it in a furnace, and finds that it 
contains 1230 mg of uranium and 2.3 mg of helium. ??°U decays by 
alpha emission, with a half-life of 4.5 x 10° years. The subsequent 
chain of alpha and electron (beta) decays involves much shorter half- 
lives, and terminates in the stable nucleus ?°°Pb. Almost all natural 
uranium is 2°°U, and the chemical composition of this rock indicates 
that there were no decay chains involved other than that of ?2°U. 
(a) How many alphas are emitted per decay chain? [Hint: Use 
conservation of mass.| 

(b) How many electrons are emitted per decay chain? [Hint: Use 
conservation of charge.] 

(c) How long has it been since the lava originally hardened? V 


12 When light is reflected from a mirror, perhaps only 80% of 
the energy comes back. One could try to explain this in two different 
ways: (1) 80% of the photons are reflected, or (2) all the photons are 
reflected, but each loses 20% of its energy. Based on your everyday 
knowledge about mirrors, how can you tell which interpretation is 
correct? [Based on a problem from PSSC Physics.| g 


13 Suppose we want to build an electronic light sensor using an 
apparatus like the one described in the section on the photoelectric 
effect. How would its ability to detect different parts of the spectrum 
depend on the type of metal used in the capacitor plates? a 
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14 The photoelectric effect can occur not just for metal cathodes 
but for any substance, including living tissue. Ionization of DNA 
molecules can cause cancer or birth defects. If the energy required 
to ionize DNA is on the same order of magnitude as the energy 
required to produce the photoelectric effect in a metal, which of the 
following types of electromagnetic waves might pose such a hazard? 
Explain. 


60 Hz waves from power lines 

100 MHz FM radio 

microwaves from a microwave oven 
visible light 

ultraviolet light 

x-rays 


~ 


15 (a) Rank-order the photons according to their wavelengths, 
frequencies, and energies. If two are equal, say so. Explain all your 
answers. 

(b) Photon 3 was emitted by a xenon atom going from its second- 
lowest-energy state to its lowest-energy state. Which of photons 1, 
2, and 4 are capable of exciting a xenon atom from its lowest-energy 
state to its second-lowest-energy state? Explain. E 


16 Which figure could be an electron speeding up as it moves 
to the right? Explain. le 


17 The beam of a 100-W overhead projector covers an area 
of 1 m x 1 m when it hits the screen 3 m away. Estimate the 
number of photons that are in flight at any given time. (Since this 
is only an estimate, we can ignore the fact that the beam is not 
parallel.) vo 


18 In the photoelectric effect, electrons are observed with virtu- 
ally no time delay (~ 10 ns), even when the light source is very weak. 
(A weak light source does however only produce a small number of 
ejected electrons.) The purpose of this problem is to show that the 
lack of a significant time delay contradicted the classical wave the- 
ory of light, so throughout this problem you should put yourself in 
the shoes of a classical physicist and pretend you don’t know about 
photons at all. At that time, it was thought that the electron might 
have a radius on the order of 10~'° m. (Recent experiments have 
shown that if the electron has any finite size at all, it is far smaller.) 
(a) Estimate the power that would be soaked up by a single electron 
in a beam of light with an intensity of 1 mW/m?. Vv 
(b) The energy, F,, required for the electron to escape through the 
surface of the cathode is on the order of 107!9 J. Find how long it 
would take the electron to absorb this amount of energy, and explain 
why your result constitutes strong evidence that there is something 
wrong with the classical theory. vio 


Problem 15. 
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19 = Ina television, suppose the electrons are accelerated from rest 
through a voltage difference of 104 V. What is their final wavelength? 
vo 


20 Use the Heisenberg uncertainty principle to estimate the 
minimum velocity of a proton or neutron in a ?°8Pb nucleus, which 
has a diameter of about 13 fm (1 fm=10~!° m). Assume that the 
speed is nonrelativistic, and then check at the end whether this 
assumption was warranted. vo 


21 Find the energy of a particle in a one-dimensional box of 
length DL, expressing your result in terms of L, the particle’s mass 
m, the number of peaks and valleys n in the wavefunction, and 
fundamental constants. vo 


22 A free electron that contributes to the current in an ohmic 
material typically has a speed of 10° m/s (much greater than the 
drift velocity). 


(a) Estimate its de Broglie wavelength, in nm. v 
(b) If a computer memory chip contains 10° electric circuits in a 1 
cm? area, estimate the linear size, in nm, of one such circuit. v 


(c) Based on your answers from parts a and b, does an electrical 
engineer designing such a chip need to worry about wave effects 
such as diffraction? 

(d) Estimate the maximum number of electric circuits that can fit on 
a 1 cm? computer chip before quantum-mechanical effects become 
important. a 


23 In classical mechanics, an interaction energy of the form 
Ui = ska? gives a harmonic oscillator: the particle moves back 
and forth at a frequency w = \/k/m. This form for U(x) is often 
a good approximation for an individual atom in a solid, which can 
vibrate around its equilibrium position at x = 0. (For simplicity, we 
restrict our treatment to one dimension, and we treat the atom as 
a single particle rather than as a nucleus surrounded by electrons). 
The atom, however, should be treated quantum-mechanically, not 
clasically. It will have a wave function. We expect this wave function 
to have one or more peaks in the classically allowed region, and we 
expect it to tail off in the classically forbidden regions to the right 
and left. Since the shape of U(x) is a parabola, not a series of flat 
steps as in figure 1 on page 818, the wavy part in the middle will not 
be a sine wave, and the tails will not be exponentials. 

(a) Show that there is a solution to the Schrédinger equation of the 
form 

Ue):= ene ; 


and relate b to k, m, and h. To do this, calculate the second deriva- 
tive, plug the result into the Schrodinger equation, and then find 
what value of b would make the equation valid for all values of x. 
This wavefunction turns out to be the ground state. Note that this 
wavefunction is not properly normalized — don’t worry about that. 
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(b) Sketch a graph showing what this wavefunction looks like. 

(c) Let’s interpret b. If you changed b, how would the wavefunc- 
tion look different? Demonstrate by sketching two graphs, one for 
a smaller value of b, and one for a larger value. 

(d) Making k greater means making the atom more tightly bound. 
Mathematically, what happens to the value of 6 in your result from 
part aif you make k& greater? Does this make sense physically when 
you compare with part c? vo 


24 (a) A distance scale is shown below the wavefunctions and 
probability densities illustrated in figure f on page 829. Compare this 
with the order-of-magnitude estimate derived in subsection 13.4.4 
for the radius r at which the wavefunction begins tailing off. Was 
the estimate on the right order of magnitude? 

(b) Although we normally say the moon orbits the earth, actually 
they both orbit around their common center of mass, which is below 
the earth’s surface but not at its center. The same is true of the 
hydrogen atom. Does the center of mass lie inside the proton, or 
outside it? 


= 


25 The figure shows eight of the possible ways in which an 
electron in a hydrogen atom could drop from a higher energy state to 
a state of lower energy, releasing the difference in energy as a photon. 
Of these eight transitions, only D, E, and F produce photons with 
wavelengths in the visible spectrum. 

(a) Which of the visible transitions would be closest to the violet 
end of the spectrum, and which would be closest to the red end? 
Explain. 

(b) In what part of the electromagnetic spectrum would the photons 
from transitions A, B, and C lie? What about G and H? Explain. 
(c) Is there an upper limit to the wavelengths that could be emitted 
by a hydrogen atom going from one bound state to another bound 
state? Is there a lower limit? Explain. E 


26 Find an equation for the wavelength of the photon emit- 
ted when the electron in a hydrogen atom makes a transition from 
energy level n to level ng. vo 


27 Estimate the angular momentum of a spinning basketball, 
in units of h. Explain how this result relates to the correspondence 
principle. g 


28 Assume that the kinetic energy of an electron the n = 1 
state of a hydrogen atom is on the same order of magnitude as the 
absolute value of its total energy, and estimate a typical speed at 
which it would be moving. (It cannot really have a single, definite 
speed, because its kinetic and interaction energy trade off at different 
distances from the proton, but this is just a rough estimate of a 
typical speed.) Based on this speed, were we justified in assuming 
that the electron could be described nonrelativistically? a 


S 
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29 Before the quantum theory, experimentalists noted that in 
many cases, they would find three lines in the spectrum of the same 
atom that satisfied the following mysterious rule: 1/A; = 1/A2 + 
1/A3. Explain why this would occur. Do not use reasoning that 
only works for hydrogen — such combinations occur in the spectra 
of all elements. [Hint: Restate the equation in terms of the energies 
of photons.| 5 


30 The wavefunction of the electron in the ground state of a 
hydrogen atom is 


v= gq t/2q-3/2e-7/a 


where r is the distance from the proton, and a = 5.3x 107'' misa 
constant that sets the size of the wave. 

(a) Calculate symbolically, without plugging in numbers, the prob- 
ability that at any moment, the electron is inside the proton. As- 
sume the proton is a sphere with a radius of b = 0.5 fm. [Hint: 
Does it matter if you plug in r = 0 or r = b in the equation for the 
wavefunction? | Vv 
(b) Calculate the probability numerically. Vv 
(c) Based on the equation for the wavefunction, is it valid to think 
of a hydrogen atom as having a finite size? Can a be interpreted 
as the size of the atom, beyond which there is nothing? Or is there 
any limit on how far the electron can be from the proton? g 


31 Use physical reasoning to explain how the equation for the 
energy levels of hydrogen, 
mkze* 1 


he n2 , 


En = — 


should be generalized to the case of an atom with atomic number Z 
that has had all its electrons removed except for one. ee 


32 A muon is a subatomic particle that acts exactly like an 
electron except that its mass is 207 times greater. Muons can be 
created by cosmic rays, and it can happen that one of an atom’s 
electrons is displaced by a muon, forming a muonic atom. If this 
happens to a hydrogen atom, the resulting system consists simply 
of a proton plus a muon. 

(a) Based on the results of section 13.4.4, how would the size of a 
muonic hydrogen atom in its ground state compare with the size of 
the normal atom? 

(b) If you were searching for muonic atoms in the sun or in the 
earth’s atmosphere by spectroscopy, in what part of the electromag- 
netic spectrum would you expect to find the absorption lines? = & 


33 A photon collides with an electron and rebounds from the 
collision at 180 degrees, i.e., going back along the path on which 
it came. The rebounding photon has a different energy, and there- 
fore a different frequency and wavelength. Show that, based on 
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conservation of energy and momentum, the difference between the 
photon’s initial and final wavelengths must be 2h/mc, where m is 
the mass of the electron. The experimental verification of this type 
of “pool-ball” behavior by Arthur Compton in 1923 was taken as 
definitive proof of the particle nature of light. Note that we’re not 
making any nonrelativistic approximations. To keep the algebra 
simple, you should use natural units — in fact, it’s a good idea to 
use even-more-natural-than-natural units, in which we have not just 
c= 1 but also h = 1, and m = 1 for the mass of the electron. You'll 
also probably want to use the relativistic relationship E? — p? = m?, 
which becomes E? — p? = 1 for the energy and momentum of the 
electron in these units. g 


34 Generalize the result of problem 33 to the case where the 
photon bounces off at an angle other than 180°with respect to its 
initial direction of motion. B 


35 On page 818 we derived an expression for the probability 
that a particle would tunnel through a rectangular barrier, i.e., a 
region in which the interaction energy U(x) has a graph that looks 
like a rectangle. Generalize this to a barrier of any shape. [Hints: 
First try generalizing to two rectangular barriers in a row, and then 
use a series of rectangular barriers to approximate the actual curve 
of an arbitrary function U(x). Note that the width and height of 
the barrier in the original equation occur in such a way that all that 
matters is the area under the U-versus-x curve. Show that this is 
still true for a series of rectangular barriers, and generalize using an 
integral.| If you had done this calculation in the 1930’s you could 
have become a famous physicist. = 


36 Show that the wavefunction given in problem 30 is properly 
normalized. z 


37 Show that a wavefunction of the form UV = sinaxe’Y is a 
possible solution of the Schrédinger equation. Would it apply to a 
classically allowed region, or a classically forbidden one? a 


38 Find the energy levels of a particle in a three-dimensional 
rectangular box with sides of length a, b, and c. Eg 


39 Americium-241 is an artificial isotope used in smoke de- 
tectors. It undergoes alpha decay, with a half-life of 432 years. As 
discussed in example 17 on page 820, alpha decay can be understood 
as a tunneling process, and although the barrier is not rectangular 
in shape, the equation for the tunneling probability on page 819 
can still be used as a rough guide to our thinking. For americium- 
241, the tunneling probability is about 1 x 10779. Suppose that 
this nucleus were to decay by emitting a tritium (helium-3) nucleus 
instead of an alpha particle (helium-4). Estimate the relevant tun- 
neling probability, assuming that the total energy E remains the 
same. This higher probability is contrary to the empirical observa- 
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tion that this nucleus is not observed to decay by tritium emission 
with any significant probability, and in general tritium emission is 
almost unknown in nature; this is mainly because the tritium nu- 
cleus is far less stable than the helium-4 nucleus, and the difference 
in binding energy causes energy the available for the decay to be 
lower. x 


40 As far as we know, the mass of the photon is zero. However, 
it’s not possible to prove by experiments that anything is zero; all 
we can do is put an upper limit on the number. As of 2008, the 
best experimental upper limit on the mass of the photon is about 
1 x 10-2 kg. Suppose that the photon’s mass really isn’t zero, and 
that the value is at the top of the range that is consistent with 
the present experimental evidence. In this case, the c occurring in 
relativity would no longer be interpreted as the speed of light. As 
with material particles, the speed v of a photon would depend on its 
energy, and could never be as great as c. Estimate the relative size 
(c — v)/c of the discrepancy in speed, in the case of a photon with 
a frequency of 1 kHz, lying in the very low frequency radio range. 
> Answer, p. 875 © 


Key to symbols: 
easy © typical challenging Mdifficult MM very difficult 
Vv An answer check is available at www.lightandmatter.com. 
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Exercises 
Exercise 13A: Quantum Versus Classical Randomness 


1. Imagine the classical version of the particle in a one-dimensional box. Suppose you insert 
the particle in the box and give it a known, predetermined energy, but a random initial position 
and a random direction of motion. You then pick a random later moment in time to see where 
it is. Sketch the resulting probability distribution by shading on top of a line segment. Does 
the probability distribution depend on energy? 


2. Do similar sketches for the first few energy levels of the quantum mechanical particle in a 
box, and compare with 1. 


3. Do the same thing as in 1, but for a classical hydrogen atom in two dimensions, which acts 
just like a miniature solar system. Assume you’re always starting out with the same fixed values 
of energy and angular momentum, but a position and direction of motion that are otherwise 
random. Do this for L = 0, and compare with a real L = 0 probability distribution for the 
hydrogen atom. 


4. Repeat 3 for a nonzero value of L, say L=h. 


5. Summarize: Are the classical probability distributions accurate? What qualitative features 
are possessed by the classical diagrams but not by the quantum mechanical ones, or vice-versa? 
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Three Essential Mathematical Skills 


More often than not when a search-and-rescue team finds a hiker 
dead in the wilderness, it turns out that the person won a Darwin 
Award by not carrying some item from a short list of essentials, such 
as water and a map. There are three mathematical essentials in this 
course. 


1. Converting units 


basic technique: subsection 0.1.9, p. 29; conversion of area, vol- 
ume, etc.: subsection 0.2.1, p. 34 


Examples: 


10° g 


0.7 ké x i 


To check that we have the conversion factor the right way up (10° 
rather then 1/10), we note that the smaller unit of grams has been 
compensated for by making the number larger. 


For units like m?, kg/m®, etc., we have to raise the conversion 
factor to the appropriate power: 


3 10? mm 3 9 mm? 9 3 
4m? x {| ———— ] =4x 10 wf? x = 4x 10° mm 


lm wo 


Examples with solutions — p. 47, #6; p. 51, #31 


Problems you can check at lightandmatter.com/areaichecker. 
html — p. 47, #5; p. 47, #4; p. 47, #7; p. 51, #22; p. 52, #40 


2. Reasoning about ratios and proportionali- 
ties 

The technique is introduced in subsection 0.2.2, p. 35, in the 
context of area and volume, but it applies more generally to any 
relationship in which one variable depends on another raised to some 
power. 


Example: When a car or truck travels over a road, there is wear 
and tear on the road surface, which incurs a cost. Studies show that 
the cost per kilometer of travel C' is given by 


C = kw* ; 


where w is the weight per axle and & is a constant. The weight per 
axle is abot 13 times higher for a semi-trailer than for my Honda 
Fit. How many times greater is the cost imposed on the federal 
government when the semi travels a given distance on an interstate 
freeway? 
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> First we convert the equation into a proportionality by throwing 
out k, which is the same for both vehicles: 


Cx w* 


Next we convert this proportionality to a statement about ratios: 


Cl Wi ‘ 
— = (— ] #& 29,000 
C2 (= ) . 

Since the gas taxes I pay to drive my Fit are nowhere near 29,000 
times more than those paid to drive the truck the same distance, 
the federal government is effectively awarding a massive subsidy to 
the trucking company. Plus my Fit is cuter. 


Examples with solutions — p. 51, #32; p. 51, #33; p. 52, #38; 
p. 49, #17 


Problems you can check at lightandmatter.com/areaichecker. 


html — p. 52, #37; p. 52, #39; p. 121, #24; p. 122, #27; p. 119, 
#9; p. 286, #3 

3. Vector addition 

subsection 3.4.3, p. 197 


Example: The Ar vector from San Diego to Los Angeles has 
magnitude 190 km and direction 129°counterclockwise from east. 
The one from LA to Las Vegas is 370 km at 38°counterclockwise 
from east. Find the distance and direction from San Diego to Las 
Vegas. 


> Graphical addition is discussed on p. 198. Here we concentrate on 
analytic addition, which involves adding the x components to find 
the total « component, and similarly for y. The trig needed in order 
to find the components of the second leg (LA to Vegas) is laid out 
in figure 1 on p. 196 and explained in detail in example 58 on p. 196: 


Ax = (370 km) cos 38° = 292 km 
Ay2 = (370 km) sin 38° = 228 km 
(Since these are intermediate results, we keep an extra sig fig to 
avoid accumulating too much rounding error.) Once we understand 
the trig for one example, we don’t need to reinvent the wheel every 
time. The pattern is completely universal, provided that we first 
make sure to get the angle expressed according to the usual trig 
convention, counterclockwise from the x axis. Applying the pattern 
to the first leg, we have: 
Ax, = (190 km) cos 129° = —120 km 
Ay; = (190 km) sin 129° = 148 km 


For the vector directly from San Diego to Las Vegas, we have 


Aa = Ag, + Avg = 172 km 
Ay = Ay; + Ay2 = 376 km 


Exercises 


851 


852 


The distance from San Diego to Las Vegas is found using the Pythagorean 
theorem, 


/(172 km)? + (376 km)? = 410 km 
(rounded to two sig figs because it’s one of our final results). The 
direction is one of the two possible values of the inverse tangent 
tan !(Ay/Az) = {65°, 245°} 
Consulting a sketch shows that the first of these values is the correct 
one. 


Examples with solutions — p. 229, #62; p. 229, #63; p. 226, 
#45 

Problems you can check at lightandmatter.com/areaichecker. 
html — p. 227, #£53; p. 228, #57; p. 228, #£58; p. 235, #79; p. 226, 
#46 
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Programming With Py 


The purpose of this tutorial is to help you 
get familiar with a computer programming lan- 
guage called Python, which I’ve chosen be- 
cause (a) it’s free, and (b) it’s easy to use inter- 
actively. I won’t assume you have any previous 
experience with computer programming; you 
won’t need to learn very much Python, and 
what little you do need to learn [ll explain 
explicitly. If you really want to learn Python 
more thoroughly, there are a couple of excel- 
lent books that you can download for free on 
the Web: 


How to Think Like a Computer Sci- 


entist (Python Version), Allen B. Dow- 


ney, Jeffrey Elkner, Moshe Zadka, 
http://www. ibiblio.org/obp/ 


Dive Into Python, Mark Pilgrim, 
http://diveintopython.net/ 


The first book is meant for people who have 
never programmed before, while the second is 
a more complete introduction aimed at vet- 
eran programmers who know a different lan- 
guage already. 


Using Python as a calculator 


The easiest way to get Python going is to 
go to the web site ideone.com. Under “choose 
a language,” select Python. Inside the win- 
dow where it says “paste your source code or 
insert template or sample,” type print (2+2). 
Click on the “submit” button. The result, 4, 
is shown under “output.” In other words, you 
can use Python just like a calculator. 


For compactness, Ill show examples in the 
following style: 


>>> print (2+2) 
4 


Here the >>> is not something you would type 
yourself; it’s just a marker to distinguish your 
input from the program’s output. (In some 


tO Msions of Python, the computer will 


actually print out >>> as a prompt to tell you 
it’s ready to type something.) 


There are only a couple of things to watch 
out for. First, Python distinguishes between 
integers and real numbers, so the following 
gives an unexpected result: 


>>> print (2/3) 
0) 


To get it to treat these values as real numbers, 
you have to use decimal points: 


>>> print(2./3.) 
0. 6666666666666666666663 


GOK 


Multiplication is represented by 


>>> print (2.*3.) 
6.0 


Also, Python doesn’t know about its own li- 
brary of math functions unless you tell it ex- 
plicitly to load them in: 


>>> print (sqrt (2.)) 
Traceback (most recent call last): 
File ‘‘<stdin>’’, line 1, in ? 


NameError: There is no variable named ‘sqrt’ 


Here are the steps you have to go through to 
calculate the square root of 2 successfully: 


>>> import math 
>>> print (math.sqrt(2.)) 
1.4142135623730951 


The first line is just something you can make a 
habit of doing every time you start up Python. 
In the second line, the name of the square 
root function had to be prefixed with “math.” 
to tell Python where you wanted to get this 
“sqrt” function from. (All of this may seem 
like a nuisance if you’re just using Python as a 
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calculator, but it’s a good way to design a pro- 
gramming language so that names of functions 
never conflict.) 


Try it. Experiment and figure out whether 
Python’s trig functions assume radians or 
degrees. 


Variables 


Python lets you define variables and assign 
values to them using an equals sign: 


>>> dwarfs=7 

>>> print (dwarfs) 
>>> print (dwarfs+3) 
‘6 

10 


Note that a variable in computer program- 
ming isn’t quite like a variable in algebra. In 
algebra, if a=7 then a=7 always, throughout a 
particular calculation. But in a programming 
language, the variable name really represents 
a place in memory where a number can be 
stored, so you can change its value: 


>>> dwarfs=7 

>>> dwarfs=37 
>>> print (dwarfs) 
37 


You can even do stuff like this, 


>>> dwarfs=37 

>>> dwarfs=dwarfs+1 
>>> print (dwarfs) 
38 


In algebra it would be nonsense to have a vari- 
able equal to itself plus one, but in a com- 
puter program, it’s not an assertion that the 
two things are equal, its a command to calcu- 
late the value of the expression on the right 
side of the equals, and then put that number 
into the memory location referred to by the 
variable name on the left. 


Try it. What happens if you do dwarfs+1 = 
dwarfs? Do you understand why? 
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Functions 


Somebody had to teach Python how to do 
functions like sqrt, and it’s handy to be able 
to define your own functions in the same way. 
Here’s how to do it: 


>>> def double(x): 
>>> return 2.*x 
>>> print (double(5.)) 
10.0 


Note that the indentation is mandatory. The 
first and second lines define a function called 
double. The final line evaluates that function 
with an input of 5. 


Loops 
Suppose we want to add up all the numbers 
from 0 to 99. 


Automating this kind of thing is exactly 
what computers are best at, and Python pro- 
vides a mechanism for this called a loop: 


>>> sum=0 

>>> for j in range(100): 
>>> sum=sumt+j 

>>> print (sum) 

4950 


The stuff that gets repeated — the inside of 
the loop — has to be indented, just like in 
a function definition. Python always counts 
loops starting from 0, so for j in range(100) 
actually causes j to range from 0 to 99, not 
from 1 to 100. 


Appendix 2: Miscellany 


Unphysical “hovering” solutions to conservation of energy 


On page 83, I gave the following derivation for the acceleration of an object under the influence 


(sz) = (Ge) (a) (Ge) GE) 


(=) roma 


MU 


of gravity: 


= 9 


There is a loophole in this argument, however. When I say du/dk = 1/(mv), that only works 
when the object is moving. If it’s at rest, v is nondifferentiable as a function of Kk (or we could 
say that the derivative is infinite). Energy can in fact be conserved by an object that simply 
hovers above the ground: its kinetic energy is constant, and its gravitational energy is also 
constant. Why, then, do we never observe such behavior, except in Coyote and Roadrunner 
cartoons when the Coyote runs off the edge of a cliff without noticing it at first? 


Suppose we toss a baseball straight up, and pick a coordinate system in which upward 
velocities are positive. The ball’s velocity is a continuous function of time, and it changes 
from being positive to being negative, so there must be some instant at which it equals zero. 
Conservation of energy would be satisfied if the velocity were to remain at zero for a minute 
or an hour before the ball finally made the decision to fall. One thing that seems odd about 
all this is that there’s no obvious way for the ball to “decide” when it was time to go ahead 
and fall back down again. It violates the principle that the laws of physics are supposed to be 
deterministic. 


One reason that we could never hope to observe such behavior in reality is that the ball 
would have to spend some time being exactly at rest, and yet no object can ever stay exactly 
at rest for any finite amount of time. Objects in the real world are buffeted by air currents, 
for example. At the atomic level, the interaction of these air currents with the ball consists of 
discrete collisions with whizzing air molecules, and a quick back-of-the-envelope estimate shows 
that for an object this size, the typical time between collisions is on the order of 107?’ s, which 
would limit the duration of the hovering to a time far too short to allow it to be observed. 
Nevertheless this is not a completely satisfying explanation. It makes us wonder whether we 
ought to apply to the government for a research grant to do an experiment in which a baseball 
would be shot upward in a chamber that had been pumped out to an ultra-high vacuum! 


A somewhat better approach is to consider that motion is relative, so the ball’s velocity can 
only be zero in one particular frame of reference. It wouldn’t make sense for the ball to exhibit 
qualitatively different behavior when it was at rest, because different observers don’t even agree 
when the ball is at rest. But this argument also fails to resolve the issue completely, because this 
is a ball interacting with the planet Earth via gravitational forces, so it could make a difference 
whether the ball was at rest relative to the earth. Suppose we go into a frame of reference defined 
by an observer watching the ball as she descends in a glass elevator. At the moment when the 
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ball is at rest relative to the earth, she sees both the ball and the earth as moving upward at the 
same speed. It would be perfectly consistent with conservation of energy if she were to see them 
maintain this distance from one another for several minutes. In her frame, their kinetic energies 
would be nonzero, but constant, and the gravitational energy only depends on the separation 
between the ball and the earth, so it would be constant as well. 


Now that we’re thinking of the ball and the earth as two objects interacting with one another, 
it becomes natural to think of them on the same footing. What about the motion of the Earth? 
The earth feels a gravitational attraction from the ball, just as the ball feels one from the 
Earth. To make this symmetry more evident, let’s imagine two planets of equal mass, Foo and 
Bar, initially at rest with respect to one another. The Fooites and Barians realize that the 
gravitational interaction between their planets will cause them to drop together and collide. 
It seems that they should get ready for the end of the world. And yet before they riot, get 
drunk, or tell their spouses that yes, they really do look fat in that dress, maybe they should 
consider the possibility that the two planets will simply hover in place for some amount of 
time, because that would satisfy conservation of energy. Now the physical implausibility of the 
hovering solution becomes even more apparent. Not only does one planet have to “decide” at 
precisely what microsecond to go ahead and fall, but the other planet has to make the same 
decision at the same instant, or else conservation of energy will be violated. There is no physical 
process or interaction between the two planets that could perfectly synchronize their “decisions” 
like this. (The mechanism can’t be gravity, because nothing about the gravitational interaction 
provides any kind of a count-down that would pick out one particular time as the one at which 
the planets should start moving.) 


The key to making sense of all this is to realize that each planet can only “feel” the gravita- 
tional field in its own region of space. Its acceleration can only depend on the field, and not on 
the detailed arrangement of masses elsewhere in the universe that caused that field. Granting 
this kind of “real” status to fields can be considered as a logically necessary supplement to 
conservation of energy. 


Automated search for the brachistochrone 


See page 95. 

d=.01 

ci=.61905 

c2=- .94427 

a=il. 

b= 1. 

for i in range(100): 
bestt = 99. 
for j in range(3): 

for k in range(3): 


try_ci = c1+(j-1)*d 
try_c2 = c2+(k-1)*d 


t = timeb(a,b,try_c1,try_c2,100000) 
if t<bestt : 

bestcl = try_ci 

bestc2 = try_c2 

best] = j 

bestk = k 
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bestt = t 
ci = bestcl 
c2 = bestc2 


c3 = (b-c1*a-c2*a**2) / (a**3) 

print(c1, c2, c3, bestt) 

if (bestj == 1) and (bestk == 1) 
d = d*.5 


Derivation of the steady state for damped, driven oscillations 
Using the trig identities for the sine of a sum and cosine of a sum, we can change equation [2] 
on page 176 into the form 
[(—mw? + k) cos 6 — bw sind — F,,/A] sinwt 
+ [(—mw? + k) sind + bw cos 6] coswt = 0 


Both the quantities in square brackets must equal zero, which gives us two equations we can 
use to determine the unknowns A and 6. The results are 


bw 
-1 
6 = tan een 
= tan! eee 
Q(w3 — w?) 
and 
A= Fm 
(mw? —k)? + bw? 
Fin 


my/ (w? — w2)? + w2w?Q-? 


Proofs relating to angular momentum 

Uniqueness of the cross product 

The vector cross product as we have defined it has the following properties: 
(1) It does not violate rotational invariance. 

(2) It has the property A x (B+C)=AxB+AxC. 

(3) It has the property A x (kB) = k(A x B), where & is a scalar. 


Theorem: The definition we have given is the only possible method of multiplying two vectors 
to make a third vector which has these properties, with the exception of trivial redefinitions 
which just involve multiplying all the results by the same constant or swapping the names 
of the axes. (Specifically, using a left-hand rule rather than a right-hand rule corresponds to 
multiplying all the results by —1.) 


Proof: We prove only the uniqueness of the definition, without explicitly proving that it has 
properties (1) through (3). 
Using properties (2) and (3), we can break down any vector multiplication (A;x + Ayy + 


A,Z) x (B;X + Byy + Bz) into terms involving cross products of unit vectors. 


A “self-term” like x x X must either be zero or lie along the x axis, since any other direction 


would violate property (1). If was not zero, then (—x) x (—x) would have to lie in the opposite 
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direction to avoid breaking rotational invariance, but property (3) says that (—x) x (—x) is the 
same as X X X, which is a contradiction. Therefore the self-terms must be zero. 


An “other-term” like x x y could conceivably have components in the x-y plane and along 
the z axis. If there was a nonzero component in the x-y plane, symmetry would require that it 
lie along the diagonal between the x and y axes, and similarly the in-the-plane component of 
(—x) x y would have to be along the other diagonal in the z-y plane. Property (3), however, 
requires that (—x) x y equal —(X x y), which would be along the original diagonal. The only 
way it can lie along both diagonals is if it is zero. 


We now know that x x y must lie along the z axis. Since we are not interested in trivial 
differences in definitions, we can fix x x y = Z, ignoring peurile possibilities such as X x y = 7z 
or the left-handed definition x x y = —z. Given x x y = Z, the symmetry of space requires that 
similar relations hold for y x @ and % x X, with at most a difference in sign. A difference in sign 
could always be eliminated by swapping the names of some of the axes, so ignoring possible trivial 
differences in definitions we can assume that the cyclically related set of relations xX x y = 2, 
y X Z=xX, and zx xX = y holds. Since the arbitrary cross-product with which we started can 
be broken down into these simpler ones, the cross product is uniquely defined. 


The choice of axis theorem 

Theorem: Suppose a closed system of material particles conserves angular momentum in one 
frame of reference, with the axis taken to be at the origin. Then conservation of angular 
momentum is unaffected if the origin is relocated or if we change to a frame of reference that 
is in constant-velocity motion with respect to the first one. The theorem also holds in the case 
where the system is not closed, but the total external force is zero. 


Proof: In the original frame of reference, angular momentum is conserved, so we have dL/dt=0. 
From example 28 on page 282, this derivative can be rewritten as 


= ox Fi ’ 


v7 


where F; is the total force acting on particle 7. In other words, we’re adding up all the torques 
on all the particles. 


By changing to the new frame of reference, we have changed the position vector of each 
particle according to r; > r; +k — ut, where k is a constant vector that indicates the relative 
position of the new origin at t = 0, and u is the velocity of the new frame with respect to the 
old one. The forces are all the same in the new frame of reference, however. In the new frame, 
the rate of change of the angular momentum is 


L 
ae S “(ri +k - ut) x x F; 


a 


=Sonx Fi+(k- ut) ae 


a 


The first term is the expression for the rate of change of the angular momentum in the original 
frame of reference, which is zero by assumption. The second term vanishes by Newton’s third 
law; since the system is closed, every force F; cancels with some force Fj. (If external forces 
act, but they add up to zero, then the sum can be broken up into a sum of internal forces and 
a sum of external forces, each of which is zero.) The rate of change of the angular momentum 
is therefore zero in the new frame of reference. 
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The spin theorem 

Theorem: An object’s angular momentum with respect to some outside axis A can be found 
by adding up two parts: 

(1) The first part is the object’s angular momentum found by using its own center of mass as 
the axis, i.e. the angular momentum the object has because it is spinning. 

(2) The other part equals the angular momentum that the object would have with respect to 
the axis A if it had all its mass concentrated at and moving with its center of mass. 


Proof: Let re, be the position of the center of mass. The total angular momentum is 
L= ye Yr; X p; F 
i 
which can be rewritten as 


L= 0 (tom +¥i — Tem) X Pi ’ 
a 


where rj; — rem is particle i’s position relative to the center of mass. We then have 


L=rom x) pit >_ (ri — Tem) x Pp; 
a a 

= Tem X Ptotal + > (ti — Tem) X Pi 
a 


= Lem X MtotalVem + y (ri = Ken) xX Pp; 
a 


The first and second terms in this expression correspond to the quantities (2) and (1), respec- 
tively. 
Different Forms of Maxwell’s Equations 


First we reproduce Maxwell’s equations as stated on page 649, in integral form, using the SI 
(meter-kilogram-second) system of units, with the coupling constants written in terms of k and 
C: 


Op = Arkdin 
®p= 
O® p 
[eee ee 
a at 
O® 
cTp = a af AtkI through 


Homework problem 39 on page 679 deals with rewriting these in terms of «, = 1/47k and 
Mo = 4rk/c? rather than k and ce. 


For the reader who has been studying the optional sections giving Maxwell’s equations in 
differential form, here is a summary: 
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div E = 4rkp 


divB=0 
OB 
1E=—— 
cur Dt 
E 
curl B= — +4nkj 


Although all engineering and most scientific work these days is done in the SI (mks) system, 
one may still encounter the older cgs (centimeter-gram-second) system, especially in astronomy 
and particle physics. The mechanical units in this system include the dyne (g-cm/s”) for force, 
and the erg (g-cm?/s”) for energy. The system is extended to electrical units by taking k = 1 
as a matter of definition, so the Coulomb force law is F = qiq2/r?. This equation indirectly 
defines a unit of charge called the elestrostatic unit, with 1 C = 2.998 x 10° esu, the factor of 
2.998 arising from the speed of light. The unit of voltage is the statvolt, 1 statvolt = 299.8 V. 
In this system, the electric and magnetic fields have the same units, dynes/esu, but to avoid 
confusion, magnetic fields are normally written using the equivalent unit of gauss, 1 gauss=1 
dyne/esu=10~* T. The force on a charged particle is F = qE + q~ x B, which differs from the 
mks version by the 1/c factor in the magnetic term. Maxwell’s equations are: 


Op = Ar din 
Pp = 
10®p 
Cea ese 
= c Ot 
10%, An 
[Tgp= Caer ar ~~ Lihrough 
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Except as specifically noted below or in a parenthetical credit in the caption of a figure, all the illustrations in 
this book are by under my own copyright, and are copyleft licensed under the same license as the rest of the book. 


In some cases it’s clear from the date that the figure is public domain, but I don’t know the name of the 
artist or photographer; I would be grateful to anyone who could help me to give proper credit. I have assumed 
that images that come from U.S. government web pages are copyright-free, since products of federal agencies fall 
into the public domain. When “PSSC Physics” is given as a credit, it indicates that the figure is from the second 
edition of the textbook entitled Physics, by the Physical Science Study Committee; these are used according to a 
blanket permission given in the later PSSC College Physics edition, which states on the copyright page that “The 
materials taken from the original and second editions and the Advanced Topics of PSSC PHYSICS included in 
this text will be available to all publishers for use in English after December 31, 1970, and in translations after 
December 31, 1975.” 


In a few cases, I have made use of images under the fair use doctrine. However, I am not a lawyer, and the 
laws on fair use are vague, so you should not assume that it’s legal for you to use these images. In particular, 
fair use law may give you less leeway than it gives me, because I’m using the images for educational purposes, 
and giving the book away for free. Likewise, if the photo credit says “courtesy of ...,” that means the copyright 
owner gave me permission to use it, but that doesn’t mean you have permission to use it. 
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Appendix 4: Hints and Solutions 


Hints 


Hints for Chapter 2 

Page 120, problem 16: You can use either the chain-rule technique from page 83 or the 
technique prescribed in problem 15 on p. 120. The positions and velocities of the two masses are 
related to each other, and you'll need to use this relationship to eliminate one mass’s position and 
velocity and get everything in terms of the other mass’s position and velocity. The relationship 
between the two positions will involve some extraneous variables like the length of the string, 
which won’t have any effect on your final result. 


Page 120, problem 17: This is similar to problem ??, but you’re trying to find the combination 
of masses that will result in zero acceleration. In this problem, the distance dropped by one 
weight is different from, but still related to, the distance by which the other weight rises. Try 
relating the heights of the two weights to each other, so you can get the total gravitational 
energy in terms of only one of these heights. 


Page 120, problem 18: This is similar to problem 17, in that you’re looking for a setup that 
will give zero acceleration, and the distance the middle weight rises or falls is not the same as 
the distance the other two weights fall or rise. The simplest approach is to get the three heights 
in terms of 6, so that you can write the gravitational energy in terms of 6. 


Page 120, problem 19: This is very similar to problems ?? and 17. 


Page 120, problem 20: The first two parts can be done more easily by setting a = 1, since 
the value of a only changes the distance scale. 


Page 121, problem 22: The condition for a circular orbit contains three unknowns, v, g, and 
r, so you can’t just solve it for r. You’ll need more equations to make three equations in three 
unknowns. You'll need a relationship between g and r, and also a relationship between v and r 
that uses the given fact that it’s supposed to take 24 hours for an orbit. 


Page 121, problem 25: What does the total energy have to be if the projectile’s velocity is 
exactly escape velocity? Write down conservation of energy, change v to dr/dt, separate the 
variables, and integrate. 


Page 121, problem 26: The analytic approach is a little cumbersome, although it can be 
done by using approximations like 1/1 + ¢ = 1— (1/2)e. A more straightforward, brute-force 
method is simply to write a computer program that calculates U/m for a given point in spherical 
coordinates. By trial and error, you can fairly rapidly find the r that gives a desired value of 
U/m. 

Page 123, problem 33: Use calculus to find the minimum of U. 

Page 123, problem 35: The spring constant of this spring, k, is not the quantity you need 
in the equation for the period. What you need in that equation is the second derivative of the 


spring’s energy with respect to the position of the thing that’s oscillating. You need to start 
by finding the energy stored in the spring as a function of the vertical position, y, of the mass. 
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This is similar to example 23 on page 116. 


Page 124, problem 37: The variables x; and x2 will adjust themselves to reach an equilibrium. 
Write down the total energy in terms of x; and 2x9, then eliminate one variable, and find the 
equilibrium value of the other. Finally, eliminate both x; and x2 from the total energy, getting 
it just in terms of b. 


Hints for Chapter 3 
Page 219, problem 20: Write down two equations, one for Newton’s second law applied to 
each object. Solve these for the two unknowns T' and a. 


Page 225, problem 41: The whole expression for the amplitude has maxima where the stuff 
inside the square root is at a minimum, and vice versa, so you can save yourself a lot of work 
by just working on the stuff inside the square root. For normal, large values of Q, the there 
are two extrema, one at w = 0 and one at resonance; one of these is a maximum and one is a 
minimum. You want to find out at what value of Q the zero-frequency extremum switches over 
from being a maximum to being a minimum. 


Page 230, problem 69: You can use the geometric interpretation of the dot product. 


Page 230, problem 70: The easiest way to do this problem is to use two different coordinate 
systems: one that’s tilted to coincide with the upper slope, and one that’s tilted to coincide 
with the lower one. 


Hints for Chapter 4 

Page 286, problem 8: The choice of axis theorem only applies to a closed system, or to a 
system acted on by a total force of zero. Even if the box is not going to rotate, its center of 
mass is going to accelerate, and this can still cause a change in its angular momentum, unless 
the right axis is chosen. For example, if the axis is chosen at the bottom right corner, then the 
box will start accumulating clockwise angular momentum, even if it is just accelerating to the 
right without rotating. Only by choosing the axis at the center of mass (or at some other point 
on the same horizontal line) do we get a constant, zero angular momentum. 


Page 287, problem 11: There are four forces on the wheel at first, but only three when it lifts 
off. Normal forces are always perpendicular to the surface of contact. Note that the corner of 
the step cannot be perfectly sharp, so the surface of contact for this force really coincides with 
the surface of the wheel. 


Page 287, problem 12: How is this different from the case where you whirl a rock in a circle 
on a string and gradually pull in the string? 


Page 290, problem 24: The acceleration and the tension in the string are unknown. Use 
T = dL/dt and F = ma to determine these two unknowns. 


Page 292, problem 35: You'll need the result of problem 19 in order to relate the energy and 
angular momentum of a rigidly rotating body. Since this relationship involves a variable raised 
to a power, you can’t just graph the data and get the moment of inertia directly. One way 
to get around this is to manipulate one of the variables to make the graph linear. Here is an 
example of this technique from another context. Suppose you were given a table of the masses, 
m, of cubical pieces of wood, whose sides had various lengths, b. You want to find a best-fit 
value for the density of the wood. The relationship is m = pb®. The graph of m versus b would 
be a curve, and you would not have any easy way to get the density from such a graph. But 
by graphing m versus b®, you can produce a graph that is linear, and whose slope equals the 
density. 
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Hints for Chapter 6 
Page 373, problem 4: How could you change the values of x and ¢ so that the value of y 
would remain the same? What would this represent physically? 


Page 374, problem 12: The answers to the two parts are not the same. 


Page 374, problem 8: (a) The most straightforward approach is to apply the equation 
0?y/Ot? = (T/p)0?y/dx?. Although this equation was developed in the main text in the context 
of a straight string with a curvy wave on it, it works just as well for a circular loop; the left-hand 
side is simply the inward acceleration of any point on the rope. Note, however, that we’ve been 
assuming the string was (at least approximately) parallel to the x axis, which will only be true 
if you choose a specific value of x. You need to get an equation for y in terms of x in order to 
evaluate the right-hand side. 


Hints for Chapter 8 

Page 463, problem 15: The force on the lithium ion is the vector sum of all the forces of 
all the quadrillions of sodium and chlorine atoms, which would obviously be too laborious to 
calculate. Nearly all of these forces, however, are canceled by a force from an ion on the opposite 
side of the lithium. 


Hints for Chapter 10 
Page 588, problem 15: Use the approximation (1+ €)? © 1+ pe, which is valid for small e. 


Page 592, problem 31: The math is messy if you put the origin of your polar coordinates at 
the center of the disk. It comes out much simpler if you put the origin at the edge, right on top 
of the point at which we’re trying to compute the voltage. 


Page 591, problem 26: Since we have t < r, the volume of the membrane is essentially the 
same as if it was unrolled and flattened out, and the field’s magnitude is nearly constant. 


Page 591, problem 25: First find the energy stored in a spherical shell extending from r to 
r+dr, then integrate to find the total energy. 


Hints for Chapter 11 
Page 676, problem 24: A stable system has low energy; energy would have to be added to 
change its configuration. 


Page 679, problem 41: We're ignoring the fact that the light consists of little wavepackets, 
and imagining it as a simple sine wave. But wait, there’s more good news! The energy density 
depends on the squares of the fields, which means the squares of some sine waves. Well, when 
you square a sine wave that varies from —1 to +1, you get a sine wave that goes from 0 to +1, 
and the average value of that sine wave is 1/2. That means you don’t have to do an integral 
like U = [(dU/dV)dV. All you have to do is throw in the appropriate factor of 1/2, and you 
can pretend that the fields have their constant values E and B everywhere. 


Page 680, problem 42: Use Faraday’s law, and choose an Ampérian surface that is a disk of 
radius R sandwiched between the plates. 


Page 682, problem 51: (a) Magnetic fields are created by currents, so once you’ve decided 
how currents behave under time-reversal, you can figure out how magnetic fields behave. 


Answers to Self-Checks 
Answers to Self-Checks for Chapter 0 
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Page 15: If only he has the special powers, then his results can never be reproduced. 


Page 17: They would have had to weigh the rays, or check for a loss of weight in the object 
from which they were have emitted. (For technical reasons, this was not a measurement they 
could actually do, hence the opportunity for disagreement.) 


Page 25: A dictionary might define “strong” as “possessing powerful muscles,” but that’s not 
an operational definition, because it doesn’t say how to measure strength numerically. One 
possible operational definition would be the number of pounds a person can bench press. 


Page 28: A microsecond is 1000 times longer than a nanosecond, so it would seem like 1000 
seconds, or about 20 minutes. 


Page 29: Exponents have to do with multiplication, not addition. The first line should be 100 
times longer than the second, not just twice as long. 


Page 32: The various estimates differ by 5 to 10 million. The CIA’s estimate includes a 
ridiculous number of gratuitous significant figures. Does the CIA understand that every day, 
people in are born in, die in, immigrate to, and emigrate from Nigeria? 


Page 33: (1) 4; (2) 2; (3) 2 


Page 35: 1 yd? x (3 ft/1 yd)? = 9 ft? 
Lyd? 568 ttyl yd)> 97 Te 


Answers to Self-Checks for Chapter 1 

Page 58: The stream has to spread out. When the velocity becomes zero, it seems like the 
cross-sectional area has to become infinite. In reality, this is the point where the water turns 
around and comes back down. The infinity isn’t real; it occurs mathematically because we used 
a simplified model of the the stream of water, assuming, for instance, that the water’s velocity 
is always straight up. 

Page 60: A positive Ax means the object is moving in the same direction as the positive x 
axis. A negative Az means it’s going the opposite direction. 

Page 66: (1) The effect only occurs during blastoff, when their velocity is changing. Once the 
rocket engines stop firing, their velocity stops changing, and they no longer feel any effect. (2) 
It is only an observable effect of your motion relative to the air. 

Page 68: Galilean relativity says that experiments can’t come out differently just because 
they’re performed while in motion. The tilting of the surface tells us the train is accelerating, 
but it doesn’t tell us anything about the train’s velocity at that instant. The person in the train 
might say the bottle’s velocity was zero (but changing), whereas a person working in a reference 
frame attached to the dirt outside says it’s moving; they don’t agree on velocities. They do 
agree on accelerations. The person in the train has to agree that the train is accelerating, since 
otherwise there’s no reason for the funny tilting effect. 

Page 69: Yes. In U.S. currency, for instance, the quantum of money is one cent. 


Answers to Self-Checks for Chapter 2 

Page 90: There are two reasonable possibilities we could imagine — neither of which ends up 
making much sense — if we insist on the straight-line trajectory. (1) If the car has constant 
speed along the line, then in the * frame we see it going straight down at constant speed. It 
makes sense that it goes straight down in the * frame of reference, since in that frame it was 
never moving horizontally, and there’s no reason for it to start. However, it doesn’t make sense 
that it goes down with constant speed, since falling objects are supposed to speed up the whole 
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time they fall. This violates both Galilean relativity and conservation of energy. (2) If it’s 
speeding up and moving along a diagonal line in the original frame, then it might be conserving 
energy in one frame or the other. But if it’s speeding up along a line, then as seen in the original 
frame, both its vertical motion and its horizontal motion must be speeding up. If its horizontal 
velocity is increasing in the original frame, then it can’t be zero and remain zero in the * frame. 
This violates Galilean relativity, since in the * frame the car apparently starts moving sideways 
for no reason. 


Answers to Self-Checks for Chapter 3 
Page 143: By shifting his weight around, he can cause the center of mass not to coincide with 
the geometric center of whe wheel. 


Page 151: (1) This is motion, not force. (2) This is a description of how the sub is able to get 
the water to produce a forward force on it. (3) The sub runs out of energy, not force. 


Page 153: (1) It’s kinetic friction, because her uniform is sliding over the dirt. (2) It’s static 
friction, because even though the two surfaces are moving relative to the landscape, they’re not 
slipping over each other. (3) Only kinetic friction creates heat, as when you rub your hands 
together. If you move your hands up and down together without sliding them across each other, 
no heat is produced by the static friction. 


Page 152: Frictionless (or nearly frictionless) ice can certainly make a normal force, since 
otherwise a hockey puck would sink into the ice. Friction is not possible without a normal force, 
however: we can see this from the equation, or from common sense, e.g. while sliding down a 
rope you don’t get any friction unless you grip the rope. 


Page 154: (1) Normal forces are always perpendicular to the surface of contact, which means 
right or left in this figure. Normal forces are repulsive, so the cliff’s force on the feet is to the 
right, i.e., away from the cliff. (2) Frictional forces are always parallel to the surface of contact, 
which means right or left in this figure. Static frictional forces are in the direction that would 
tend to keep the surfaces from slipping over each other. If the wheel was going to slip, its surface 
would be moving to the left, so the static frictional force on the wheel must be in the direction 
that would prevent this, i.e., to the right. This makes sense, because it is the static frictional 
force that accelerates the dragster. (3) Normal forces are always perpendicular to the surface of 
contact. In this diagram, that means either up and to the left or down and to the right. Normal 
forces are repulsive, so the ball is pushing the bat away from itself. Therefore the ball’s force is 
down and to the right on this diagram. 


Page 170: The dashed lines on the graph are about twice as far apart in the second cycle 
compared to the first, so the amplitude has doubled. For sufficiently small oscillations around 
an equilibrium with x = 0 and U(0) = 0, it’s always a good approximation to take U « 2?, 
so the energy is proportional to the square of the amplitude; this is a general fact about all 
oscillations, provided that the amplitude is small. Since the amplitude doubled, the energy 


quadrupled. 


Page 173: The two graphs start off with the same amplitude, but the solid curve loses amplitude 
more rapidly. For a given time, t, the quantity e~% is apparently smaller for the solid curve, 
meaning that ct is greater. The solid curve has the higher value of c. 


Page 175: A decaying exponential never dies out to zero in any finite amount of time. 
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Page 179: In the expression 
Fim 
my] (w? — w2)? + waw2Q-? 


from page 857, substituting w = wo makes the first term inside the square root vanish, which 
should make the denominator pretty small, thereby producing a pretty big amplitude. In the 
limit of Q = oo, Q~? = 0, so the second term vanishes, and w = we actually produces an infinite 
amplitude. For values of Q that are large but finite, we still expect to get resonance pretty close 
to W = Wo. Setting w = wp in the finite-Q case, the first term vanishes, we can simplify the 
square root, and the result ends up being A « 1/\/Q-? « Q. This is only an approximation, 
because we had to assume early on that Q was large. 


Page 193: F = ma 
Page 194: 


1.5Q -Q 


Answers to Self-Checks for Chapter 4 

Page 254: Torques 1, 2, and 4 all have the same sign, because they are trying to twist the 
wrench clockwise. The sign of 3 is opposite to the signs of 1, 2, and 4. The magnitude of 3 is the 
greatest, since it has a large r and the force is nearly all perpendicular to the wrench. Torques 
1 and 2 are the same because they have the same values of r and F',. Torque 4 is the smallest, 
due to its small r. 


Page 263: One person’s 6-t graph would simply be shifted up or down relative to the others. 
The derivative equals the slope of the tangent line, and this slope isn’t changed when you shift 
the graph, so both people would agree on the angular velocity. 


Page 265: Reversing the direction of w also reverses the direction of motion, and this is reflected 
by the relationship between the plus and minus signs. In the equation for the radial acceleration, 
w is squared, so even if w is negative, the result is positive. This makes sense because the 
acceleration is always inward in circular motion, never outward. 


Page 277: All the rotations around the x axis give w vectors along the positive x axis (thumb 
pointing along positive x), and all the rotations about the y axis have w vectors with positive y 
components. 


Page 280: For example, if we take (A x B), = A,B, — B,A, and reverse the A’s and B’s, we 
get (B x A); = B,A, — A,B, which is just the negative of the original expression. 


Answers to Self-Checks for Chapter 5 

Page 299: Solids can exert shear forces. A solid could be in an equilibrium in which the shear 
forces were canceling the forces due to sideways pressure gradients. For example, if I push on 
a brick wall, it will give by perhaps a millionth of an inch, but it will reach an equilibrium, in 
which the shear forces cancel out the effect of the pressure gradient. 


Page 299: (1) Not valid. The equation only applies to fluids. (2) Valid. The density of the air 
is nearly constant between the top and bottom of the building. (3) Not valid. There is a large 
difference is the density of the air between the top and the bottom of the mountain. (4) Not 
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valid, because g isn’t constant throughout the interior of the earth. (5) Not valid, because the 
air is flowing around the wing. The air is accelerating, so it is not in equilibrium. 


Page 317: Heating the gas at constant pressure requires adding heat to it, which increases its 
entropy. To increase the gas’s pressure while keeping its temperature constant, we would have 
to compress it, which would give it a smaller volume to inhabit, and therefore fewer possible 
positions for each atom. The whole thing has to be proportional to n because entropy is additive. 


Answers to Self-Checks for Chapter 6 
Page 338: The leading edge is moving up, the trailing edge is moving down, and the top of the 
hump is motionless for one instant. 


Page 357: The energy of a wave is usually proportional to the square of the amplitude. Squaring 
a negative number gives a positive result, so the energy is the same 


Page 358: A substance is invisible to sonar if the speed of sound waves in it is the same as in 
water. Reflections occur only at boundaries between media in which the wave speed is different. 


Page 359: No. A material object that loses kinetic energy slows down, but a wave is not a 
material object. The velocity of a wave ordinarily only depends on the medium, not on the 
amplitude. The speed of soft sound, for example, is the same as the speed of loud sound. 


Page 366: No. To get the best possible interference, the thickness of the coating must be such 
that the second reflected wave train lags behind the first by an integer number of wavelengths. 
Optimal performance can therefore only be produced for one specific color of light. The typical 
greenish color of the coatings shows that it does the worst job for green light. 


Page 367: The period is the time required to travel a distance 2L at speed v, T = 2L/v. The 
frequency is f = 1/T = v/2L. 


Page 371: The wave pattern will look like this: & Three quarters of a wavelength fit in the 
tube, so the wavelength is three times shorter than that of the lowest-frequency mode, in which 
one quarter of a wave fits. Since the wavelength is smaller by a factor of three, the frequency is 
three times higher. Instead of fo, 2fo, 3fo, 4fo, ..., the pattern of wave frequencies of this air 
column goes fo, 3fo, 5fo, Tho, --. 


Answers to Self-Checks for Chapter 7 
Page 386: At v =0, we get y=1,sot=T. There is no time distortion unless the two frames 
of reference are in relative motion. 


Page 393: The total momentum is zero before the collision. After the collision, the two momenta 
have reversed their directions, but they still cancel. Neither object has changed its kinetic energy, 
so the total energy before and after the collision is also the same. 


Page 399: At v = 0, we have y = 1, so the mass-energy is mc? as claimed. As v approaches c, 


approaches infinity, so the mass energy becomes infinite as well. 


Answers to Self-Checks for Chapter 8 

Page 413: Either type can be involved in either an attraction or a repulsion. A positive charge 
could be involved in either an attraction (with a negative charge) or a repulsion (with another 
positive), and a negative could participate in either an attraction (with a positive) or a repulsion 
(with a negative). 


Page 414: It wouldn’t make any difference. The roles of the positive and negative charges in 
the paper would be reversed, but there would still be a net attraction. 
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Page 424: Yes. In U.S. currency, the quantum of money is the penny. 


Page 445: Thomson was accelerating electrons, which are negatively charged. This apparatus 
is supposed to accelerated atoms with one electron stripped off, which have positive net charge. 
In both cases, a particle that is between the plates should be attracted by the forward plate and 
repelled by the plate behind it. 


Page 453: The hydrogen-1 nucleus is simple a proton. The binding energy is the energy required 
to tear a nucleus apart, but for a nucleus this simple there is nothing to tear apart. 


Answers to Self-Checks for Chapter 9 
Page 479: The large amount of power means a high rate of conversion of the battery’s chemical 
energy into heat. The battery will quickly use up all its energy, i.e. “burn out.” 


Answers to Self-Checks for Chapter 10 
Page 517: The reasoning is exactly analogous to that used in example 1 on page 515 to derive 
an equation for the gravitational field of the earth. The field is F/q = (kKQa:/r?)/q@ = kQ/r?. 


Page 524: 


od e 
~ dg \r 


Page 525: (a) The voltage (height) increases as you move to the east or north. If we let the 
positive x direction be east, and choose positive y to be north, then dV/da and dV /dy are both 
positive. This means that E, and E, are both negative, which makes sense, since the water is 
flowing in the negative x and y directions (south and west). 

(b) The electric fields are all pointing away from the higher ground. If this was an electrical 
map, there would have to be a large concentration of charge all along the top of the ridge, and 
especially at the mountain peak near the south end. 


Page 538: (a) The energy density depends on E- E, which equals E? + EF? + E?. 
(b) Since FE, is squared, reversing its sign has no effect on the energy density. This makes sense, 
because otherwise we’d be saying that the positive and negative x axes in space were somehow 
physically different in their behavior, which would violate the symmetry of space. 


Page 538: 


Page 547: Yes. The mass has the same kinetic energy regardless of which direction it’s moving. 
Friction coverts mechanical energy into heat at the same rate whether the mass is sliding to the 
right or to the left. The spring has an equilibrium length, and energy can be stored in it either 
by compressing it (x < 0) or stretching it (x > 0). 
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Page 548: Velocity, v, is the derivative of position, x, with respect to time. This is exactly 
analogous to I = dq/dt. 


Page 563: The impedance depends on the frequency at which the capacitor is being driven. It 
isn’t just a single value for a particular capacitor. 


Page 558: Say we’re looking for u = \/Z, i.e., we want a number wu that, multiplied by itself, 
equals z. Multiplication multiplies the magnitudes, so the magnitude of u can be found by 
taking the square root of the magnitude of z. Since multiplication also adds the arguments 
of the numbers, squaring a number doubles its argument. Therefore we can simply divide the 
argument of z by two to find the argument of u. This results in one of the square roots of z. 
There is another one, which is —u, since (—u)? is the same as u?. This may seem a little odd: 
if w was chosen so that doubling its argument gave the argument of z, then how can the same 
be true for —u? Well for example, suppose the argument of z is 4°. Then argu = 2°, and 
arg(—u) = 182°. Doubling 182 gives 364, which is actually a synonym for 4 degrees. 


Page 562: Only cos(6t — 4) can be represented by a complex number. Although the graph of 
cos? t does have a sinusoidal shape, it varies between 0 and 1, rather than —1 and 1, and there 
is no way to represent that using complex numbers. The function tant doesn’t even have a 
sinusoidal shape. 


Page 575: The quantity 47kq;n, is now negative, so we'd better get a negative flux on the other 
side of Gauss’ theorem. We do, because each field vector E; is inward, while the corresponding 
area vector, Aj, is outward. Vectors in opposite directions make negative dot products. 


Answers to Self-Checks for Chapter 11 

Page 612: For instance, imagine a small sphere around the negative charge, which we would 
sketch on the two-dimensional paper as a circle. The field points inward at every point on 
the sphere, so all the contributions to the flux are negative. There is no cancellation, and the 
total flux is negative, which is consistent with Gauss’ law, since the sphere encloses a negative 
charge. Copying the same surface onto the field of the bar magnet, however, we find that there is 
inward flux on the top and outward flux on the bottom, where the surface is inside the magnet. 
According to Gauss’ law for magnetism, these cancel exactly, which is plausible based on the 
figure. 


Page 610: From the top panel of the figure, where the magnetic field is turned off, we can 
see that the beam leaves the cathode traveling upward, so in the bottom figure the electrons 
must be circlng in the counterclockwise direction. To produce circular motion, the force must be 
towards the center of the circle. We can arbitrarily pick a point on the circle at which to analyze 
the vectors — let’s pick the right-hand side. At this point, the velocity vector of the electrons 
is upward. Since the electrons are negatively charged, the force gv x B is given by —v x B, 
not +v x B. Circular orbits are produced when the motion is in the plane perpendicular to 
the field, so the field must be either into or out of the page. If the field was into the page, the 
right-hand rule would give v x B to the left, which is towards the center, but the force would 
be in the direction of —v x B, which would be outwards. The field must be out of the page. 


Page 616: Plugging z = 0 into the equation gives B, = 4kI/c*h. This is simply twice the field 
of a single wire at a distance h. At this location, the fields contributed by the two wires are 
parallel, so vector addition simply gives a vector twice as strong. 


Page 628: The circulation around the Ampérian surface we used was counterclockwise, since 
the field on the bottom was to the right. Applying the right-hand rule, the current [tnrough must 
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have been out of the page at the top of the solenoid, and into the page at the bottom. 


Page 628: The quantity @ came in because we set 7 = NI/¢. Based on that, it’s clear that @ 
represents the length of the solenoid, not the length of the wire. 


Page 629: Doubling the radius of the solenoid would mean that every distance in the problem 
would be doubled, which would tend to make the fields weaker, since fields fall off with distance. 
However, doubling the radius would also mean that we had twice as much wire, and therefore 
twice as many moving charges to create magnetic fields. Since the magnetic field of a wire falls 
off like 1/r, it’s not surprising that the first effect amounts to exactly a factor of 1/2, which is 
exactly enough to cancel out the factor of 2 from the second effect. 


Page 641: Unless the engine is already turning over, the permanent magnet isn’t spinning, so 
there is no change in the magnetic field. Only a changing magnetic field creates an induced 
electric field. 


Page 646: Let’s get all the electrical units in terms of Teslas. Electric field units can be 
expressed as T- m/s. The circulation of the electric field has units of electric field multiplied by 
distance, or T-m?/s. On the right side, the derivative 0B/0t has units of T/s, and multiplying 
this my area gives units of T - m?/s, just like on the left side. 


Page 664: An (idealized) battery is a circuit element that always maintains the same voltage 
difference across itself, so by the loop rule, the voltage difference across the capacitor must 
remain unchanged, even while the dielectric is being withdrawn. The bound charges on the 
surfaces of the dielectric have been attracting the free charges in the plates, causing them to 
charge up more than they ordinarily would have. As the dielectric is withdrawn, the capacitor 
will be partially discharged, and we will observe a current in the ammeter. Since the dielectric 
is attracted to the plates, positive work is done in extracting it, indicating that there must be an 
increase in the electrical energy stored in the capacitor. This may seem paradoxical, since the 
energy stored in a capacitor is (1/2)CV”, and we are decreasing the capacitance. However, the 
energy (1/2)CV? is calculated in terms of the work required to deposit the free charge on the 
plates. In addition to this energy, there is also energy stored in the dielectric itself. By moving 
its bound charges farther away from the free charges in the plates, to which they are attracted, 
we have increased their electrical energy. This energy of the bound charges is inaccessible to 
the electric circuit. 


Answers to Self-Checks for Chapter 12 
Page 698: Only 1 is correct. If you draw the normal that bisects the solid ray, it also bisects 
the dashed ray. 


Page 702: You should have found from your ray diagram that an image is still formed, and it 
has simply moved down the same distance as the real face. However, this new image would only 
be visible from high up, and the person can no longer see his own image. 


Page 707: Increasing the distance from the face to the mirror has decreased the distance from 
the image to the mirror. This is the opposite of what happened with the virtual image. 


Page 716: At the top of the graph, d; approaches infinity when d, approaches f. Interpretation: 
the rays just barely converge to the right of the mirror. 


On the far right, d; approaches f as dy approaches infinity; this is the definition of the focal 
length. 


At the bottom, d; approaches negative infinity when d, approaches f from the other side. 
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Interpretation: the rays don’t quite converge on the right side of the mirror, so they appear to 
have come from a virtual image point very far to the left of the mirror. 


Page 725: (1) If n1 and ng are equal, Snell’s law becomes sin 6; = sin 02, which implies 0; = 02, 
since both angles are between 0 and 90°. The graph would be a straight line along the diagonal 
of the graph. (2) The graph is farthest from the diagonal when the angles are large, i.e., when 
the ray strikes the interface at a grazing angle. 


Page 730: (1) In 1, the rays cross the image, so it’s real. In 2, the rays only appear to have 
come from the image point, so the image is virtual. (2) A rays is always closer to the normal in 
the medium with the higher index of refraction. The first left turn makes the ray closer to the 
normal, which is what should happen in glass. The second left turn makes the ray farther from 
the normal, and that’s what should happen in air. (3) Take the topmost ray as an example. It 
will still take two right turns, but since it’s entering the lens at a steeper angle, it will also leave 
at a steeper angle. Tracing backward to the image, the steeper lines will meet closer to the lens. 


Page 736: It would have to have a wavelength on the order of centimeters or meters, the same 
distance scale as that of your body. These would be microwaves or radio waves. (This effect can 
easily be noticed when a person affects a TV’s reception by standing near the antenna.) None 
of this contradicts the correspondence principle, which only states that the wave model must 
agree with the ray model when the ray model is applicable. The ray model is not applicable 
here because A/d is on the order of 1. 


Page 738: At this point, both waves would have traveled nine and a half wavelengths. They 
would both be at a negative extreme, so there would be constructive interference. 


Page 742: Judging by the distance from one bright wave crest to the next, the wavelength 
appears to be about 2/3 or 3/4 as great as the width of the slit. 


Page 743: Since the wavelengths of radio waves are thousands of times longer, diffraction causes 
the resolution of a radio telescope to be thousands of times worse, all other things being equal. 
(To compensate for the wavelength, it’s desirable to make the telescope very large, as in figure 
y on page 743.) 


Answers to Self-Checks for Chapter 13 

Page 776: (1) Most people would think they were positively correlated, but it’s possible that 
they’re independent. (2) These must be independent, since there is no possible physical mecha- 
nism that could make one have any effect on the other. (3) These cannot be independent, since 
dying today guarantees that you won’t die tomorrow. 


Page 778: The area under the curve from 130 to 135 cm is about 3/4 of a rectangle. The area 
from 135 to 140 cm is about 1.5 rectangles. The number of people in the second range is about 
twice as much. We could have converted these to actual probabilities (1 rectangle = 5 cm x 
0.005 cem~! = 0.025), but that would have been pointless because we were just going to compare 
the two areas. 


Page 791: The axes of the graph are frequency and photon energy, so its slope is Planck’s 
constant. It doesn’t matter if you graph eAV rather than W + eAV, because that only changes 
the y-intercept, not the slope. 


Page 803: Wavelength is inversely proportional to momentum, so to produce a large wavelength 
we would need to use electrons with very small momenta and energies. (In practical terms, this 
isn’t very easy to do, since ripping an electron out of an object is a violent process, and it’s not 
so easy to calm the electrons down afterward.) 
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Page 812: Under the ordinary circumstances of life, the accuracy with which we can measure 
position and momentum of an object doesn’t result in a value of ApAzx that is anywhere near 
the tiny order of magnitude of Planck’s constant. We run up against the ordinary limitations 
on the accuracy of our measuring techniques long before the uncertainty principle becomes an 
issue. 


Page 817: No. The equation K = p?/2m is nonrelativistic, so it can’t be applied to an electron 
moving at relativistic speeds. Photons always move at relativistic speeds, so it can’t be applied 
to them either. 


Page 819: Dividing by Planck’s constant, a small number, gives a large negative result inside 
the exponential, so the probability will be very small. 


Page 833: The original argument was that a kink would have a zero wavelength, which would 
correspond to an infinite momentum and an infinite kinetic energy, and that would violate 
conservation of energy. But the kink in this example occurs at r = 0, which is right on top 
of the proton, where the electrical energy —ke?/r is infinite and negative. Since the electrical 
energy is negative and infinite, we’re actually required to have an infinite positive kinetic energy 
in order to come up with a total that conserves energy. 


Page 825: If you trace a circle going around the center, you run into a series of eight complete 
wavelengths. Its angular momentum is 8h. 


Page 828: n = 3, =0, ¢, = 0: one state; n = 3, £= 1, 2, = —1, 0, or 1: three states; n = 3, 
£= 2, ¢, = —2, —1, 0, 1, or 2: five states 


Answers 
Answers for Chapter 2 


Page 124, problem 37: kK = ki ko/(ky + kz) = 1/(1/k4 + 1/k2) 


Answers for Chapter 3 
Page 216, problem 5: After the collision it is moving at 1/3 of its initial speed, in the same 
direction it was initially going (it “follows through”). 


Page 225, problem 41: Q = 1//2 
Page 225, problem 43: (a) 7 x 10~® radians, or about 4 x 10~° degrees. 
Page 227, problem 51: (a) R = (2v2/g)sin@cos@ (c) 45 ° 


Page 227, problem 52: (a) The optimal angle is about 40°, and the resulting range is about 
124 meters, which is about the length of a home run. (b) It goes about 9 meters farther. For 
comparison with reality, the stadium’s web site claims a home run goes about 11 meters farther 
there than in a sea-level stadium. 


Answers for Chapter 5 
Page 332, problem 7: (c) n = 16 


Page 333, problem 9: (a) ~ 2— 10% (b) 5% (c) The high end for the body’s actual efficiency 
is higher than the limit imposed by the laws of thermodynamics. However, the high end of 
the 1-5 watt range quoted in the problem probably includes large people who aren’t just lying 
around. Still, it’s impressive that the human body comes so close to the thermodynamic limit. 


Page 333, problem 10: (a) Looking up the relevant density for air, and converting everything 
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to mks, we get a frequency of 730 Hz. This is on the right order of magnitude, which is 
promising, considering the crudeness of the approximation. (b) This brings the result down to 
400 Hz, which is amazingly close to the observed frequency of 300 Hz. 


Answers for Chapter 6 
Page 374, problem 9: (b) g/2 


Page 375, problem 13: Check: The actual length of a flute is about 66 cm from the tip of 
the mouthpiece to the end of the bell. 


Page 374, problem 8: (a) T = jw? r? 
Page 376, problem 17: (a) f = 4a/(1 +a)? (b) v2 = 0103 


Answers for Chapter 10 
Page 590, problem 22: (a) EF = 2k\/R. 


Answers for Chapter 11 
Page 671, problem 5: (a) I = Av. 


Page 671, problem 6: (b) 2kI, [nL /c?R. 


Answers for Chapter 13 
Page 848, problem 40: about 10710 


Solutions 
Solutions for Chapter 0 


Page 47, problem 6: 


102 e y 1073 kg 


= 1.34 1074 kg 
1 mg lg 


134 mg x 
Page 47, problem 8: (a) Let’s do 10.0 g and 1000 g. The arithmetic mean is 505 grams. It 
comes out to be 0.505 kg, which is consistent. (b) The geometric mean comes out to be 100 
g or 0.1 kg, which is consistent. (c) If we multiply meters by meters, we get square meters. 
Multiplying grams by grams should give square grams! This sounds strange, but it makes sense. 
Taking the square root of square grams (g”) gives grams again. (d) No. The superduper mean 
of two quantities with units of grams wouldn’t even be something with units of grams! Related 
to this shortcoming is the fact that the superduper mean would fail the kind of consistency test 
carried out in the first two parts of the problem. 


Page 48, problem 12: (a) They’re all defined in terms of the ratio of side of a triangle to 
another. For instance, the tangent is the length of the opposite side over the length of the 
adjacent side. Dividing meters by meters gives a unitless result, so the tangent, as well as the 
other trig functions, is unitless. (b) The tangent function gives a unitless result, so the units on 
the right-hand side had better cancel out. They do, because the top of the fraction has units of 
meters squared, and so does the bottom. 


Page 49, problem 17: The problem requires us to relate a and t, for a fixed value of the 
distance Ax. To find a relationship among these three variables, we start with d?x /dt? = a, and 
integrate twice to find Ax = sat. This tells us that for a fixed value of Ax, we have t x 1/\/a. 
Decreasing a by a factor of 3 means that ¢ will increase by a factor of 3 = 1.7. (The given 
piece of data, 3, only has one sig fig, but rounding the final result off to one sig fig, giving 2 
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rather then 1.7, would be a little too severe. As discussed in section 0.1.10, sig figs are only a 
rule of thumb, and when in doubt, you can change the input data to see how much the output 
would have changed. The ratio of the gravitational fields on Earth and Mars must be in the 
range from 2.5 to 3.5, since otherwise the given data would not have been rounded off to 3. 
Using this range of inputs, the possible range of values for the final result becomes 1.6 to 1.9. 
The final digit in the 1.7 is therefore a little uncertain, but it’s not complete garbage. It carries 
useful information, and should not be thrown out.) 


Page 50, problem 19: (a) Solving for Ax = gat? for a, we find a = 2Aa/t? = 5.51 m/s?. 
(b) v = V2aAz = 66.6 m/s. (c) The actual car’s final velocity is less than that of the idealized 
constant-acceleration car. If the real car and the idealized car covered the quarter mile in the 
same time but the real car was moving more slowly at the end than the idealized one, the real 
car must have been going faster than the idealized car at the beginning of the race. The real car 
apparently has a greater acceleration at the beginning, and less acceleration at the end. This 
make sense, because every car has some maximum speed, which is the speed beyond which it 
cannot accelerate. 


Page 51, problem 31: 


Page 51, problem 32: The bigger scope has a diameter that’s ten times greater. Area scales 
as the square of the linear dimensions, so its light-gathering power is a hundred times greater 
(10 x 10). 


Page 51, problem 33: Since they differ by two steps on the Richter scale, the energy of the 
bigger quake is 10000 times greater. The wave forms a hemisphere, and the surface area of the 
hemisphere over which the energy is spread is proportional to the square of its radius. If the 
amount of vibration was the same, then the surface areas much be in the ratio of 10000:1, which 
means that the ratio of the radii is 100:1. 


Page 52, problem 38: The cone of mixed gin and vermouth is the same shape as the cone 
of vermouth, but its linear dimensions are doubled, so its volume is 8 times greater. The ratio 
of gin to vermouth is 7 to 1. 


Page 52, problem 40: Scaling down the linear dimensions by a factor of 1/10 reduces the 
volume by a factor of (1/10)? = 1/1000, so if the whole cube is a liter, each small one is one 
milliliter. 


Page 53, problem 41: Let’s estimate the Great Wall’s volume, and then figure out how many 
bricks that would represent. The wall is famous because it covers pretty much all of China’s 
northern border, so let’s say it’s 1000 km long. From pictures, it looks like it’s about 10 m high 
and 10 m wide, so the total volume would be 10° m x 10 m x 10 m = 10° m®. If a single brick 
has a volume of 1 liter, or 107? m?, then this represents about 10!! bricks. If one person can 
lay 10 bricks in an hour (taking into account all the preparation, etc.), then this would be 10!° 
man-hours. 


Page 53, problem 44: Directly guessing the number of jelly beans would be like guessing 
volume directly. That would be a mistake. Instead, we start by estimating the linear dimensions, 
in units of beans. The contents of the jar look like they’re about 10 beans deep. Although the 
jar is a cylinder, its exact geometrical shape doesn’t really matter for the purposes of our order- 
of-magnitude estimate. Let’s pretend it’s a rectangular jar. The horizontal dimensions are also 
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something like 10 beans, so it looks like the jar has about 10 x 10 x 10 or ~ 10° beans inside. 


Solutions for Chapter 1 

Page 71, problem 12: To the person riding the moving bike, bug A is simply going in circles. 
The only difference between the motions of the two wheels is that one is traveling through space, 
but motion is relative, so this doesn’t have any effect on the bugs. It’s equally hard for each of 
them. 


Solutions for Chapter 2 

Page 118, problem 1: (a) The energy stored in the gasoline is being changed into heat via 
frictional heating, and also probably into sound and into energy of water waves. Note that the 
kinetic energy of the propeller and the boat are not changing, so they are not involved in the 
energy transformation. (b) The crusing speed would be greater by a factor of the cube root of 
2, or about a 26% increase. 


Page 118, problem 2: We don’t have actual masses and velocities to plug in to the equa- 
tion, but that’s OK. We just have to reason in terms of ratios and proportionalities. Ki- 
netic energy is proportional to mass and to the square of velocity, so B’s kinetic energy equals 
(13.4 J)(3.77)/(2.34)? = 9.23 J. 


Page 118, problem 3: Room temperature is about 20°C. The fraction of the power that 
actually goes into heating the water is 


(250 g) /(0.24 J/g°C) x (100°C-20°C) /126 s 


= 0.53 
1.25 x 103 J/s 


So roughly half of the energy is wasted. The wasted energy might be in several forms: heating 
of the cup, heating of the oven itself, or leakage of microwaves from the oven. 


Page 118, problem 5: 


Etotal,i = Etotal,f 
U; + heat; = Us + heat r + Ky 


1 
xe = U; — Us + heat; — heats 
= —AU — Aheat 
i (= — ate) 
P= [72 
m 
= 6.4 m/s 


Solutions for Chapter 3 

Page 216, problem 4: A conservation law is about addition: it says that when you add up 
a certain thing, the total always stays the same. Funkosity would violate the additive nature of 
conservation laws, because a two-kilogram mass would have twice as much funkosity as a pair 
of one-kilogram masses moving at the same speed. 


Page 217, problem 12: Momentum is a vector. The total momentum of the molecules 
is always zero, since the momenta in different directions cancal out on the average. Cooling 
changes individual molecular momenta, but not the total. 
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Page 218, problem 15: a = Av/At, and also a = F'/m, so 


Av 
re 
mAv 
F 
(1000 kg)(50 m/s — 20 m/s) 
3000 N 


At = 


=10s 


Page 220, problem 23: (a) This is a measure of the box’s resistance to a change in its state 
of motion, so it measures the box’s mass. The experiment would come out the same in lunar 
gravity. 

(b) This is a measure of how much gravitational force it feels, so it’s a measure of weight. In 
lunar gravity, the box would make a softer sound when it hit. 

(c) As in part a, this is a measure of its resistance to a change in its state of motion: its mass. 
Gravity isn’t involved at all. 


Page 222, problem 34: (a) The swimmer’s acceleration is caused by the water’s force on the 
swimmer, and the swimmer makes a backward force on the water, which accelerates the water 
backward. (b) The club’s normal force on the ball accelerates the ball, and the ball makes a 
backward normal force on the club, which decelerates the club. (c) The bowstring’s normal force 
accelerates the arrow, and the arrow also makes a backward normal force on the string. This 
force on the string causes the string to accelerate less rapidly than it would if the bow’s force 
was the only one acting on it. (d) The tracks’ backward frictional force slows the locomotive 
down. The locomotive’s forward frictional force causes the whole planet earth to accelerate by 
a tiny amount, which is too small to measure because the earth’s mass is so great. 


Page 223, problem 37: (a) Spring constants in parallel add, so the spring constant has to 
be proportional to the cross-sectional area. Two springs in series give half the spring constant, 
three springs in series give 1/3, and so on, so the spring constant has to be inversely proportional 
to the length. Summarizing, we have k « A/L. 

(b) With the Young’s modulus, we have k = (A/L)E.The spring constant has units of N/m, so 
the units of E would have to be N/m?. 


Page 226, problem 44: By conservation of momentum, the total momenta of the pieces after 
the explosion is the same as the momentum of the firework before the explosion. However, there 
is no law of conservation of kinetic energy, only a law of conservation of energy. The chemical 
energy in the gunpowder is converted into heat and kinetic energy when it explodes. All we can 
say about the kinetic energy of the pieces is that their total is greater than the kinetic energy 
before the explosion. 


Page 226, problem 45: Let m be the mass of the little puck and M = 2.3m be the mass 
of the big one. All we need to do is find the direction of the total momentum vector before 
the collision, because the total momentum vector is the same after the collision. Given the two 
components of the momentum vector py = mv and py = Mv, the direction of the vector is 
tan~!(py/px) = 23° counterclockwise from the big puck’s original direction of motion. 


Page 229, problem 62: We want to find out about the velocity vector vgq of the bullet 
relative to the ground, so we need to add Annie’s velocity relative to the ground v4g to the 
bullet’s velocity vector vga relative to her. Letting the positive x axis be east and y north, we 
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have 


UBA,x = (140 mi/hr) cos 45° 


= 100 mi/hr 
UBAy = (140 mi/hr) sin 45° 
= 100 mi/hr 
and 
VAG a = 0 


VAG,y = 30 mi/hr 
The bullet’s velocity relative to the ground therefore has components 


UBG,« = 100 mi/hr 
and 


UBGy = 130 mi/hr 


Its speed on impact with the animal is the magnitude of this vector 


lvec| = (100 mi/hr)? + (130 mi/hr)? 
= 160 mi/hr 


(rounded off to two significant figures). 


Page 229, problem 63: Since its velocity vector is constant, it has zero acceleration, and the 
sum of the force vectors acting on it must be zero. There are three forces acting on the plane: 
thrust, lift, and gravity. We are given the first two, and if we can find the third we can infer the 
plane’s mass. The sum of the y components of the forces is zero, so 


0= Fthrust,y + Pitty 30 Poy 
= IF thrust! sin @ + [Fuse cos 8 — mg 


The mass is 


m= (lFehrust| sin# + [Fu sel COs 0)/g 
= 6.9 x 10* kg 


Page 229, problem 64: (a) Since the wagon has no acceleration, the total forces in both the 
x and y directions must be zero. There are three forces acting on the wagon: T’, F,, and the 
normal force from the ground, F,,. If we pick a coordinate system with x being horizontal and y 
vertical, then the angles of these forces measured counterclockwise from the x axis are 90° — ¢, 
270°, and 90° + @, respectively. We have 


Fr, total = T cos(90° — ¢) + Fy cos(270°) + Frcos(90° + A) 
Fy total = T sin(90° — ¢) + Fy sin(270°) + F,,sin(90° + 6) , 
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which simplifies to 


0=T sing — F, sind 
0=Tcos¢@— Fy, + F, cos. 


The normal force is a quantity that we are not given and do not with to find, so we should 
choose it to eliminate. Solving the first equation for F;, = (sin¢/sin@)T, we eliminate F,, from 
the second equation, 
0=Tcos¢— Fy + T sin ¢cos@/sin@ 
and solve for T, finding 
= Fg 
~ cos@+sin¢cos6/sin 8 


Multiplying both the top and the bottom of the fraction by sin 6, and using the trig identity for 
sin(@ + ¢) gives the desired result, 


(b) The case of ¢ = 0, i.e. pulling straight up on the wagon, results in T = Fy: we simply 
support the wagon and it glides up the slope like a chair-lift on a ski slope. In the case of 
o = 180° — 0, T becomes infinite. Physically this is because we are pulling directly into the 
ground, so no amount of force will suffice. 


Page 230, problem 65: (a) If there was no friction, the angle of repose would be zero, so the 
coefficient of static friction, w;, will definitely matter. We also make up symbols 0, m and g for 
the angle of the slope, the mass of the object, and the acceleration of gravity. The forces form a 
triangle just like the one in example 65 on page 201, but instead of a force applied by an external 
object, we have static friction, which is less than u,F,. As in that example, F’, = mgsin@, and 
FP, < psFn, so 

mg sind < UsFy 


From the same triangle, we have F,, = mgcos6@, so 
mg sin@ < u,smgcosé 


Rearranging, 
6 < tan! ps 


(b) Both m and g canceled out, so the angle of repose would be the same on an asteroid. 


Solutions for Chapter 4 

Page 286, problem 1: The pliers are not moving, so their angular momentum remains 
constant at zero, and the total torque on them must be zero. Not only that, but each half of 
the pliers must have zero total torque on it. This tells us that the magnitude of the torque at 
one end must be the same as that at the other end. The distance from the axis to the nut is 
about 2.5 cm, and the distance from the axis to the centers of the palm and fingers are about 8 
cm. The angles are close enough to 90°that we can pretend they’re 90 degrees, considering the 
rough nature of the other assumptions and measurements. The result is (300 N)(2.5 cm)=(F’)(8 
cm), or F=90 N. 


Page 292, problem 37: The foot of the rod is moving in a circle relative to the center of the 
rod, with speed v = 1b/T, and acceleration v?/(b/2) = (x?/8)g. This acceleration is initially 
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upward, and is greater in magnitude than g, so the foot of the rod will lift off without dragging. 
We could also worry about whether the foot of the rod would make contact with the floor again 
before the rod finishes up flat on its back. This is a question that can be settled by graphing, 
or simply by inspection of figure i on page 274. The key here is that the two parts of the 
acceleration are both independent of m and )b, so the result is univeral, and it does suffice to 
check a graph in a single example. In practical terms, this tells us something about how difficult 
the trick is to do. Because 77/8 = 1.23 isn’t much greater than unity, a hit that is just a little 
too weak (by a factor of 1.23!/2 = 1.11) will cause a fairly obvious qualitative change in the 
results. This is easily observed if you try it a few times with a pencil. 


Solutions for Chapter 9 
Page 498, problem 1: At = Aq/I = e/I = 0.16 ps 


Page 500, problem 12: In series, they give 11 kQ. In parallel, they give (1/1 kQ + 
1/10 kQ)-' = 0.9 kQ. 


Page 502, problem 25: The actual shape is irrelevant; all we care about is what’s connected to 
what. Therefore, we can draw the circuit flattened into a plane. Every vertex of the tetrahedron 
is adjacent to every other vertex, so any two vertices to which we connect will give the same 
resistance. Picking two arbitrarily, we have this: 


This is unfortunately a circuit that cannot be converted into parallel and series parts, and 
that’s what makes this a hard problem! However, we can recognize that by symmetry, there 
is zero current in the resistor marked with an asterisk. Eliminating this one, we recognize the 
whole arrangement as a triple parallel circuit consisting of resistances R, 2R, and 2R. The 
resulting resistance is R/2. 


Page 504, problem 29: (a) Conservation of energy gives 


Us, =Up+ Kp 
Kp=U,—UpB 
1 
zw =eAV 


/2eAV 
— 
m 


(b) Plugging in numbers, we get 5.9 x 107 m/s. This is about 20% of the speed of light, so the 
nonrelativistic assumption was good to at least a rough approximation. 


Page 505, problem 32: It’s much more practical to measure voltage differences. To measure 
a current, you have to break the circuit somewhere and insert the meter there, but it’s not 
possible to disconnect the circuits sealed inside the board. 


Solutions for Chapter 10 
Page 588, problem 16: By symmetry, the field is always directly toward or away from the 
center. We can therefore calculate it along the x axis, where r = x, and the result will be valid 
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for any location at that distance from the center. 


dV 
Pte 
OS one Sat 
=o ae (ate) 
aug ie og le 


At small x, near the proton, the first term dominates, and the exponential is essentially 1, so we 
have E x x~?, as we expect from the Coulomb force law. At large x, the second term dominates, 
and the field approaches zero faster than an exponential. 


Page 596, problem 56: 
sin( (a+b) = e (a+b) __ ear) ) /2i 


( 
= (ee - ohe-e) /2% 
[(c 
[( 


osa +isina)(cosb + isin b) — (cosa — isina)(cosb — isin b)} /2i 


cosa + isina)(cos b + isin b) — (cosa — isina)(cos b — isin b)] /2% 


cosasinb + sinacosb 


By a similar computation, we find cos(a + b) = cosacosb — sinasinb. 


Page 596, problem 57: If z* = 1, then we know that |z| = 1, since cubing z cubes its 
magnitude. Cubing z triples its argument, so the argument of z must be a number that, when 
tripled, is equivalent to an angle of zero. There are three possibilities: 0 x 3 = 0, (27/3) x3 = 27, 
and (47/3) x 3 = 4a. (Other possibilities, such as (3272/3), are equivalent to one of these.) The 
solutions are: 


z=1, es: Ati /3 


Page 596, problem 58: We can think of this as a polynomial in x or a polynomial in y — 
their roles are symmetric. Let’s call x the variable. By the fundamental theorem of algebra, it 
must be possible to factor it into a product of three linear factors, if the coefficients are allowed 
to be complex. Each of these factors causes the product to be zero for a certain value of x. But 
the condition for the expression to be zero is x? = y®, which basically means that the ratio of x 
to y must be a third root of 1. The problem, then, boils down to finding the three third roots 
of 1, as in problem 57. Using the result of that problem, we find that there are zeroes when 2/y 
equals 1, e?7*/3, and e47*/3. This tells us that the factorization is («—y)(x—e?7*/3y) (a—e47/3y). 


The second part of the problem asks us to factorize as much as possible using real coefficients. 
Our only hope of doing this is to multiply out the two factors that involve complex coefficients, 
and see if they produce something real. In fact, we can anticipate that it will work, because 
the coefficients are complex conjugates of one another, and when a quadratic has two complex 
roots, they are conjugates. The result is (2 — y)(a? + ry + y?). 


Solutions for Chapter 11 

Page 682, problem 51: (a) For a material object, p = mv. The velocity vector reverses 
itself, but mass is still positive, so the momentum vector is reversed. 

(b) In the forward-time universe, conservation of momentum is p1; + P2,; = P1,f + P2,f- In the 
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backward-time universe, all the momenta are reversed, but that just negates both sides of the 
equation, so momentum is still conserved. 


Page 684, problem 54: Note that in the Biot-Savart law, the variable r is defined as a vector 
that points from the current to the point at which the field is being calculated, whereas in the 
polar coordinates used to express the equation of the spiral, the vector more naturally points 
the opposite way. This requires some fiddling with signs, which I’ll suppress, and simply identify 


dé with dr. 
_ kl fdéxr 
2 r3 


The vector dr has components dz = w(cos@ — @sin@) and dy = w(sin@ + 6cos@). Evaluating 
the vector cross product, and substituting 6/w for r, we find 


B= kI ce 


cw 98 
_ pao 
ew 
_ KL, b 
— ew Oy 
kI b 
== In* 
Cw a 


Solutions for Chapter 12 

Page 749, problem 16: See the ray diagram below. Decreasing 0, decreases 6;, so the 
equation 67 = +6; + +0, must have opposite signs on the right. Since @, is bigger than 6;, the 
only way to get a positive 0; is if the signs are 0¢ = —0; + 05. This gives 1/f = —1/d; + 1/do. 


Page 749, problem 19: (a) The object distance is less than the focal length, so the image is 
virtual: because the object is so close, the cone of rays is diverging too strongly for the mirror to 
bring it back to a focus. (b) At an object distance of 30 cm, it’s clearly going to be real. With 
the object distance of 20 cm, we’re right at the crossing-point between real and virtual. For 
this object position, the reflected rays will be parallel. We could consider this to be an image 
at infinity. (c),(d) A diverging mirror can only make virtual images. 


Page 753, problem 39: Since d, is much greater than d;, the lens-film distance d; is es- 
sentially the same as f. (a) Splitting the triangle inside the camera into two right triangles, 
straightforward trigonometry gives 


‘W 
6 = 2tan- | — 
an of 
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for the field of view. This comes out to be 39° and 64° for the two lenses. (b) For small angles, 
the tangent is approximately the same as the angle itself, provided we measure everything in 
radians. The equation above then simplifies to 


w 

Fa 

The results for the two lenses are .70 rad = 40°, and 1.25 rad = 72°. This is a decent approxi- 
mation. 


— 


(c) With the 28-mm lens, which is closer to the film, the entire field of view we had with the 
50-mm lens is now confined to a small part of the film. Using our small-angle approximation 
6=w/f, the amount of light contained within the same angular width 6 is now striking a piece 
of the film whose linear dimensions are smaller by the ratio 28/50. Area depends on the square 
of the linear dimensions, so all other things being equal, the film would now be overexposed by 
a factor of (50/28)? = 3.2. To compensate, we need to shorten the exposure by a factor of 3.2. 


Chapter Appendix 4: Hints and Solutions 


Appendix 5: Useful Data 


.0.7 Notation and terminology, compared with other books 


Almost all the notation and terminology in Simple Nature is standard, but there are some 
cases where there is no universal standard, and a very few cases where I’ve intentionally deviated 
from a universal standard. The notation used by physicists is also different from that used by 
electrical and mechanical engineers; I use physics terminology and notation (notably /—1 = 
i, not j, and “torque” rather than “moment”), but employ the SI system of units used in 
engineering, rather than the cgs units favored by some physicists. 


Nonstandard terminology: 


Potential energy is referred to in this book as interaction energy, or according to its type: 
gravitational energy, electrical energy, etc. 


The potential, in an electrical context, is referred to as voltage, e.g. I say that V = kq/r is 
the voltage surrounding a point charge. 


Heat and thermal energy are both referred to as heat. This is in keeping with casual usage 
among scientists, but formal written usage dictates the use of “thermal energy” to mean 
the kinetic energy an object has because of its molecules’ random motion, while “heat” is 
the transfer of thermal energy. 


Notation for which there is no universal standard: 


Kinetic energy is written K. Standard notation is K, T, or KE. 
Interaction energy is written U. Standard notation is U, V, or PE. 
The unit vectors are x,y,z. Standard notation is either X, y,z or i,j, k. 


Distance from an axis in cylindrical coordinates is R. A more common notation in math 
books is p, but this would conflict with the standard physics notation for the charge 
density. 


Vibrations do not have very well standardized terminology or notation. I use “frequency” to 
refer to both f and w, depending on the context to make it clear which is meant. The 
frequency of free, damped oscillations is wr, which is only approximately the same as 
Wo = \V/k/m. The full width at half-maximum of the resonance peak (on a plot of energy 
versus frequency) is Aw. 


The coupling constants for electricity and magnetism are written as k and k/c?. This is 
standard notation, but it would be more common in SI calculations to see everything 
expressed in terms of €) = 1/4rk and pio = 47k/c?. Numerically, we have k = 8.99 x 
10° N-m?/C? and k/c? = 10-" N/A?, the latter being an exact relation. 
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.0.8 Notation and units 


quantity unit symbol 

distance meter, m x, Ax 

time second, s t, At 

mass kilogram, kg m 

density kg/m? p 

force newton, 1 N=1 kg-m/s? | F 

velocity m/s v 

acceleration m/s? a 

gravitational field J/kg-m or m/s? g 

energy joule, J E (also electric field) 
momentum kg-m/s p 

angular momentum kg-m?/s or J-s L (also inductance) 
power watt, 1 W=1J/s P (also pressure) 
pressure 1 Pa=1 N/m? P (also power) 
temperature K T (also period) 
period S T (also temperature) 
wavelength m » 

frequency s! or Hz f 

charge coulomb, C q 

voltage volt, 1 V=1J/C V 

current ampere, 1 A = 1 C/s I 

resistance ohm, 1Q=1V/A R 

capacitance farad, 1 F =1C/V C 

inductance henry, 1 H=1V-s/A L (also angular momentum) 
electric field V/m or N/C E (also energy) 
magnetic field tesla, 1 T=1N-s/C-m | B 

focal length m f 

magnification unitless M 

index of refraction unitless n 

electron wavefunction | m~°/? w 


.0.9 Fundamental constants 


gravitational constant 
Boltzmann constant 
Coulomb constant 
quantum of charge 
speed of light 


Planck’s constant 


G = 6.67 x 10°" J-m/kg? 
k = 1.38 x 10-73 J/K 

k = 8.99 x 10° J-m/C? or 
= 160 < 10-4 -C 

c = 3.00 x 10° m/s 

h = 6.63 x 10-4 J-s 


N-m?/C? 


Note the use of the same notation, k, for both the Boltzmann constant and the Coulomb 


constant. 
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0.10 Me 


tric prefixes 


M- mega- | 10° 
k- kilo- | 10° 
m- milli- | 107% 
u- (Greek mu) | micro- | 107° 
n- nano- | 107° 
p- pico- Lo=* 
f- femto- | 1071° 


Note that the exponents go in steps of three. The exception is centi-, 1077, which is used only 
in the centimeter, and this doesn’t require memorization, because a cent is 10~? dollars. 


.0.11 No 


nmetric units 


Nonmetric units in terms of metric ones: 


1 inch = 25.4 mm (by definition) 
1 pound (Ib) = 4.5 newtons of force 

1 scientific calorie = 4.18 J 

1 nutritional calorie | = 4.18 x 10° J 

1 gallon = 3.78 x 10° cm? 

1 horsepower = 746 W 


The pound is a unit of force, so it converts to newtons, not kilograms. A one-kilogram mass at 
the earth’s surface experiences a gravitational force of (1 kg)(9.8 m/s?) = 9.8 N = 2.2 lb. The 
nutritional information on food packaging typically gives energies in units of calories, but those 


so-called calories are really kilocalories. 


Relationships among U.S. units: 


1 foot (ft) = 12 inches 

1 yard (yd) | = 3 feet 

1 mile (mi) | = 5280 feet 

1 ounce (oz) | = 1/16 pound 


.0.12 The Greek alphabet 
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alpha I 
beta 
gamma 
delta 
epsilon 
zeta 
eta 


theta 
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iota 
kappa 
lambda 
mu 

nu 

xl 
omicron 
pi 


rho 
sigma 
tau 
upsilon 
phi 

chi 

psi 
omega 
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.0.13 Subatomic particles 


particle | mass (kg) charge | radius (fm) 
electron | 9.109 x 10-*! | —e < 0.01 
proton 1.673 x 107?" | +e ~ 1.1 
neutron | 1.675 x 10-7" | 0 ~ 11 
neutrino | ~ 107°9 kg ? | 0 ? 


The radii of protons and neutrons can only be given approximately, since they have fuzzy 
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surfaces. For comparison, a typical atom is about a million fm in radius. 


.0.14 Earth, moon, and sun 


body | mass (kg) | radius (km) | radius of orbit (km) 
earth | 5.97 x 1074 | 6.4 x 10° 1.49 x 108 

moon | 7.35 x 102% | 1.7 x 10° 3.84 x 10° 

sun 1.99 x 10®° | 7.0 x 10° — 


.0.15 The periodic table 


1 2 
H He 
3 4 5 6 71/8 | 9 | 10 
Li | Be B!|!C;|N]O]|F |Ne 
11 12 13 14 15 16 17 18 
Na|Mg Al| Si} P| S| Cl] Ar 
19 | 20 | 21 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29] 30 | 31 | 32 | 33 | 34 | 35 | 36 
K |Ca| Sc] | Ti | V | Cr) Mn} Fe} Co} Ni | Cu; Zn} Ga} Ge} As | Se} Br | Kr 
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 
Rb| Sr} Y | | Zr | No|Mo} Te} Ru} Rh} Pd} Ag} Cd] In | Sn| Sb} Te} | | Xe 
55 | 56 | 57 | | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 
Cs| Ba} La |*| Hf | Ta) W | Re} Os] Ir | Pt} Au] Hg) Tl | Po} Bi | Po} At | Rn 
87 | 88 | 89 104 | 105 | 106 | 107 | 108 | 109 | 110 | 111 | 112 | 113 | 114 ] 115 | 116 | 117 | 118 
Fr | Ra] Ac ws Rf | Ha 


58 | 59 | 60 | 61 62 | 63 | 64 | 65 | 66 | 67 | 68 |_69 | 70 | 71 
*|Ce| Pr | Nd}/Pm|Sm]| Eu}Gd| Tb | Dy} Ho/Er |Tm) Yb} Lu 
90 | 91 92 | 93 | 94 | 95 | 96 | 97 | 98 | 99 | 100 | 101 | 102 | 103 
**|Th | Pa) U |No}|Pu | AmjC k | Cf | Es | Fm|Md| No} Lr 


.0.16 Atomic masses 


These atomic masses are given in atomic mass units (u), where by definition the mass of 
an atom of the isotope carbon-12 equals 12 u. One atomic mass unit is the same as about 
1.66 x 107?" kg. Data are only given for naturally occurring elements. 


Ag 107.9] Eu 152.0 | Mo 95.9 Sc 45.0 
Al 27.0 F 19.0 N 14.0 Se 79.0 
Ar 39.9 Fe 55.8 Na 23.0 Si 28.1 
As 74.9 Ga 69.7 Nb 92.9 Sn 118.7 
Au 197.0 | Gd 157.2 | Nd 144.2 | Sr 87.6 
B 10.8 Ge 72.6 Ne 20.2 Ta 180.9 
Ba 137.3 | H 1.0 Ni 58.7 Tb = 158.9 
Be 9.0 He 4.0 O 16.0 Te 127.6 
Bi 209.0 | Hf 178.5 | Os 190.2 | Ti 47.9 
Br 79.9 Hg 200.6 | P 31.0 Tl 204.4 
Cc 12.0 Ho 164.9] Pb 207.2 | Tm 168.9 
Ca 40.1 In 114.8 | Pd 106.4 | U 238 
Ce 140.1 | Ir 192.2 | Pt 195.1 | V 50.9 
Cl 35.5 K 39.1 Pr 140.9 | W 183.8 
Co 58.9 Kr 83.8 Rb 85.5 Xe 131.3 
Cr 52.0 La 138.9 | Re 186.2 | Y 88.9 
Cs 132.9 | Li 6.9 Rh 102.9 | Yb 173.0 
Cu 63.5 Lu 175.0] Ru 101.1 | Zn 65.4 
Dy 162.5 | Mg 24.3 S 32.1 Zr 91.2 
Er 167.3 | Mn 54.9 Sb 121.8 
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Appendix 6: Summary 


Notation and units are summarized on page 886. 


Chapter 0, Introduction and Review, page 13 

Physics is the use of the scientific method to study the behavior of light and matter. The 
scientific method requires a cycle of theory and experiment, theories with both predictive and 
explanatory value, and reproducible experiments. 


The metric system is a simple, consistent framework for measurement built out of the meter, 
the kilogram, and the second plus a set of prefixes denoting powers of ten. The most systematic 
method for doing conversions is shown in the following example: 


1 
370 ms x u = 0.378 


Mass is a measure of the amount of a substance. Mass can be defined gravitationally, by 
comparing an object to a standard mass on a double-pan balance, or in terms of inertia, by 
comparing the effect of a force on an object to the effect of the same force on a standard mass. 
The two definitions are found experimentally to be proportional to each other to a high degree 
of precision, so we usually refer simply to “mass,” without bothering to specify which type. 


A force is that which can change the motion of an object. The metric unit of force is the 
Newton, defined as the force required to accelerate a standard 1-kg mass from rest to a speed 
of 1 m/s in 1s. 


Scientific notation means, for example, writing 3.2 x 10° rather than 320000. 


Writing numbers with the correct number of significant figures correctly communicates how 
accurate they are. As a rule of thumb, the final result of a calculation is no more accurate than, 
and should have no more significant figures than, the least accurate piece of data. 


Nature behaves differently on large and small scales. Galileo showed that this results fun- 
damentally from the way area and volume scale. Area scales as the second power of length, 
A «x L?, while volume scales as length to the third power, V « L°. 


An order of magnitude estimate is one in which we do not attempt or expect an exact answer. 
The main reason why the uninitiated have trouble with order-of-magnitude estimates is that 
the human brain does not intuitively make accurate estimates of area and volume. Estimates of 
area and volume should be approached by first estimating linear dimensions, which one’s brain 
has a feel for. 


Velocity, dx/dt, measures how fast an object is moving. Acceleration, d?2/dt?, measures 
how quickly its velocity is changing. For motion with constant acceleration, we have these useful 
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relations: 
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UF =v? + 2aAr 


Chapter 1, Conservation of Mass, page 55 
Conservation laws are the foundation of physics. A conservation law states that a certain 
quantity can be neither created nor destroyed; the total amount of it remains the same. 


Mass is a conserved quantity in classical physics, i.e. physics before Einstein. This is 
plausible, since we know that matter is composed of subatomic particles; if the particles are 
neither created or destroyed, then it makes sense that the total mass will remain the same. 
There are two ways of defining mass. 


Gravitational mass is defined by measuring the effect of gravity on a particular object, and 
comparing with some standard object, taking care to test both objects at a location where the 
strength of gravity is the same. 


Inertial mass is defined by measuring how much a particular object resists a change in its 
state of motion. For instance, an object placed on the end of a spring will oscillate if the spring 
is initially compressed, and a more massive object will take longer to complete one oscillation. 


Inertial and gravitational mass are equivalent: experiments show to a very high degree of 
precision that any two objects with the same inertial mass have the same gravitational mass as 
well. 


The definition of inertial mass depends on a correct but counterintuitive assumption: that 
an object resists a change in its state of motion. Most people intuitively believe that motion 
has a natural tendency to slow down. This cannot be correct as a general statement, because 
“to slow down” is not a well-defined concept unless we specify what we are measuring motion 
relative to. This insight is credited to Galileo, and the general principle of Galilean relativity 
states that the laws of physics are the same in all inertial frames of reference. In other words, 
there is no way to distinguish a moving frame of reference from one that is at rest. To establish 
which frames of reference are inertial, we first must find one inertial frame in which objects 
appear to obey Galilean relativity. The surface of the earth is an inertial frame to a reasonably 
good approximation, and the frame of reference of the stars is an even better one. Once we have 
found one inertial frame of reference, any other frame is inertial which is moving in a straight 
line at constant velocity relative to the first one. For instance, if the surface of the earth is an 
approximately inertial frame, then a train traveling in a straight line at constant speed is also 
approximately an inertial frame. 


The unit of mass is the kilogram, which, along with the meter and the second, forms the 
basis for the SI system of units (also known as the mks system). A fundamental skill in science 
is to know the definitions of the most common metric prefixes, which are summarized on page 
887, and to be able to convert among them. 


One consequence of Einstein’s theory of special relativity is that mass can be converted to 
energy and energy to mass. This prediction has been verified amply by experiment. Thus the 
conserved quantity is not really mass but rather the total “mass-energy,” m+ E/c?, where c is 
the speed of light. Since the speed of light is a large number, the E'/c? term is ordinarily small 
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in everyday life, which is why we can usually neglect it. 


Chapter 2, Conservation of Energy, page 73 

We observe that certain processes are physically impossible. For example, there is no process 
that can heat up an object without using up fuel or having some other side effect such as cooling 
a different object. We find that we can neatly separate the possible processes from the impossible 
by defining a single numerical quantity, called energy, which is conserved. Energy comes in many 
forms, such as heat, motion, sound, light, the energy required to melt a solid, and gravitational 
energy (e.g. the energy that depends on the distance between a rock and the earth). Because 
it has so many forms, we can arbitrarily choose one form, heat, in order to define a standard 
unit for our numerical scale of energy. Energy is measured in units of joules (J), and one joule 
can be defined as the amount of energy required in order to raise the termperature of a certain 
amount of water by a certain number of degrees. (The numbers are not worth memorizing.) 
Power is defined as the rate of change of energy P = dE/dt, and the unit of power is the watt, 
1W=14J/s. 


Once we have defined one type of energy numerically, we can perform experiments that es- 
tablish the mathematical rules governing other types of energy. For example, in his paddlewheel 
experiment, James Joule allowed weights to drop through a certain height and spin paddlewheels 
inside sealed canisters of water, thereby heating the water through friction. Since in this book 
we define the joule unit in terms of the temperature of water, we can think of the paddlewheel 
experiment as establishing a rule for the gravitational energy of a mass which is at a certain 
height, 

dU, = mg dy ‘ 


where dU, is the infinitesimal change in the gravitational energy of a mass m when its height 
is changed by an infinitesimal amount dy in the vertical direction. The quantity g is called 
the gravitational field, and at the earth’s surface it has a numerical value of about 10 J/kg-m. 
That is, about 10 joules of energy are required in order to raise a one-kilogram mass by one 
meter. (The gravitational field g also has the interpretation that when we drop an object, its 
acceleration, d?y/dt?, is equal to g.) 


Using similar techniques, we find that the energy of a moving object, called its kinetic energy, 
is given by 
1 
K= 5m ; 
where m is its mass and v its velocity. The proportionality factor equals 1/2 exactly by the 
design of the SI system of units, and since the SI is based on the meter, the kilogram, and the 
second, the joule is considered to be a derived unit, 1 J = 1 kg-m?/s?. 


When the interaction energy U has a local maximum or minimum with respect to the position 
of an object (dU/dx = 0), then the object is in equilibrium at that position. For example, if 
a weight is hanging from a rope, and is initially at rest at the bottom, then it must remain at 
rest, because this is a position of minimum gravitational energy U,; to move, it would have to 
increase both its kinetic and its gravitational energy, which would violate conservation of energy, 
since the total energy would increase. 


Since kinetic energy is independent of the direction of motion, conservation of energy is 
often insufficient to predict the direction of an object’s motion. However, many of the physically 
impossible motions can be ruled out by the trick of imposing conservation of energy in some other 
frame of reference. By this device, we can solve the important problem of projectile motion: even 
if the projectile has horizontal motion, we can imagine ourselves in a frame of reference in which 
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we are moving along with the projectile horizontally. In this frame of reference, the projectile 
has no horizontal motion, and its vertical motion has constant acceleration g. Switching back 
to a frame of reference in which its horizontal velocity is not zero, we find that a projectile’s 
horizontal and vertical motions are independent, and that the horizontal motion is at constant 
velocity. 


Even in one-dimensional motion, it is seldom possible to solve real-world problems and 
predict the motion of an object in closed form. However, there are straightforward numerical 
techniques for solving such problems. 


From observations of the motion of the planets, we infer that the gravitational interaction 
between any two objects is given by U, = —Gm,mg2/r, where r is the distance between them. 
When the sizes of the objects are not small compared to their separation, the definition of r 
becomes vague; for this reason, we should interpret this fundamentally as the law governing the 
gravitational interactions between individual atoms. However, in the special case of a spherically 
symmetric mass distribution, there is a shortcut: the shell theorem states that the gravitational 
interaction between a spherically symmetric shell of mass and a particle on the outside of the 
shell is the same as if the shell’s mass had all been concentrated at its center. An astronomical 
body like the earth can be broken down into concentric shells of mass, and so its gravitational 
interactions with external objects can also be calculated simply by using the center-to-center 
distance. 


Energy appears to come in a bewildering variety of forms, but matter is made of atoms, and 
thus if we restrict ourselves to the study of mechanical systems (containing material objects, 
not light), all the forms of energy we observe must be explainable in terms of the behabior and 
interactions of atoms. Indeed, at the atomic level the picture is much simpler. Fundamentally, 
all the familiar forms of mechanical energy arise from either the kinetic energy of atoms or the 
energy they have because they interact with each other via gravitational or electrical forces. For 
example, when we stretch a spring, we distort the latticework of atoms in the metal, and this 
change in the interatomic distances involves an increase in the atoms’ electrical energies. 


An equilibrium is a local minimum of U(x), and up close, any minimum looks like a parabola. 
Therefore, small oscillations around an equilibrium exhibit universal behavior, which depends 
only on the object’s mass, m, and on the tightness of curvature of the minimum, parametrized 
by the quantity k = d?U/dx?. The oscillations are sinusoidal as a function of time, and the 
period is T = 27\/m/k, independent of amplitude. When oscillations are small enough for these 
statements to be good approximations, we refer to them the oscillations as simple harmonic. 


Chapter 3, Conservation of Momentum, page 129 

Since the kinetic energy of a material object depends on v?, it isn’t obvious that conservation 
of energy is consistent with Galilean relativity. Even if a certain mechanical system conserves 
energy in one frame of reference, the velocities involved will be different as measured in another 
frame, and therefore so will the kinetic energies. It turns out that consistency is achieved only 
if there is a new conservation law, conservation of momentum, 


p=mv 


In one dimension, the direction of motion is described using positive and negative signs of 
the velocity v, and since mass is always positive, the momentum carries the same sign. Thus 
conservation of momentum, unlike conservation of energy, makes direct predictions about the 
direction of motion. Although this line of argument was based on the assumption of a mechanical 
system, momentum need not be mechanical. Light has momentum. 
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A moving object’s momentum equals the sum of the momenta of all its atoms. To avoid 
having to carry out this sum, we can use the concept of the center of mass. The center of mass 
can be defined as a kind of weighted average of the positions of all the atoms in the object, 


dL M;X; 
cm. — 
mM; 


and although the definition does involve a sum, we can often locate the center of mass by 
symmetry or by physically determining an object’s balance point. The total momentum of the 
object is then given by 

Ptotal = Mtotal Vem 


The rate of transfer of momentum is called force, F = dp/dt, and is measured in units of 
newtons, 1 N = 1 kg-m/s?. As a direct. consequence of conservation of momentum, we have the 
following statements, known as Newton’s laws of motion: 


If the total force on an object is zero, it remains in the same state of motion. 
F = dp/dt 


Forces always come in pairs: if object A exerts a force on object B, then object B exerts 
a force on object A which is the same strength, but in the opposite direction. 


Although the fundamental forces at the atomic level are gravity, electromagnetism, and 
nuclear forces, we use a different and more practical classification scheme in everyday situations. 
In this scheme, the forces between solid objects are described as follows: 

A normal force, F;,, is perpendicular to the surface of contact, and prevents 
objects from passing through each other by becoming 
as strong as necessary (up to the point where the ob- 
jects break). “Normal” means perpendicular. 

Static friction, Fs, is parallel to the surface of contact, and prevents the 
surfaces from starting to slip by becoming as strong as 
necessary, up to a maximum value of F's maz. “Static” 
means not moving, i.e. not slipping. 

Kinetic friction, Fy, is parallel to the surface of contact, and tends to slow 
down any slippage once it starts. “Kinetic” means 
moving, i.e. slipping. 


Work is defined as the transfer of energy by a force. (“By a force” is meant to exclude energy 
transfer by heat conduction.) The work theorem states that when a force occurs at a single point 
of contact, the amount of energy transferred by that force is given by dW = F'- dx, where dx is 
the distance traveled by the point of contact. The kinetic energy theorem is dKem = Ftotat:dXcm; 
where dK m is the change in the energy, (1/2)mv2,,, an object possesses due to the motion of 


its center of mass, Fio¢ai is the total force acting on the object, and dx. is the distance traveled 
by the center of mass. 


The relationship between force and interaction energy is U = —dF'/dz. Any interaction can 
be described either by giving the force as a function of distance or the interaction energy as a 
function of distance; the other quantity can then be found by integration or differentiation. 


An oscillator subject to friction will, if left to itself, suffer a gradual decrease in the amplitude 
of its motion as mechanical energy is transformed into heat. The quality factor, Q, is defined as 
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the number of oscillations required for the mechanical energy to fall off by a factor of e2?7 & 535. 
To maintain an oscillation indefinitely, an external force must do work to replace this energy. 
We assume for mathematical simplicity that the external force varies sinusoidally with time, 
F = F,,sinwt. If this force is applied for a long time, the motion approaches a steady state, 
in which the oscillator’s motion is sinusoidal, matching the driving force in frequency but not 
in phase. The amplitude of this steady-state motion, A, exhibits the phenomenon of resonance: 
the amplitude is maximized at a driving frequency which, for large Q, is essentially the same 
as the natural frequency of the free vibrations, wy (and for large Q this is also nearly the same 
as Wo = \/k/m). When the energy of the steady-state oscillations is graphed as a function of 
frequency, both the height and the width of the resonance peak depend on Q. The peak is taller 
for greater Q, and its full width at half-maximum is Aw * w,/Q. For small values of Q, all 
these approximations become worse, and at Q < 1/2 qualitatively different behavior sets in. 


For three-dimensional motion, a moving object’s motion can be described by three different 
velocities, v; = da/dt, and similarly for v, and v,. Thus conservation of momentum becomes 
three different conservation laws: conservation of py = mv,z, and so on. The principle of 
rotational invariance says that the laws of physics are the same regardless of how we change 
the orientation of our laboratory: there is no preferred direction in space. As a consequence of 
this, no matter how we choose our x, y, and z coordinate axes, we will still have conservation of 
Pz, Py, and p,. To simplify notation, we define a momentum vector, p, which is a single symbol 
that stands for all the momentum information contained in the components p,, py, and p,. The 
concept of a vector is more general than its application to the momentum: any quantity that 
has a direction in space is considered a vector, as opposed to a scalar like time or temperature. 
The following table summarizes some vector operations. 
operation definition 


|vector| vector? + vector? + vector? 


vector + vector | Add component by component. 
vector — vector | Subtract component by component. 
vector - scalar | Multiply each component by the scalar. 
vector / scalar | Divide each component by the scalar. 
Differentiation and integration of vectors is defined component by component. 


There is only one meaningful (rotationally invariant) way of defining a multiplication of 
vectors whose result is a scalar, and it is known as the vector dot product: 


b- c= byCy + bycy + bzez 
= |b| |c| cos A 


The dot product has most of the usual properties associated with multiplication, except that 
there is no “dot division.” 


Chapter 4, Conservation of Angular Momentum, page 243 
Angular momentum is a conserved quantity. For motion confined to a plane, the angular mo- 
mentum of a material particle is 

L=mv r ; 


where r is the particle’s distance from the point chosen as the axis, and v, is the component 
of its velocity vector that is perpendicular to the line connecting the particle to the axis. The 
choice of axis is arbitrary. In a plane, only two directions of rotation are possible, clockwise and 
counterclockwise. One of these is considered negative and the other positive. Geometrically, 
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angular momentum is related to rate at which area is swept out by the line segment connecting 
the particle to the axis. 


Torque is the rate of change of angular momentum, tT = dL/dt. The torque created by a 
given force can be calculated using any of the relations 
7=rF sind,p 
=rF, 
= rik ’ 
where the subscript | indicates a component perpendicular to the line connecting the axis to 


the point of application of the force. 


In the special case of a rigid body rotating in a single plane, we define 


dé 
w= — [angular velocity] 
dt 
and 
dw ‘ 
errs ; [angular acceleration] 
in terms of which we have 
DL= Iw 
and 
T=Ta ; 


where the moment of inertia, I, is defined as 


2 
T= ) MT; : 


summing over all the atoms in the object (or using calculus to perform a continuous sum, i.e. 
an integral). The relationship between the angular quantities and the linear ones is 


Ut = wr tangential velocity of a point 
Up = 0 [radial velocity of a point 
Qt = ar ; radial acceleration of a point 


at a distance r from the axis] 


Ap = Wr radial acceleration of a point 


at a distance r from the axis] 


In three dimensions, torque and angular momentum are vectors, and are expressed in terms of 
the vector cross product, which is the only rotationally invariant way of defining a multiplication 
of two vectors that produces a third vector: 

L=rxp 


T=rxF 
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In general, the cross product of vectors b and c has magnitude 
|b x c| = |b] |e] sin ™,, . 


which can be interpreted geometrically as the area of the parallelogram formed by the two 
vectors when they are placed tail-to-tail. The direction of the cross product lies along the line 
which is perpendicular to both vectors; of the two such directions, we choose the one that is 
right-handed, in the sense that if we point the fingers of the flattened right hand along b, then 
bend the knuckles to point the fingers along c, the thumb gives the direction of b x c. In terms 
of components, the cross product is 


(BX chy = be = Gy be 
(bX), = be, — czby 
(b x c), = bgly — Crby 


The cross product has the disconcerting properties 


axb=-bxa [noncommutative] 
and 


ax (bxc)#(axb)xc [(nonassociative] : 


and there is no “cross-division.” 


For rigid-body rotation in three dimensions, we define an angular velocity vector w, which 
lies along the axis of rotation and bears a right-hand relationship to it. Except in special cases, 
there is no scalar moment of inertia for which L = Jw; the moment of inertia must be expressed 
as a matrix. 


Chapter 5, Thermodynamics, page 295 

A fluid is any gas or liquid, but not a solid; fluids do not exhibit shear forces. A fluid in equilib- 
rium exerts a force on any surface which is proportional to the surface’s area and perpendicular 
to the surface. We can therefore define a quantity called the pressure, P, which is ratio of force 


to area, 
Py 
P=— ; 
A 


where the subscript | indicates the component of the fluid’s force which is perpendicular to the 
surface. 


Usually it is only the difference in pressure between the two sides of a surface that is physically 
significant. Pressure doesn’t just “press down” on things; air pressure upward under your chin 
is the same as air pressure downward on your shoulders. In a fluid acted on by gravity, pressure 
varies with depth according to the equation 


dP = —pg- dy. 


This equation is only valid if the fluid is in equilibrium, and if g and r are constant with respect 
to height. 


Temperature can be defined according to the volume of an ideal gas under conditions of 
standard pressure. The Kelvin scale of temperature used throughout this book equals zero at 
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absolute zero, the temperature at which all random molecular motion ceases, and equals 273 K 
at the freezing point of water. We can get away with using the Celsius scale as long as we are 
only interested in temperature differences; a difference of 1 degree C is the same as a difference 
of 1 degree K. 


It is an observed fact that ideal gases obey the ideal gas law, 
PV =nkT ; 


and this equation can be explained by the kinetic theory of heat, which states that the gas 
experts pressure on its container because its molecules are constantly in motion. In the kinetic 
theory of heat, the temperature of a gas is proportional to the average energy per molecule. 


Not all the heat energy in an object can be extracted to do mechanical work. We therefore 
describe heat as a lower grade of energy than other forms of energy. Entropy is a measure of 
how much of a system’s energy is inaccessible to being extracted, even by the most efficient heat 
engine; a high entropy corresponds to a low grade of energy. The change in a system’s entropy 
when heat Q is deposited into it is 


_Q 
AS = 7 


The efficiency of any heat engine is defined as 


energy we get in useful form 


efficiency = 
. energy we pay for 


and the efficiency of a Carnot engine, the most efficient of all, is 


Ly 

ffici =1-— 

efficiency Ti 
These results are all closely related. For instance, example 11 on page 309 uses AS = Q/T and 
efficiency = 1—T,,/Ty to show that a Carnot engine doesn’t change the entropy of the universe. 


Fundamentally, entropy is defined as the being proportional to the natural logarithm of 
the number of states available to a system, and the above equation then serves as a definition 
of temperature. The entropy of a closed system always increases; this is the second law of 
thermodynamics. 


Chapter 6, Waves, page 335 
Wave motion differs in three important ways from the motion of material objects: 


Waves obey the principle of superposition. When two waves collide, they simply add 
together. 


The medium is not transported along with the wave. The motion of any given point in 
the medium is a vibration about its equilibrium location, not a steady forward motion. 


The velocity of a wave depends on the medium, not on the amount of energy in the wave. 
(For some types of waves, notably water waves, the velocity may also depend on the shape 
of the wave.) 
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Sound waves consist of increases and decreases (typically very small ones) in the density of 
the air. Light is a wave, but it is a vibration of electric and magnetic fields, not of any physical 
medium. Light can travel through a vacuum. 


A periodic wave is one that creates a periodic motion in a receiver as it passes it. Such 
a wave has a well-defined period and frequency, and it will also have a wavelength, which is 
the distance in space between repetitions of the wave pattern. The velocity, frequency, and 
wavelength of a periodic wave are related by the equation 


v= fa 


A wave emitted by a moving source will undergo a Doppler shift in wavelength and frequency. 

The shifted wavelength is given by the equation 
N=(1-=)a, 
v 

where v is the velocity of the waves and uw is the velocity of the source, taken to be positive 
or negative so as to produce a Doppler-lengthened wavelength if the source is receding and a 
Doppler-shortened one if it approaches. A similar shift occurs if the observer is moving, and 
in general the Doppler shift depends approximately only on the relative motion of the source 
and observer if their velocities are both small compared to the waves’ velocity. (This is not just 
approximately but exactly true for light waves, and this fact forms the basis of Einstein’s theory 
of relativity.) 


Whenever a wave encounters the boundary between two media in which its speeds are 
different, part of the wave is reflected and part is transmitted. The reflection is always reversed 
front-to-back, but may also be inverted in amplitude. Whether the reflection is inverted depends 
on how the wave speeds in the two media compare, e.g. a wave on a string is uninverted when it 
is reflected back into a segment of string where its speed is lower. The greater the difference in 
wave speed between the two media, the greater the fraction of the wave energy that is reflected. 
Surprisingly, a wave in a dense material like wood will be strongly reflected back into the wood 
at a wood-air boundary. 


A one-dimensional wave confined by highly reflective boundaries on two sides will display 
motion which is periodic. For example, if both reflections are inverting, the wave will have a 
period equal to twice the time required to traverse the region, or to that time divided by an 
integer. An important special case is a sinusoidal wave; in this case, the wave forms a stationary 
pattern composed of a superposition of sine waves moving in opposite direction. 


Chapter 7, Relativity, page 377 

Experiments show that space and time do not have the properties claimed by Galileo and 
Newton. Time and space as seen by one observer are distorted compared to another observer’s 
perceptions if they are moving relative to each other. This distortion is quantified by the factor 


where v is the relative velocity of the two observers, and c is a universal velocity that is the same 
in all frames of reference. Light travels at c. A clock appears to run fastest to an observer who 
is not in motion relative to it, and appears to run too slowly by a factor of 7 to an observer who 
has a velocity v relative to the clock. Similarly, a meter-stick appears longest to an observer 
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who sees it at rest, and appears shorter to other observers. Time and space are relative, not 
absolute. In particular, there is no well-defined concept of simultaneity. 


All of these strange effects, however, are very small when the relative velocities are small 
compared to c. This makes sense, because Newton’s laws have already been thoroughly tested 
by experiments at such speeds, so a new theory like relativity must agree with the old one in 
their realm of common applicability. This requirement of backwards-compatibility is known as 
the correspondence principle. 


Relativity has implications not just for time and space but also for the objects that inhabit 
time and space. The correct relativistic equation for momentum is 


p=my , 


which is similar to the classical p = mv at low velocities, where y ~ 1, but diverges from it 
more and more at velocities that approach c. Since y becomes infinite at v = c, an infinite force 
would be required in order to give a material object enough momentum to move at the speed of 
light. In other words, material objects can only move at speeds lower than c. Relativistically, 
mass and energy are not separately conserved. Mass and energy are two aspects of the same 
phenomenon, known as mass-energy, and they can be converted to one another according to the 
equation 
E=mc 


The mass-energy of a moving object is E = myc?. When an object is at rest, y = 1, and the 
mass-energy is simply the energy-equivalent of its mass, mc?. When an object is in motion, the 
excess mass-energy, in addition to the mc’, can be interpreted as its kinetic energy. 


Chapter 8, Atoms and Electromagnetism, page 409 

All the forces we encounter in everyday life boil down to two basic types: gravitational forces 
and electrical forces. A force such as friction or a “sticky force” arises from electrical forces 
between individual atoms. 


Just as we use the word mass to describe how strongly an object participates in gravitational 
forces, we use the word charge for the intensity of its electrical forces. There are two types of 
charge. Two charges of the same type repel each other, but objects whose charges are different 
attract each other. Charge is measured in units of coulombs (C). 


Mobile charged particle model: A great many phenomena are easily understood if we imagine 
matter as containing two types of charged particles, which are at least partially able to move 
around. 


Positive and negative charge: Ordinary objects that have not been specially prepared have 
both types of charge spread evenly throughout them in equal amounts. The object will then 
tend not to exert electrical forces on any other object, since any attraction due to one type of 
charge will be balanced by an equal repulsion from the other. (We say “tend not to” because 
bringing the object near an object with unbalanced amounts of charge could cause its charges 
to separate from each other, and the force would no longer cancel due to the unequal distances.) 
It therefore makes sense to describe the two types of charge using positive and negative signs, 
so that an unprepared object will have zero total charge. 


The Coulomb force law states that the magnitude of the electrical force between two charged 
particles is given by 
k\qi aa 
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Conservation of charge: An even more fundamental reason for using positive and negative 
signs for charge is that with this definition the total charge of a closed system is a conserved 
quantity. 


Quantization of charge: Millikan’s oil drop experiment showed that the total charge of an 
object could only be an integer multiple of a basic unit of charge, e. This supported the idea 
the the “flow” of electrical charge was the motion of tiny particles rather than the motion of 
some sort of mysterious electrical fluid. 


Einstein’s analysis of Brownian motion was the first definitive proof of the existence of 
atoms. Thomson’s experiments with vacuum tubes demonstrated the existence of a new type of 
microscopic particle with a very small ratio of mass to charge. Thomson correctly interpreted 
these as building blocks of matter even smaller than atoms: the first discovery of subatomic 
particles. These particles are called electrons. 


The above experimental evidence led to the first useful model of the interior structure of 
atoms, called the raisin cookie model. In the raisin cookie model, an atom consists of a relatively 
large, massive, positively charged sphere with a certain number of negatively charged electrons 
embedded in it. 


Rutherford and Marsden observed that some alpha particles from a beam striking a thin 
gold foil came back at angles up to 180 degrees. This could not be explained in the then-favored 
raisin-cookie model of the atom, and led to the adoption of the planetary model of the atom, in 
which the electrons orbit a tiny, positively-charged nucleus. Further experiments showed that 
the nucleus itself was a cluster of positively-charged protons and uncharged neutrons. 


Radioactive nuclei are those that can release energy. The most common types of radioactivity 
are alpha decay (the emission of a helium nucleus), beta decay (the transformation of a neutron 
into a proton or vice-versa), and gamma decay (the emission of a type of very-high-frequency 
light). Stars are powered by nuclear fusion reactions, in which two light nuclei collide and form 
a bigger nucleus, with the release of energy. 


Human exposure to ionizing radiation is measured in units of millirem. The typical person 
is exposed to about 100 mrem worth of natural background radiation per year. 


Chapter 9, DC Circuits, page 465 

All electrical phenomena are alike in that that arise from the presence or motion of charge. Most 
practical electrical devices are based on the motion of charge around a complete circuit, so that 
the charge can be recycled and does not hit any dead ends. The most useful measure of the flow 
of charge is current, 
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An electrical device whose job is to transform energy from one form into another, e.g. a lightbulb, 
uses power at a rate which depends both on how rapidly charge is flowing through it and on how 
much work is done on each unit of charge. The latter quantity is known as the voltage difference 
between the point where the current enters the device and the point where the current leaves it. 
Since there is a type of electrical energy associated with electrical forces, the amount of work 
they do is equal to the difference in potential energy between the two points, and we therefore 
define voltage differences directly in terms of electrical energy, 
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The rate of power dissipation is 
P=IAV 


Many important electrical phenomena can only be explained if we understand the mecha- 
nisms of current flow at the atomic level. In metals, currents are carried by electrons, in liquids 
by ions. Gases are normally poor conductors unless their atoms are subjected to such intense 
electrical forces that the atoms become ionized. 


Many substances, including all solids, respond to electrical forces in such a way that the 
flow of current between two points is proportional to the voltage difference between those points 
(assuming the voltage difference is small). Such a substance is called ohmic, and an object made 
out of an ohmic substance can be rated in terms of its resistance, 
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An important corollary is that a perfect conductor, with R = 0, must have constant voltage 
everywhere within it. 
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A schematic is a drawing of a circuit that standardizes and stylizes its features to make it 
easier to understand. Any circuit can be broken down into smaller parts. For instance, one big 
circuit may be understood as two small circuits in series, another as three circuits in parallel. 
When circuit elements are combined in parallel and in series, we have two basic rules to guide 
us in understanding how the parts function as a whole: 


The junction rule: In any circuit that is not storing or releasing charge, conservation of 
charge implies that the total current flowing out of any junction must be the same as the 
total flowing in. 


The loop rule: Assuming the standard convention for plus and minus signs, the sum of 
the voltage drops around any closed loop in a circuit must be zero. 


The simplest application of these rules is to pairs of resistors combined in series or parallel. 
In such cases, the pair of resistors acts just like a single unit with a certain resistance value, called 
their equivalent resistance. Resistances in series add to produce a larger equivalent resistance, 


R=Rh+R. , 


because the current has to fight its way through both resistances. Parallel resistors combine to 
produce an equivalent resistance that is smaller than either individual resistance, 


Ps 
BR Ry 
because the current has two different paths open to it. 


An important example of resistances in parallel and series is the use of voltmeters and 
ammeters in resistive circuits. A voltmeter acts as a large resistance in parallel with the resistor 
across which the voltage drop is being measured. The fact that its resistance is not infinite 
means that it alters the circuit it is being used to investigate, producing a lower equivalent 
resistance. An ammeter acts as a small resistance in series with the circuit through which the 
current is to be determined. Its resistance is not quite zero, which leads to an increase in the 
resistance of the circuit being tested. 
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Chapter 10, Fields, page 511 

Newton conceived of a universe where forces reached across space instantaneously, but we now 
know that there is a delay in time before a change in the configuration of mass and charge in 
one corner of the universe will make itself felt as a change in the forces experienced far away. 
We imagine the outward spread of such a change as a ripple in an invisible universe-filling field 
of force. 


As an alternative to our earlier energy-based definition, we can define the gravitational field 
at a given point as the force per unit mass exerted on objects inserted at that point, and likewise 
the electric field is defined as the force per unit charge. These fields are vectors, and the fields 
generated by multiple sources add according to the rules of vector addition. 


The relationship between the electric field and the voltage is 
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which can be notated more compactly as a gradient, 


EK=-—-VV 


Fields of force contain energy, and the density of energy is proportional to the square of the 
magnitude of the field, 
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The equation for the energy stored in the magnetic field is given explicitly in the next chapter; 
for now, we only need the fact that it behaves in the same general way as the first two equations: 
the energy density is proportional to the square of the field. In the case of static electric fields, 
we can calculate potential energy either using the previous definition in terms of mechanical 
work or by calculating the energy stored in the fields. If the fields are not static, the old method 
gives incorrect results and the new one must be used. 


Capacitance, C’, and inductance, L, are defined as 
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measured in units of farads and henries, respectively. The voltage across a capacitor or inductor 
is given by 


_4 
Vo=G 
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In the equation for the inductor, the direction of the voltage drop (plus or minus sign) is such 
that the inductor resists the change in current. Although the equation for the voltage across 
an inductor follows directly from fundamental arguments concerning the energy stored in the 
magnetic field, the result is a surprise: the voltage drop implies the existence of electric fields 
which are not created by charges. This is an induced electric field, discussed in more detail in 
the next chapter. 


A series LRC circuit exhibits oscillation, and, if driven by an external voltage, resonates. 
The Q of the circuit relates to the resistance value. For large Q, the resonant frequency is 
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Ww ey —— 
VIC 


A series RC or RL circuit exhibits exponential decay, 


£ 
qd = dW €Xp (-z5) 


I =I,exp (-F+) ; 


and the quantity RC or L/R is known as the time constant. 


or 


When driven by a sinusoidal AC voltage with amplitude V, a capacitor, resistor, or inductor 
responds with a current having amplitude 


I= 
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where the impedance, Z, is a frequency-dependent quantity having units of ohms. In a capacitor, 
the current has a phase that is 90° ahead of the voltage, while in an inductor the current is 
90° behind. We can represent these phase relationships by defining the impedances as complex 
numbers: 
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The arguments of the complex impedances are to be interpreted as phase relationships between 
the oscillating voltages and currents. The complex impedances defined in this way combine in 
series and parallel according to the same rules as resistances. 


When a voltage source is driving a load through a transmission line, the maximum power is 
delivered to the load when the impedances of the line and the load are matched. 


Gauss’ law states that for any region of space, the flux through the surface, 


b=) °E;-A; ) 
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is related by 
®=47 kdin 


to the charge enclosed within the surface. 


Chapter 11, Electromagnetism, page 601 

Relativity implies that there must be an interaction between moving charges and other moving 
charges. This magnetic interaction is in addition to the usual electrical one. The magnetic field 
can be defined in terms of the magnetic force exerted on a test charge, 


F=qvxB P 


or, alternatively, in terms of the torque on a magnetic test dipole, 
T 
|B| = 


|m;| sin 6 


where @ is the angle between the dipole vector and the field. The magnetic dipole moment m of 
a loop of current has magnitude m = JA, and is in the (right-handed) direction perpendicular 
to the loop. 


The magnetic field has no sources or sinks. Gauss’ law for magnetism is 


®p=0 


The external magnetic field of a long, straight wire is 
2kI 
B= > 
CR , 
forming a right-handed circular pattern around the wire. 


The energy of the magnetic field is 
2 
dU, = =— B’dv 
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The magnetic field resulting from a set of currents can be computed by finding a set of 
dipoles that combine to give those currents. The field of a dipole is 


km _ 
B,= <x (3cos* 6-1) r 3 


k 
Br = —F (3sind cos 0) r~3 ; 


which reduces to B, = km/c?r® in the plane perpendicular to the dipole moment. By construct- 
ing a current loop out of dipoles, one can prove the Biot-Savart law, 
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which gives the field when we integrate over a closed current loop. All of this is valid only for 
static magnetic fields. 
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Ampere’s law is another way of relating static magnetic fields to the static currents that 
created them, and it is more easily extended to nonstatic fields than is the Biot-Savart law. 
Ampére’s law states that the circulation of the magnetic field, 


Tp =) °s;-B; 5 


around the edge of a surface is related to the current [through passing through the surface, 
Atk 
T= pee Ithrough 


In the general nonstatic case, the fundamental laws of physics governing electric and magnetic 
fields are Maxwell’s equations, which state that for any closed surface, the fluxes through the 
surface are 


Op = Arkdin and 
pep =0 


For any surface that is not closed, the circulations around the edges of the surface are given by 


O® 
Tep= ara and 
O® 
eT = ae + AtkIthrough 


The most important result of Maxwell’s equations is the existence of electromagnetic waves 
which propagate at the velocity of light — that’s what light is. The waves are transverse, and 
the electric and magnetic fields are perpendicular to each other. There are no purely electric or 
purely magnetic waves; their amplitudes are always related to one another by B = E’/c. They 
propagate in the right-handed direction given by the cross product E x B, and carry momentum 
p= fe: 


A complete statement of Maxwell’s equations in the presence of electric and magnetic ma- 
terials is as follows: 


Op = free 
®p=0 
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where the auxiliary fields D and H are defined as 


D=cE and 
B 

H=— F 
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and € and yp are the permittivity and permeability of the substance. 


Chapter 12, Optics, page 687 


The ray model of light: We can understand many phenomena involving light without having 
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to use sophisticated models such as the wave model or the particle model. Instead, we simply 
describe light according to the path it takes, which we call a ray. The ray model of light is useful 
when light is interacting with material objects that are much larger than a wavelength of light. 
Since a wavelength of visible light is so short compared to the human scale of existence, the ray 
model is useful in many practical cases. 


A smooth surface produces specular reflection, in which the reflected ray exits at the same 
angle with respect to the normal as that of the incoming ray. A rough surface gives diffuse 
reflection, where a single ray of light is divided up into many weaker reflected rays going in 
many directions. 


Images: A large class of optical devices, including lenses and flat and curved mirrors, operates 
by bending light rays to form an image. A real image is one for which the rays actually cross at 
each point of the image. A virtual image, such as the one formed behind a flat mirror, is one 
for which the rays only appear to have crossed at a point on the image. A real image can be 
projected onto a screen; a virtual one cannot. 


Mirrors and lenses will generally make an image that is either smaller than or larger than the 
original object. The scaling factor is called the magnification. In many situations, the angular 
magnification is more important than the actual magnification. 


Every lens or mirror has a property called the focal length, which is defined as the distance 
from the lens or mirror to the image it forms of an object that is infinitely far away. A stronger 
lens or mirror has a shorter focal length. 


Locating images: The relationship between the locations of an object and its image formed 
by a lens or mirror can always be expressed by equations of the form 


60, = +0; 46, 
1 1 1 
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The choice of plus and minus signs depends on whether we are dealing with a lens or a mirror, 
whether the lens or mirror is converging or diverging, and whether the image is real or virtual. 
A method for determining the plus and minus signs is as follows: 


1. Use ray diagrams to decide whether 9; and 6, vary in the same way or in opposite ways. 
Based on this, decide whether the two signs in the equation are the same or opposite. If 
the signs are opposite, go on to step 2 to determine which is positive and which is negative. 


2. If the signs are opposite, we need to decide which is the positive one and which is the 
negative. Since the focal angle is never negative, the smaller angle must be the one with 
a minus sign. 


Once the correct form of the equation has been determined, the magnification can be found 


via the equation 


d; 
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This equation expresses the idea that the entire image-world is shrunk consistently in all three 
dimensions. 
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Refraction: Refraction is a change in direction that occurs when a wave encounters the 
interface between two media. Together, refraction and reflection account for the basic principles 
behind nearly all optical devices. 


Snell discovered the equation for refraction, 
ny sind, = ng sin 2 : 


[angles measured with respect to the normal] 


through experiments with light rays, long before light was proven to be a wave. Snell’s law can 
be proven based on the geometrical behavior of waves. Here n is the index of refraction. Snell 
invented this quantity to describe the refractive properties of various substances, but it was later 
found to be related to the speed of light in the substance, 


n=- : 

v 
where c is the speed of light in a vacuum. In general a material’s index of refraction is different 
for different wavelengths of light. Total internal reflection occurs when there is no angle that 
satisfies Snell’s law. 


Wave optics: Wave optics is a more general theory of light than ray optics. When light 
interacts with material objects that are much larger then one wavelength of the light, the ray 
model of light is approximately correct, but in other cases the wave model is required. 


Huygens’ principle states that, given a wavefront at one moment in time, the future behavior 
of the wave can be found by breaking the wavefront up into a large number of small, side-by-side 
wave peaks, each of which then creates a pattern of circular or spherical ripples. As these sets 
of ripples add together, the wave evolves and moves through space. Since Huygens’ principle is 
a purely geometrical construction, diffraction effects obey a simple scaling rule: the behavior is 
unchanged if the wavelength and the dimensions of the diffracting objects are both scaled up 
or down by the same factor. If we wish to predict the angles at which various features of the 
diffraction pattern radiate out, scaling requires that these angles depend only on the unitless 
ratio A/d, where d is the size of some feature of the diffracting object. 


Double-slit diffraction is easily analyzed using Huygens’ principle if the slits are narrower 
than one wavelength. We need only construct two sets of ripples, one spreading out from each 
slit. The angles of the maxima (brightest points in the bright fringes) and minima (darkest 
points in the dark fringes) are given by the equation 

A _ siné 


d om , 
where d is the center-to-center spacing of the slits, and m is an integer at a maximum or an 
integer plus 1/2 at a minimum. 


If some feature of a diffracting object is repeated, the diffraction fringes remain in the same 
places, but become narrower with each repetition. By repeating a double-slit pattern hundreds 
or thousands of times, we obtain a diffraction grating. 


A single slit can produce diffraction fringes if it is larger than one wavelength. Many practical 
instances of diffraction can be interpreted as single-slit diffraction, e.g., diffraction in telescopes. 
The main thing to realize about single-slit diffraction is that it exhibits the same kind of rela- 
tionship between A, d, and angles of fringes as in any other type of diffraction. 
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Chapter 13, Quantum Physics, page 771 

Quantum physics differs from classical physics in many ways, the most dramatic of which is 
that certain processes at the atomic level, such as radioactive decay, are random rather than 
deterministic. There is a method to the madness, however: quantum physics still rules out any 
process that violates conservation laws, and it also offers methods for calculating probabilities 
numerically. The most important of these generic methods is the law of independent probabil- 
ities, which states that if two random events are not related in any way, then the probability 
that they will both occur equals the product of the two probabilities, 


probability of A and B 
= P4Pp (if A and B are independent] 


When discussing a random variable x that can take on a continuous range of values, we cannot 
assign any finite probability to any particular value. Instead, we define the probability dis- 
tribution D(x), defined so that its integral over some range of x gives the probability of that 
range. 


In radioactive decay, the time that a radioactive atom has a 50% chance of surviving is called 
the half-life, ¢j/2. The probability of surviving for two half-lives is (1/2)(1/2) = 1/4, and so on. 
In general, the probability of surviving a time t is given by 


Pyury(t) = 0.5¢/4/2 


Related quantities such as the rate of decay and probability distribution for the time of decay 
are given by the same type of exponential function, but multiplied by certain constant factors. 


Around the turn of the twentieth century, experiments began to show problems with the 
classical wave theory of light. In any experiment sensitive enough to detect very small amounts 
of light energy, it becomes clear that light energy cannot be divided into chunks smaller than a 
certain amount. Measurements involving the photoelectric effect demonstrate that this smallest 
unit of light energy equals hf, where f is the frequency of the light and h is a number known 
as Planck’s constant. We say that light energy is quantized in units of hf, and we interpret this 
quantization as evidence that light has particle properties as well as wave properties. Particles 
of light are called photons. 


The only method of reconciling the wave and particle natures of light that has stood the test 
of experiment is the probability interpretation: the probability that the particle is at a given 
location is proportional to the square of the amplitude of the wave at that location. 


One important consequence of wave-particle duality is that we must abandon the concept 
of the path the particle takes through space. To hold on to this concept, we would have 
to contradict the well established wave nature of light, since a wave can spread out in every 
direction simultaneously. 


Light is both a particle and a wave. Matter is both a particle and a wave. The equations 
that connect the particle and wave properties are the same in all cases: 


E=hf 
p=h/nr 


Unlike the electric and magnetic fields that make up a photon-wave, the electron wavefunction 
is not directly measurable. Only the square of the wavefunction, which relates to probability, 
has direct physical significance. 
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A particle that is bound within a certain region of space is a standing wave in terms of 
quantum physics. The two equations above can then be applied to the standing wave to yield 
some important general observations about bound particles: 


1. The particle’s energy is quantized (can only have certain values). 
2. The particle has a minimum energy. 


3. The smaller the space in which the particle is confined, the higher its kinetic energy must 
be. 


These immediately resolve the difficulties that classical physics had encountered in explaining 
observations such as the discrete spectra of atoms, the fact that atoms don’t collapse by radiating 
away their energy, and the formation of chemical bonds. 


A standing wave confined to a small space must have a short wavelength, which corresponds 
to a large momentum in quantum physics. Since a standing wave consists of a superposition 
of two traveling waves moving in opposite directions, this large momentum should actually be 
interpreted as an equal mixture of two possible momenta: a large momentum to the left, or a 
large momentum to the right. Thus it is not possible for a quantum wave-particle to be confined 
to a small space without making its momentum very uncertain. In general, the Heisenberg 
uncertainty principle states that it is not possible to know the position and momentum of a 
particle simultaneously with perfect accuracy. The uncertainties in these two quantities must 
satisfy the approximate inequality 

ApAz 2h 


When an electron is subjected to electric forces, its wavelength cannot be constant. The 
“wavelength” to be used in the equation p = h/X should be thought of as the wavelength of the 
sine wave that most closely approximates the curvature of the wavefunction at a specific point. 


Infinite curvature is not physically possible, so realistic wavefunctions cannot have kinks in 
them, and cannot just cut off abruptly at the edge of a region where the particle’s energy would 
be insufficient to penetrate according to classical physics. Instead, the wavefunction “tails off” 
in the classically forbidden region, and as a consequence it is possible for particles to “tunnel” 
through regions where according to classical physics they should not be able to penetrate. If 
this quantum tunneling effect did not exist, there would be no fusion reactions to power our 
sun, because the energies of the nuclei would be insufficient to overcome the electrical repulsion 
between them. 


Hydrogen, with one proton and one electron, is the simplest atom, and more complex atoms 
can often be analyzed to a reasonably good approximation by assuming their electrons occupy 
states that have the same structure as the hydrogen atom’s. The electron in a hydrogen atom 
exchanges very little energy or angular momentum with the proton, so its energy and angular 
momentum are nearly constant, and can be used to classify its states. The energy of a hydrogen 
state depends only on its n quantum number. 


In quantum physics, the angular momentum of a particle moving in a plane is quantized in 
units of i. Atoms are three-dimensional, however, so the question naturally arises of how to 
deal with angular momentum in three dimensions. In three dimensions, angular momentum is 
a vector in the direction perpendicular to the plane of motion, such that the motion appears 
clockwise if viewed along the direction of the vector. Since angular momentum depends on 
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both position and momentum, the Heisenberg uncertainty principle limits the accuracy with 
which one can know it. The most the can be known about an angular momentum vector is its 
magnitude and one of its three vector components, both of which are quantized in units of h. 


In addition to the angular momentum that an electron carries by virtue of its motion through 
space, it possesses an intrinsic angular momentum with a magnitude of h/2. Protons and 
neutrons also have spins of h/2, while the photon has a spin equal to h. 


Particles with half-integer spin obey the Pauli exclusion principle: only one such particle 
can exist is a given state, i.e., with a given combination of quantum numbers. 


We can enumerate the lowest-energy states of hydrogen as follows: 


ee ot ee ea 8, = +1/2 or —1/2 | two states 
W=2) C20. Lu, 8, = +1/2 or —1/2 | two states 
n=2, €=1, ¢,=-1,0,orl, s,=+1/2 or —1/2 | six states 


The periodic table can be understood in terms of the filling of these states. The nonreactive 
noble gases are those atoms in which the electrons are exactly sufficient to fill all the states up to 
a given n value. The most reactive elements are those with one more electron than a noble gas 
element, which can release a great deal of energy by giving away their high-energy electron, and 
those with one electron fewer than a noble gas, which release energy by accepting an electron. 
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of a particle in two dimensions, 246 
of light, 246, 659 
quantization of, 824 
angular velocity, 263 
antielectron, 449 
antimatter, 449 
Archimedean spiral, 684 
Archimedes’ principle, 85, 202 
area 
operational definition, 34 
scaling of, 35 
Aristotle, 187 
asteroid, 188 
astrology, 15 
atom, 416 
raisin-cookie model of, 428 
atomic number 
defined, 438 
atoms, 65, 68 
helium, 837 
lithium, 837 
sodium, 838 
with many electrons, 837 
averages, 775 


rule for calculating, 775 
Avogadro’s number, 305 


Bacon, Francis, 19 
ballast, 553 
Balmer, Johann, 830 
Bernoulli, Johann, 93 
beta decay, 449 
beta particle, 434 
Big Bang, 68, 108 
and the arrow of time, 323 
evidence for, 352 
binding energy 
nuclear, 453 
Biot-Savart law, 624 
black hole, 312, 397 
Bohr 
Niels, 735 
Bohr, Niels, 830 
Boltzmann’s constant, 305 
bound states, 807 
box 
particle in a, 807 
brachistochrone, 94 
brightness of light, 693 
Brownian motion, 420 
buoyancy, 85, 202 
Bush, George, 722 


calorie 

unit, 80 
Cambridge, 794 
capacitor, 543 

capacitance, 543 

spherical, 539 
carbon-14 dating, 781 
cat 

Schrédinger’s, 813 
cathode rays, 17 
causality, 378 
Celsius (unit), 302 
Celsius scale, 74 
center of mass, 140 
center of mass frame, 144 
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centi- (metric prefix), 24 
cgs units, 860 
Chadwick, James, 138 
chain reaction, 448 
charge, 411 

quantization of, 421 
chemical bonds 


quantum explanation for hydrogen, 809 


chemical reactions, 68 
Chernobyl, 456 
choice of axis theorem, 250 
proof, 858 
circuit, 469 
complete, 469 
open, 470 
parallel, 482 
series, 482 
short, 479 
circular motion, 207 
inward force, 208 
no forward force, 208 
no outward force, 208 
circular orbit, 98 
circular orbit in a magnetic field, 610 
classical physics, 772 
closed system, 58 
CMB, 109 
coefficient of kinetic friction, 153 
coefficient of static friction, 153 
color, 728 
comet, 133 
complete circuit, 469 
complex numbers, 557 
use in quantum physics, 822 
component, 187 
conductivity, 662 
conductor 
defined, 476 
conservation of mass, 416 
conservation law, 56 
converging, 705 
conversions of units, 29 
correspondence principle, 69, 735 
defined, 377 
for mass-energy, 399 
for relativistic momentum, 395 
for time dilation, 377 
cosmic microwave background, 109 
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cosmological constant, 108 
Coulomb’s law, 413 
coupling constant, 605 
critically damped, 185 
Crookes, William, 419 
cross product, 278 
current 

defined, 467 
current density, 542, 632 
cyclotron, 673 

cyclotron frequency, 673 


damped oscillations, 171 
critically damped, 185 
overdamped 

electrical, 554 
mechanical, 184 
damping 
critical, 185 

dark energy, 108 

Darwin, 18 

Darwin, Charles, 772 

Davisson 
C.J., 800 

de Broglie 
Louis, 800 

decay 
exponential, 779 

Deep Space 1, 132 

definitions 
conceptual, 57 
operational, 57 

degree of freedom, 319 

derivative 
partial, 214 

Descartes, René, 129, 186 

Dialogues Concerning the Two New Sciences, 

36 

diamagnetism, 667 

differential mode, 665 

diffraction 
defined, 733 
double-slit, 737 
fringe, 734 
scaling of, 735 
single-slit, 742 

diffraction grating, 742 

diffuse reflection, 692 


digital camera, 787 
diopter, 714 
dipole 
electric, 518 
field of, 621 
magnetic, 606 
field of, 622 
dipole moment, 519 
dispersive waves, 349 
dissonance, 368 
divergence, 583 
DNA, 456 
Doppler effect, 350 
in relativity, 355 
dot product, 210 
double-slit diffraction, 737 
duality 
wave-particle, 793 
dyne (unit), 860 


E6tvos, Roland, 61 
Eddington 

Arthur, 772 
Einstein 

and randomness, 772 
Einstein, Albert, 771, 786 

and Brownian motion, 420 
electric current 

defined, 467 
electric dipole, 518 

field of, 621 
electric field, 517 

energy density of, 537 

related to voltage, 521 
electric forces, 411 
electrolytes, 484 
electron 

as a wave, 800 

spin of, 835 

wavefunction, 803 
electron capture, 449 
electron decay, 449 
electrostatic unit, 860 
emf, 640 
emission spectrum, 808 
Empedocles of Acragas, 688 
endoscope, 729 
energy, 73 


“free”, 73 

distinguished from force, 150 
equivalence to mass, 395 
heat, 74 

kinetic, 76 

light, 74 


quantization of for bound states, 807 


energy density 

of electric field, 537 

of gravitational field, 541 

of magnetic field, 542 
Enlightenment, 772 
entropy 

macroscopic definition, 309 

microscopic definition, 315 
equilibrium, 86 

metastable, 87 

neutral, 86 

redefined, 257 

stable, 86 

unstable, 87 
equipartition theorem, 319 
equivalent resistance 

of resistors in parallel, 488 
erg (unit), 78, 860 
escape velocity, 102 
esu (electrostatic unit), 860 
ether, 390 
Euler’s formula, 559, 560 
Euler, Leonhard, 559 
evolution, 722 

randomness in, 772 
exclusion principle, 837 
exponential decay, 779 

rate of, 782 
eye 

evolution of, 722 

human, 723 


farad 
defined, 544 
Faraday, Michael, 465 
types of electricity, 466 
Fermat’s principle, 700 
ferrite bead, 666 
ferromagnetism, 668 
Feynman 


Richard, 613 
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field Galileo Galilei, 36 


electric, 517 gamma ray, 434 
gravitational, 513, 514 gamma rays, 16 
fields of force, 511 garage paradox, 388 
flatworm, 722 gas 
fluid spectrum of, 808 
defined, 203 gas discharge tube, 553 
fluorescent light, 553 gauss (unit), 860 
flux Gauss’ law, 579 
additivity by charge, 575 differential form, 583 
additivity by region, 575 Gauss’ theorem, 574 
defined, 572 for gravity, 579 
in Gauss’ theorem, 574 proof of, 578 
focal angle, 712 Gaussian pillbox, 580 
focal length, 713 generator, 552, 641 
focal point, 713 geothermal vents, 771 
force Germer, L., 800 
analysis of forces, 156 GFI, 615 
defined, 145 goiters, 779 
distinguished from energy, 150 gradient, 214 
fields of, 511 graphical addition of vectors, 198 
normal, 152 gravitational field, 82, 513, 514 
transmission, 158 energy density of, 541 
Foucault, 65 gravitational waves, 516 
Fourier’s theorem, 350 ground fault interrupter, 615 
fourier-spectra, 368 group velocity, 805 
frame of reference, 63 
inertial, 63 half-life, 779 
Franklin, Benjamin Halley’s comet, 133 
definition of signs of charge, 413 handedness, 613 
French Revolution, 24 harmonics, 368 
frequency, 116 Hawking radiation, 312 
of waves, 347 heat, 74 
friction compared to temperature, 74 
fluid, 155 compared to thermal energy, 75 
kinetic, 152 heat capacity 
static, 152 at constant pressure, 328 
fringe at constant volume, 328 
diffraction, 734 heat engine, 295 
full width at half maximum, 779 Heisenberg 
full width at half-maximum, 179 Werner, 809 
fundamental, 368 Nazi bomb effort, 812 
fundamental theorem of algebra, 559 Heisenberg uncertainty principle, 809 
FWHM, 179, 779 in three dimensions, 826 
helium, 837 
Galileo, 689 Helmholtz resonator, 329 
Galilean relativity, 62, 190 Hertz 
inertial and gravitational mass, 61 Heinrich, 657 
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Hertz, Heinrich, 789 
Heinrich, 737 

Hiroshima, 457 

homogeneity of spacetime, 382 

Hooke, 410 

Hooke’s law, 169 

hormesis, 457 

Hubble, Edwin, 352 

Hugo, Victor, 409 

Hulse, R.A., 517 

Huygens’ principle, 736 

hydrogen atom, 826 
angular momentum in, 824 
classification of states, 823 
energies of states in, 829 
energy in, 824 
momentum in, 824 
quantum numbers, 826 

hysteresis, 669 


ideal gas law, 305 
images 
formed by curved mirrors, 705 
formed by plane mirrors, 702 
location of, 711 
of images, 707 
real, 706 
virtual, 702 
impedance, 561 
of an inductor, 563 
impedance matching, 567, 666 
incoherent light, 734 
independence 
statistical, 773 
independent probabilities 
law of, 774 
index of refraction 
defined, 725 
related to speed of light, 726 
inductance 
defined, 545 
induction, 552 
inductor, 543 
inductance, 543 
inertial frame of reference, 63 
insulator 
defined, 476 


invariance 


rotational, 189 
Io, 690 
iodine, 779 
ion drive, 132 
isotopes, 445 


Jeans 
James, 772 
joule (unit), 74 
Joule, James, 73 
paddlewheel experiment, 76 
junction rule, 487 
Jupiter, 690 


kelvin (unit), 302 
Kelvin scale, 75 
Kepler’s laws, 96 
Keynes, John Maynard, 410 
kilo- (metric prefix), 24 
kilogram, 26, 56 
standard, 57 
kinetic energy, 76 
compared to momentum, 134 
kinetic energy theorem, 165 
kinetic friction, 152 
coefficient of, 153 


Lagrange, Joseph-Louis, 55 

Laplace, 16 

Laplace, Pierre Simon de, 771 

Laplacian, 821 

Lavoisier, Pierre-André 
conservation of mass, 59 
execution, 55 

Leibniz, 94 

lens, 729 

lensmaker’s equation, 731 

light, 16 
absorption of, 691 
angular momentum of, 659 
brightness of, 693 
defined, 418 
momentum of, 133, 403, 657 
particle model of, 693 
ray model of, 693 
speed of, 689 
wave model of, 693 
waves, 346 

LIGO, 517 
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line integral, 214 

linear no-threshold, 458 

Lipkin linkage, 759 

LNT, 458 

loop rule, 492 

Lorentz transformation, 383 

Lorentz, Hendrik, 383 

LRC circuit, 570 

lumped-circuit approximation 
for capacitors, 532 


magnetic dipole, 606 

field of, 622 
magnetic field 

defined, 606, 607 

energy density of, 542 
magnetic monopoles, 611 
magnification 

angular, 708 

by a converging mirror, 705 
mass 

conservation of, 56, 416 

equivalence to energy, 395 

gravitational, 57 

inertial, 57 

quantization of, 69 
mass-energy 

conservation of, 397 

correspondence principle, 399 

of a moving particle, 399 
matter, 16 

as a wave, 799 

defined, 418 
Maxwell’s equations 

for static fields, 630 

in cgs units, 860 

in differential form, 859 
Maxwell, James Clerk, 737 


measurement in quantum physics, 813 


mechanical system, 130 
median, 779 
mega- (metric prefix), 24 
Mendeleev, Dmitri, 419 
meter (metric unit), 25 
meter (unit), 56 
metric system, 24, 56 
prefixes, 24, 887 
Michelson-Morley experiment, 390 


Index 


micro- (metric prefix), 24 
microwaves, 16 
milli- (metric prefix), 24 
Millikan, Robert, 421 
Millikan, Robert, 791 
mirror 
converging, 711 
mks units, 26 
molecules 
nonexistence in classical physics, 799 
mollusc, 723 
moment 
dipole, 519 
moment of inertia, 266 
tabulated for various shapes, 273 
momentum, 130, 131 
compared to kinetic energy, 134 
nonmechanical, 133 
of light, 133, 403, 657 
relativistic, 392 
monopoles 
magnetic, 611 
Moses, 722 
MRI scan, 610 


nano- (metric prefix), 24 
nautilus, 722 
neutral (electrically), 413 
neutral equilibrium, 86 
neutron 
discovery of, 138 
spin of, 835 
Newton 
Isaac, 771 
newton (unit), 145 
Newton, Isaac, 24, 94, 409, 707, 736 
definition of time, 27 
normal force, 152 
normalization, 775 
nuclear forces, 446, 614 
nuclear reactions, 68 
nucleus 
discovery, 436 


Ohm’s law, 476 
ohmic 

defined, 476 
op-amp, 549 
open circuit, 470 


operational amplifier (op-amp), 549 
operational definitions, 25 
orbit 
circular, 98 
order-of-magnitude estimates, 43 
oscillations, 113 
damped, 171 
overdamped 
electrical, 554 
mechanical, 184 
steady state, 175 
Otto cycle, 325 
Otto, Nikolaus, 325 
overdamped oscillations 
electrical, 554 
mechanical, 184 
ozone layer, 786 


paddlewheel experiment, 76 
parallel axis theorem, 268, 293 
parallel circuit 

defined, 482 
paramagnetism, 667 
Parmenides, 55 
partial derivative, 214, 583 
particle 

definition of, 793 
particle in a box, 807 
particle model of light, 693, 737 
pascal 

unit, 297 
pascal (unit), 203 
path of a photon undefined, 795 
Pauli exclusion principle, 18, 837 
Peaucellier linkage, 759 
Pelton waterwheel, 188 
period 

of waves, 347 
periodic table, 419, 438, 838 
permeability, 664 
permittivity, 663 
perpetual motion machine, 73 
phase velocity, 805 
photoelectric effect, 789 
photon 

Einstein’s early theory, 788 

energy of, 791 

in three dimensions, 798 


spin of, 835 
physics, 16 
pillbox 
Gaussian, 580 
pilot wave hypothesis, 795 
Planck’s constant, 791 
Planck, Max, 791 
polarization, 653 
Pope, 36 
Porro prism, 754 
positron, 398, 449 
positron decay, 449 
power, 80 
Poynting vector, 679 
praxinoscope, 703 
pressure, 296 
as a function of depth, 203 
defined, 203 
prism 
Porro, 754 
probabilities 
normalization of, 775 
probability distributions 
averages of, 778 
widths of, 778 
probability distributions, 777 
probability interpretation, 795 
protein molecules, 823 
proton 
spin of, 835 
Pythagoras, 688 


quality factor, 175 
quantization, 421 

of mass, 69 
quantum dot, 807 
quantum moat, 824 
quantum numbers, 827 
quantum physics, 772 


radar, 786 

radiation hormesis, 457 
radio, 786 

radio waves, 16 

raisin cookie model, 428 
randomness, 772 

ray diagrams, 695 

ray model of light, 693, 737 
RC circuit, 555 
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RC time constant, 555 Schroédinger’s cat, 813 


reactions scientific method, 14 
chemical, 68 sea-of-arrows representation, 515 
nuclear, 68 second (unit), 25, 56 

reductionism, 19 series circuit 

reflection defined, 482 
diffuse, 692 shell theorem, 102 
of waves, 356 short circuit 
specular, 696 defined, 479 

reflections SI units, 26, 78 
inverted and uninverted, 358 Sievert (unit), 456 

refraction sigma notation, 141 
and color, 728 significant figures, 31 
defined, 723 simple machine, 167 

relativity single-slit 
Galilean, 62, 190 diffraction, 742 

Renaissance, 13 sinks in fields, 515 

repetition of diffracting objects, 741 Sirius, 808 

resistance skin depth, 662 
defined, 476 Snell’s law, 724 
in parallel, 487 derivation of, 727 
in series, 491 mechanical model of, 726 

resistivity sodium, 838 
defined, 493 solar constant, 680 

resistor, 480 solar sail, 203 

resistors solenoid, 544 
in parallel, 488 magnetic field of, 628 

retina, 707 sound 

reversibility, 698 speed of, 371 

RHIC accelerator, 387 waves, 345 

RL circuit, 555 sources of fields, 515 

RMS (root mean square), 566 spark plug, 556 

Roemer, 690 specific heat, 74 

root mean square, 566 spectrum 

rotational invariance, 189 absorption, 808 

Russell emission, 808 
Bertrand, 772 spin, 835 

Rutherford neutron’s, 835 
Ernest, 771 of electron, 835 

photon’s, 835 

scalar proton’s, 835 
defined, 192 spin theorem, 251 

scaling, 35 proof, 859 

schematic, 486 spiral 

schematics, 486 Archimedean, 684 

Schrédinger spring constant, 114 
Erwin, 813 Squid, 723 

Schrédinger equation, 816 stability, 86 
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standing wave, 370 
standing waves, 370 
Star Trek, 808 
states 
bound, 807 
static friction, 152 
coefficient of, 153 
statvold, 860 
steady state, 175 
strong nuclear force, 446 
strong nuclear force, 446 
superposition 
of waves, 336 
Swift, Jonathan, 35 
symmetry, 613 
system 
closed, 58 


Taylor, G.I., 794 
Taylor, J.H., 517 
telescope, 707, 743 
temperature, 74, 296 
absolute zero, 75, 302 
Celsius, 302 
compared to heat, 74 
Kelvin, 302 
macroscopic definition, 302 
microscopic definition, 316 
tension, 159, 167 
Tesla 
Nikola, 177 
tesla (unit), 606 
thermal energy 
compared to heat, 75 
thermodynamics, 295 
first law of, 296, 325 
laws of 
summarized, 325 
second law of, 311, 323, 325 
third law of, 324, 325 
zeroth law of, 301, 325 
thermometer, 302 
Thomson, J.J. 
cathode ray experiments, 425 
time 
arrow of, 323 
time constant 


RC, 555 


time reversal, 698 
torque 

defined, 252 

related to force, 253, 282 
total internal reflection, 729 
transformer, 552, 641 
transmission 

of waves, 356 
transmission of forces, 158 
tunneling, 815 


ultraviolet light, 786 
uncertainty principle, 809 

in three dimensions, 826 
units 

nonmetric, 887 
units, conversion of, 29 
unstable equilibrium, 87 


vector 
addition, 193 
defined, 192 


division by a scalar, 193 
dot product, 210 
magnitude of, 193 
multiplication by a scalar, 193 
subtraction, 193 
vector addition 
analytic, 197 
graphical, 198 
vector cross product, 278 
vector product, cross, 278 


velocity 

group, 805 

phase, 805 
velocity filter, 610, 611 
vision, 688 
volt (unit) 

defined, 471 
voltage 

defined, 472 


related to electric field, 521 
volume 
operational definition, 34 
scaling of, 35 
Voyager space probe, 402 


water 
specific heat, 74 
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wave 
definition of, 793 
dispersive, 804 
energy related to amplitude, 359 
wave model of light, 693, 737 
wave-particle duality, 793 
pilot-wave interpretation of, 795 
probability interpretation of, 795 
wavefunction 
complex numbers in, 822 
of the electron, 803 
wavelength, 348 
waves 
absorption of, 356 
dispersive, 349 
frequency of, 347 
gravitational, 516 
interference, 364 
light, 346 
medium not transported with, 338 
on a string, 342 
patterns, 340 
period of, 347 
reflection of, 356 
sound, 345 
standing, 370 
superposition of, 336 
transmission of, 356 
velocity of, 339 
wavelength, 348 
weak nuclear force, 614 
weak nuclear force, 449 
whale songs, 358 
Wicked Witch of the West, 799 
Wigner, Eugene, 711 
work 
defined, 162 
work theorem, 164 


x-rays, 16 


Yarkovsky effect, 188 
Young, Thomas, 737 
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